Tutorials
TUTORIALS
List of tutorials
Permanent link to this article: http://europar2016.inria.fr/tutorials/
Accelerate your application with OpenACC Gunter Röth, NVIDIA Learn how to accelerate your C/C++/Fortran application using OpenACC to harness the massively parallel power of NVIDIA GPUs. OpenACC is a directive based approach to computing where you provide compiler hints to accelerate your code, instead of writing the accelerator code yourself. In 90 minutes, you will …
An overview of fault-tolerant techniques for HPC Yves Robert Ecole Normale Supérieure de Lyon & University of Tennessee Knoxville Yves.Robert@ens-lyon.fr, graal.ens-lyon.fr/~yrobert Resilience is a critical issue for large-scale platforms. This tutorial provides a comprehensive survey on fault-tolerant techniques for high-performance computing. It is organized along four main topics: (i) An overview of failure types (software/hardware, …
“Effective MPI Programming: concepts, advanced features, do’s and dont’s” Jesper Larsson Träff www.par.tuwien.ac.at MPI is a large and complex, but well-structured interface for message-passing based parallel programming for high-performance systems. This tutorial aims to provide an understanding of basic concepts of the interface, showing how concepts and (advanced) features of the interface can be put …
Simon Delamare, Pierre Neyron and Lucas Nussbaum This tutorial will present how to use the Grid’5000 platform to set up scientific experiments, with a focus on those related to High Performance Computing (HPC) and Cloud computing. Grid’5000 is a scientific instrument designed to support experiment-driven research in all areas of computer science related to parallel, large-scale or …
Green Big Data Processing using Hadoop Anne-Cécile Orgerie, Shadi Ibrahim Abstract Data volumes are ever growing, for a large application spectrum going from traditional database applications, scientific simulations to emerging applications including Web 2.0 and online social networks. To cope with this added weight of Big Data, we have recently witnessed a paradigm shift in …
Tuning for Data Parallelism – A practitioners approach to SIMD and AVX-512 Zakhar A. Matveev, Stephen Blair-Chappell, Laurent Duhem, Intel It is well recognised that three ingredients are essential to secure maximum CPU performance – multi-core (thread aware) parallelism; vectorization (data parallelism); and efficient use of the memory subsystem. In this tutorial we focus on vectorization, and ask the questions: How can I best vectorise my code?; What are the typical hurdles to vectorisation, and how do I overcome them?; How can I measure the effectiveness and efficiency of my vectorised code?; How can I profile the modify the memory-access patterns in my code to get best performance?; How can I be certain that my code is ready for the next generation of ISAs such as the AVX-512 architecture – even when I don’t have access to the hardware? In this tutorial we use a specially configure version of DL_MESO – a general purpose computational chemistry mesoscale simulation package – as a ‘playground’ …
Using SimGrid for Research in Large-Scale Distributed Systems Arnaud Legrand, Martin Quinson This tutorial will provide attendees with a clear understanding of current technology and best practice for exper- imenting parallel large-scale distributed computing research through simulation, and a first practical experience with the SimGrid framework. The first part of the tutorial will present and …