High Performance Parallelism Pearls Volume Two by James Reinders, Jim Jeffers

Chapter 17

Coarse-Grained OpenMP for Scalable Hybrid Parallelism

Enda O’Brien, Irish Centre for High-End Computing (ICHEC), Ireland

Abstract

This chapter illustrates the benefit of using OpenMP parallelism in a more “coarse-grained” way. This requires inserting directives at the highest possible level in source code, and using domain decomposition concepts that are closely analogous to those of MPI, so that multiple copies of thread-local arrays do not lead to excessive memory consumption. On massively parallel, heterogeneous hardware systems, an efficient nesting of such coarse-grained OpenMP within distributed-memory MPI parallelism may be the best approach for obtaining optimal performance from a large class of applications. Examples ...
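The coarse-grained idea described in the abstract can be sketched roughly as follows. This is a minimal illustration in C, not code from the chapter: the function names `block_range` and `relax`, the 1-D smoothing stencil, and the boundary treatment are all assumptions chosen for brevity. A single `parallel` region is opened at the top level, and each thread computes the bounds of its own subdomain once, much as an MPI rank would, then works on that slice of the shared arrays for every time step instead of re-dividing each loop with a separate directive.

```c
#include <assert.h>
#include <stddef.h>
#ifdef _OPENMP
#include <omp.h>
#endif

/* Hypothetical helper: split n items into nparts contiguous blocks and
 * return the half-open range [lo, hi) owned by block `part`. */
void block_range(size_t n, int nparts, int part, size_t *lo, size_t *hi)
{
    assert(nparts > 0 && part >= 0 && part < nparts);
    size_t base = n / (size_t)nparts;   /* minimum block size          */
    size_t rem  = n % (size_t)nparts;   /* first `rem` blocks get +1   */
    size_t p    = (size_t)part;
    *lo = p * base + (p < rem ? p : rem);
    *hi = *lo + base + (p < rem ? 1 : 0);
}

/* Hypothetical example kernel: nsteps passes of a 3-point average over u,
 * using tmp as scratch space. Both arrays are shared; each thread touches
 * only its own subdomain when writing, so no per-thread copies are needed. */
void relax(double *u, double *tmp, size_t n, int nsteps)
{
    /* One parallel region at the highest level, outside the time loop. */
    #pragma omp parallel
    {
        int nthreads = 1, tid = 0;
#ifdef _OPENMP
        nthreads = omp_get_num_threads();
        tid      = omp_get_thread_num();
#endif
        size_t lo, hi;
        block_range(n, nthreads, tid, &lo, &hi);  /* decomposition, done once */

        for (int s = 0; s < nsteps; ++s) {
            for (size_t i = lo; i < hi; ++i) {
                /* Neighbours may belong to another thread: read-only here. */
                double left  = (i > 0)     ? u[i - 1] : u[i];
                double right = (i + 1 < n) ? u[i + 1] : u[i];
                tmp[i] = (left + u[i] + right) / 3.0;
            }
            /* Everyone must finish reading u before anyone overwrites it. */
            #pragma omp barrier
            for (size_t i = lo; i < hi; ++i)
                u[i] = tmp[i];
            /* Updated values must be visible before the next step reads them. */
            #pragma omp barrier
        }
    }
}
```

The two explicit barriers play a role loosely comparable to a halo exchange between MPI ranks: they are the only points where threads synchronise. The code also compiles without OpenMP support, in which case the pragmas are ignored and a single thread owns the whole domain.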