Optimizing HPC Applications with Intel Cluster Tools: Hunting Petaflops

Optimizing HPC Applications with Intel Cluster Tools: Hunting Petaflops

Alexander Supalov, Andrey Semin, Christopher Dahnken, Michael Klemm

Optimizing HPC functions with Intel® Cluster instruments takes the reader on a journey of the fast-growing sector of excessive functionality computing and the optimization of hybrid courses. those courses in most cases mix dispensed reminiscence and shared reminiscence programming versions and use the Message Passing Interface (MPI) and OpenMP for multi-threading to accomplish the last word objective of excessive functionality at low strength intake on enterprise-class workstations and compute clusters.

The e-book makes a speciality of optimization for clusters along with the Intel® Xeon processor, however the optimization methodologies additionally follow to the Intel® Xeon Phi™ coprocessor and heterogeneous clusters blending either architectures. along with the educational and reference content material, the authors deal with and refute many myths and misconceptions surrounding the subject. The textual content is augmented and enriched via descriptions of real-life situations.

What you’ll learn

  • Practical, hands-on examples convey tips on how to make clusters and workstations in response to Intel® Xeon processors and Intel® Xeon Phi™ coprocessors "sing" in Linux environments

  • How to grasp the synergy of Intel® Parallel Studio XE 2015 Cluster variation, along with Intel® Composer XE, Intel® MPI Library, Intel® hint Analyzer and Collector, Intel® VTune™ Amplifier XE, and plenty of different invaluable tools

  • How to accomplish rapid and tangible optimization effects whereas refining your knowing of software program layout principles

Who this publication is for

software program execs will use this publication to layout, enhance, and optimize their parallel courses on Intel structures. scholars of laptop technological know-how and engineering will worth the publication as a finished reader, compatible to many optimization classes provided all over the world. The beginner reader will take pleasure in an intensive grounding within the fascinating international of parallel computing.

Table of Contents

Foreword by means of Bronis de Supinski, CTO, Livermore Computing, LLNL


Chapter 1: No Time to learn this Book?

Chapter 2: evaluation of Platform Architectures

Chapter three: Top-Down software program Optimization

Chapter four: Addressing method Bottlenecks

Chapter five: Addressing software Bottlenecks: dispensed Memory

Chapter 6: Addressing program Bottlenecks: Shared Memory

Chapter 7: Addressing software Bottlenecks: Microarchitecture

Chapter eight: software layout Considerations

Show sample text content

Download sample