
Performance Profiling

jgp edited this page Jul 8, 2015 · 7 revisions

A crucial part of EuroHack is knowing which components of your code to port to GPUs. Profiling the existing CPU version of the code is a good indicator of where the time is spent. Alistair Hart has generously provided some guidelines for profiling:

Getting Ready for EuroHack

Profiling OpenACC programs: There are several ways to get simple profiling and debugging information, often just by setting environment variables. Alistair has put together a quick overview of these simple techniques for both the Cray and PGI compilers.
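As a rough illustration of the environment-variable approach, here is a minimal Python sketch that launches a program with runtime reporting switched on. It assumes `CRAY_ACC_DEBUG` (a verbosity level) for Cray and `PGI_ACC_TIME` (a timing summary at exit) for PGI. The helper names `profiling_env` and `run_with_profiling`, the chosen verbosity level, and the executable name in the usage comment are hypothetical, not taken from Alistair's overview.

```python
import os
import subprocess
import sys

# Environment variables that switch on runtime reporting, keyed by compiler.
# Assumption: level 2 for CRAY_ACC_DEBUG is only an illustrative choice;
# consult the compiler documentation for the levels available to you.
PROFILE_ENV = {
    "cray": {"CRAY_ACC_DEBUG": "2"},
    "pgi": {"PGI_ACC_TIME": "1"},
}


def profiling_env(compiler, base=None):
    """Return a copy of the environment with the reporting variables set."""
    env = dict(os.environ if base is None else base)
    env.update(PROFILE_ENV[compiler])
    return env


def run_with_profiling(command, compiler):
    """Run `command` (a list of arguments) with reporting enabled.

    stdout and stderr are both captured so the runtime's messages can be
    inspected regardless of which stream they are written to.
    """
    return subprocess.run(
        command,
        env=profiling_env(compiler),
        capture_output=True,
        text=True,
    )


if __name__ == "__main__":
    # Usage (hypothetical executable): python run_profiled.py cray ./my_openacc_app
    if len(sys.argv) >= 3:
        result = run_with_profiling(sys.argv[2:], sys.argv[1])
        sys.stdout.write(result.stdout)
        sys.stderr.write(result.stderr)
```

Setting the variables in the shell or batch script before launching the job has the same effect; the wrapper is only convenient when comparing several runs from a script.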

Getting Feedback at EuroHack

There are more sophisticated tools for profiling OpenACC programs, which present the information more clearly. As with all technology, the devil is in the details. Jean-Guillaume Piccinali and Ben Cumming put together the following profiling examples for various programming paradigms, including OpenACC:

OpenACC profiling examples for CSCS/USI Summer School

Cray CCE -h profile_generate

Cray Perftools API

Cray CRAY_ACC_DEBUG API

Nvidia nvprof
