-
Notifications
You must be signed in to change notification settings - Fork 7
Performance Profiling
A crucial component of the EuroHack is knowing which components to port to GPUs. Here profiling the code (already on CPUs) will give a good indicator. Alistair Hart generously has provided some guidelines for profiling:
Profiling OpenACC programs There are a number of ways to get simple profiling and debugging information, often only by turning on environment variables. Alistair has put together a quick overview of these simple techniques for both Cray and PGI.
There are more complicated tools to profile OpenACC programs, which give you a nicer presentation of information. As with all technology, the devil is in the details. Jean-Guillaume Piccinali and Ben Cumming put together the following profiling example for various programming paradigms, including OpenACC: