Você está aqui

Intel® Trace Analyzer and Collector

This page provides the current Release Notes for Intel® Trace Analyzer and Collector. The notes are categorized by year, from newest to oldest, with individual releases listed within each year.

Click a version to expand it into a summary of new features and changes in that version since the last release, and access the download buttons for the detailed release notes, which include important information, such as pre-requisites, software compatibility, installation instructions, and known issues.

The first step in analyzing a hybrid MPI/OpenMP* application is getting an overview of the application performance. There is a tool called Application Performance Snapshot that can provide the general performance information about your application. This includes MPI and OpenMP time and load balance information, information about memory and disk usage, most utilized MPI operations, and more.

Use Intel® VTune™ Amplifier to understand why the computation time of the 2/64 combination is worse than the 32/4 combination even though the elapsed time is much less. A lower elapsed time for 32/4 is not possible due to the overhead of MPI deployments. As a result, it is better to focus on improving the computation time for the 2/64 combination instead.

You have completed the Analyzing OpenMP* and MPI Applications tutorial with Application Performance Snapshot, Intel® Trace Analyzer and Collector, and Intel® VTune™ Amplifier. Here are some important things to remember when working with your own hybrid application: