Handling of Communicator Names

Intel® Trace Analyzer and Collector User and Reference Guide

Download PDF

ID 767272

Date 3/31/2023

Version 2021.10

Public

Visible to Intel only — GUID: GUID-988AD4F7-B1D6-44E4-98BC-B92966C4E268

View Details

Document Table of Contents

Document Table of Contents x

Intel® Trace Analyzer and Collector User and Reference Guide

Intel® Trace Analyzer and Collector User and Reference Guide x

Introduction Install and Set Up Intel® Trace Analyzer and Collector Trace Your Applications Analyze Your Applications Intel® Trace Collector Reference Intel® Trace Analyzer Reference Notices and Disclaimers

Introduction x

Notational Conventions Get Help

Trace Your Applications x

Tracing Conventional MPI Applications Tracing Failing MPI Applications Tracing OpenSHMEM* Applications Tracing MPI File IO Handling of Communicator Names Tracing MPI Load Imbalance Tracing User Defined Events Configuring the Collector Filtering Trace Data Recording OpenMP* Regions Information Tracing System Calls (Linux* OS) Collecting Lightweight Statistics Recording Source Location Information Recording Hardware Performance Information (Linux* OS) Recording Operating System Counters Tracing Library Calls Correctness Checking Tracing Distributed Non-MPI Applications

Correctness Checking x

Correctness Checking of MPI Applications Running with Valgrind* (Linux* OS) Configuring Error Checks Analyzing the Results Debugger Integration

Debugger Integration x

TotalView* Debugger GNU* Symbolic Debugger Allinea* Distributed Debugging Tool* (DDT*)

Analyze Your Applications x

Starting Intel® Trace Analyzer Intel Trace Analyzer Graphical User Interface Navigating Timelines Concepts Viewing Correctness Checking Reports Comparing Two Trace Files Interoperability with Intel® VTune™ Profiler and Intel® Advisor OpenMP* Regions Display Support OTF2 Format Support

Navigating Timelines x

Zoom Stack

Concepts x

Level of Detail Aggregation Advanced Aggregation Tagging and Filtering

Viewing Correctness Checking Reports x

Event Timeline Correctness Checking Reports Qualitative Timeline Correctness Checking Reports Detailed Dialog

Comparing Two Trace Files x

Mappings in Comparison Views Comparison Charts

Mappings in Comparison Views x

Mapping of Processes Mapping of Functions

Comparison Charts x

Comparison Function Profile Comparison Message Profile Comparison Collective Operations Profile

Intel® Trace Collector Reference x

API Reference Configuration Reference Correctness Checking Errors Structured Tracefile Format stftool Utility Time Stamping Secure Loading of Dynamic Link Libraries* on Windows* OS

API Reference x

Initialization, Termination and Control Defining and Recording Source Locations Defining and Recording Functions or Regions Defining and Recording Scopes Defining Groups of Processes Defining and Recording Counters Recording Communication Events Additional API Calls in libVTcs C++ API

Initialization, Termination and Control x

VT_initialize VT_finalize VT_getrank VT_registerthread VT_registernamed VT_registerprefixed VT_getthrank VT_traceon VT_traceoff VT_tracestate VT_symstate VT_flush VT_timestamp VT_timestart VT_setfinalizecallback VT_getdescription VT_countsetcallback

Defining and Recording Functions or Regions x

New Interface Old Interface State Changes

C++ API x

VT_FuncDef Class Reference VT_SclDef Class Reference VT_Function Class Reference VT_Region Class Reference

Configuration Reference x

Configuration File Format Protocol File Configuration Options

Configuration Options x

ACTIVITY ALTSTACK AUTOFLUSH CHECK CHECK-LEAK-REPORT-SIZE CHECK-MAX-DATATYPES CHECK-MAX-ERRORS CHECK-MAX-PENDING CHECK-MAX-REPORTS CHECK-MAX-REQUESTS CHECK-SUPPRESSION-LIMIT CHECK-TIMEOUT CHECK-TRACING CLUSTER COMPRESS-RAW-DATA COUNTER CURRENT-DIR DEADLOCK-TIMEOUT DEADLOCK-WARNING DEMANGLE DETAILED-STATES ENTER-USERCODE ENVIRONMENT EXTENDED-VTF FLUSH-PID FLUSH-PREFIX GROUP HANDLE-SIGNALS INTERNAL-MPI KEEP-RAW-EVENTS LOGFILE-FORMAT LOGFILE-NAME LOGFILE-PREFIX LOGFILE-RANK MEM-BLOCKSIZE MEM-FLUSHBLOCKS MEM-INFO MEM-MAXBLOCKS MEM-MINBLOCKS MEM-OVERWRITE NMCMD OS-COUNTER-DELAY PCTRACE PCTRACE-CACHE PCTRACE-FAST PLUGIN PROCESS PROGNAME PROTOFILE-NAME STATISTICS STATE STF-PROCS-PER-FILE STF-USE-HW-STRUCTURE STOPFILE-NAME SYMBOL SYNC-MAX-DURATION SYNC-MAX-MESSAGES SYNC-PERIOD SYNCED-CLUSTER SYNCED-HOST TIME-WINDOWS (Experimental) TIMER TIMER-SKIP UNIFY-COUNTERS UNIFY-GROUPS UNIFY-SCLS UNIFY-SYMBOLS VERBOSE VT_START_PAUSED VT_COMPRESS_TRACE

Correctness Checking Errors x

Supported Errors How the Collection Works

How the Collection Works x

Parameter Checking Premature Exit Overlapping Memory Detecting Illegal Buffer Modifications Buffer Given to MPI Cannot Be Read or Written Distributed Memory Checking Illegal Memory Access Request Handling Datatype Handling Buffered Sends Deadlocks Checking Message Transmission Datatype Mismatches Data Modified during Transmission Checking Collective Operations Freeing Communicators

Structured Tracefile Format x

STF Components Single-File STF Configuring STF

stftool Utility x

stftool Utility Options Expanded ASCII output of STF Files

Time Stamping x

Clock Synchronization Choosing a Timer

Choosing a Timer x

gettimeofday/_ftime QueryPerformanceCounter CPU Cycle Counter Normalized CPU Cycle Counter MPI_Wtime() High Precision Event Timers POSIX* clock_gettime

Intel® Trace Analyzer Reference x

Graphical User Interface Reference Intel® Trace Analyzer Command Line Interface Reference Filter Expression Grammar otf2-to-stf Utility

Graphical User Interface Reference x

Welcome Page Summary Page Main Menu Bar View Menu Bar View Bars Charts Dialogs Settings

Main Menu Bar x

File Menu Options Menu Project Menu Windows Menu Help Menu

View Menu Bar x

View Charts Navigate Advanced Layout Comparison Menu

Advanced x

Tagging Specific Events Filtering Events Simulating Ideal Communication Checking Application Imbalance Aggregating Results Aggregating Functions Creating Command Line for Intel® VTune™ Profiler and Intel® Advisor

View Bars x

Toolbar Trace Map Status Bar

Charts x

Event Timeline Qualitative Timeline Quantitative Timeline Counter Timeline Function Profile Message Profile Collective Operations Profile Performance Assistant Common Chart Features

Event Timeline x

Context Menu Filtering and Tagging

Qualitative Timeline x

Context Menu Filtering and Tagging

Quantitative Timeline x

Context Menu Filtering and Tagging

Counter Timeline x

Context Menu Filtering and Tagging

Function Profile x

Flat Profile Load Balance Call Tree Call Graph Context Menu Filtering and Tagging Function Profile Settings

Message Profile x

Context Menu Filtering and Tagging Aggregation Message Profile Settings

Collective Operations Profile x

Context Menu Filtering and Tagging Collective Operations Profile Settings

Dialogs x

Process Aggregation Function Aggregation Function Group Color Editor Filtering Dialog Box Tagging Dialog Box Idealization Dialog Box Imbalance Diagram Dialog Box Trace Merge Dialog Box Details Dialog Box Source View Dialog Time Interval Selection Configuration Dialogs Find Dialog Box Command line for Intel® VTune™ Profiler and Intel® Advisor Dialog Box OTF2 to STF Conversion Dialog Box Configuration Assistant

Process Aggregation x

Comparison Mode

Function Aggregation x

Comparison Mode

Filtering Dialog Box x

Building Filter Expressions Using Graphical Interface Building Filter Expressions Manually Filter Expressions in Comparison Mode

Details Dialog Box x

Detailed Attributes of Function Events Detailed Attributes of Message Events Detailed Attributes of Collective Operation Events

Configuration Dialogs x

Load Configuration File Dialog Edit Configuration File Dialog

Settings x

Preferences Font Settings Number Formatting Settings

Preferences x

General Preferences Tracefile Preferences Event Timeline Settings Qualitative Timeline Settings Quantitative Timeline Settings Counter Timeline Settings

Notices and Disclaimers x

Appendix A Copyright and Licenses

Intel® Trace Analyzer and Collector User and Reference Guide

Introduction

Notational Conventions

Get Help

Install and Set Up Intel® Trace Analyzer and Collector

Trace Your Applications

Tracing Conventional MPI Applications

Tracing Failing MPI Applications

Tracing OpenSHMEM* Applications

Tracing MPI File IO

Handling of Communicator Names

Tracing MPI Load Imbalance

Tracing User Defined Events

Configuring the Collector

Filtering Trace Data

Recording OpenMP* Regions Information

Tracing System Calls (Linux* OS)

Collecting Lightweight Statistics

Recording Source Location Information

Recording Hardware Performance Information (Linux* OS)

Recording Operating System Counters

Tracing Library Calls

Correctness Checking

Correctness Checking of MPI Applications

Running with Valgrind* (Linux* OS)

Configuring Error Checks

Analyzing the Results

Debugger Integration

TotalView* Debugger

GNU* Symbolic Debugger

Allinea* Distributed Debugging Tool* (DDT*)

Tracing Distributed Non-MPI Applications

Analyze Your Applications

Starting Intel® Trace Analyzer

Intel Trace Analyzer Graphical User Interface

Navigating Timelines

Zoom Stack

Concepts

Level of Detail

Aggregation

Advanced Aggregation

Tagging and Filtering

Viewing Correctness Checking Reports

Event Timeline Correctness Checking Reports

Qualitative Timeline Correctness Checking Reports

Detailed Dialog

Comparing Two Trace Files

Mappings in Comparison Views

Mapping of Processes

Mapping of Functions

Comparison Charts

Comparison Function Profile

Comparison Message Profile

Comparison Collective Operations Profile

Interoperability with Intel® VTune™ Profiler and Intel® Advisor

OpenMP* Regions Display Support

OTF2 Format Support

Intel® Trace Collector Reference

API Reference

Initialization, Termination and Control

VT_initialize

VT_finalize

VT_getrank

VT_registerthread

VT_registernamed

VT_registerprefixed

VT_getthrank

VT_traceon

VT_traceoff

VT_tracestate

VT_symstate

VT_flush

VT_timestamp

VT_timestart

VT_setfinalizecallback

VT_getdescription

VT_countsetcallback

Defining and Recording Source Locations

Defining and Recording Functions or Regions

New Interface

Old Interface

State Changes

Defining and Recording Scopes

Defining Groups of Processes

Defining and Recording Counters

Recording Communication Events

Additional API Calls in libVTcs

C++ API

VT_FuncDef Class Reference

VT_SclDef Class Reference

VT_Function Class Reference

VT_Region Class Reference

Configuration Reference

Configuration File Format

Protocol File

Configuration Options

ACTIVITY

ALTSTACK

AUTOFLUSH

CHECK

CHECK-LEAK-REPORT-SIZE

CHECK-MAX-DATATYPES

CHECK-MAX-ERRORS

CHECK-MAX-PENDING

CHECK-MAX-REPORTS

CHECK-MAX-REQUESTS

CHECK-SUPPRESSION-LIMIT

CHECK-TIMEOUT

CHECK-TRACING

CLUSTER

COMPRESS-RAW-DATA

COUNTER

CURRENT-DIR

DEADLOCK-TIMEOUT

DEADLOCK-WARNING

DEMANGLE

DETAILED-STATES

ENTER-USERCODE

ENVIRONMENT

EXTENDED-VTF

FLUSH-PID

FLUSH-PREFIX

GROUP

HANDLE-SIGNALS

INTERNAL-MPI

KEEP-RAW-EVENTS

LOGFILE-FORMAT

LOGFILE-NAME

LOGFILE-PREFIX

LOGFILE-RANK

MEM-BLOCKSIZE

MEM-FLUSHBLOCKS

MEM-INFO

MEM-MAXBLOCKS

MEM-MINBLOCKS

MEM-OVERWRITE

NMCMD

OS-COUNTER-DELAY

PCTRACE

PCTRACE-CACHE

PCTRACE-FAST

PLUGIN

PROCESS

PROGNAME

PROTOFILE-NAME

STATISTICS

STATE

STF-PROCS-PER-FILE

STF-USE-HW-STRUCTURE

STOPFILE-NAME

SYMBOL

SYNC-MAX-DURATION

SYNC-MAX-MESSAGES

SYNC-PERIOD

SYNCED-CLUSTER

SYNCED-HOST

TIME-WINDOWS (Experimental)

TIMER

TIMER-SKIP

UNIFY-COUNTERS

UNIFY-GROUPS

UNIFY-SCLS

UNIFY-SYMBOLS

VERBOSE

VT_START_PAUSED

VT_COMPRESS_TRACE

Correctness Checking Errors

Supported Errors

How the Collection Works

Parameter Checking

Premature Exit

Overlapping Memory

Detecting Illegal Buffer Modifications

Buffer Given to MPI Cannot Be Read or Written

Distributed Memory Checking

Illegal Memory Access

Request Handling

Datatype Handling

Buffered Sends

Deadlocks

Checking Message Transmission

Datatype Mismatches

Data Modified during Transmission

Checking Collective Operations

Freeing Communicators

Structured Tracefile Format

STF Components

Single-File STF

Configuring STF

stftool Utility

stftool Utility Options

Expanded ASCII output of STF Files

Time Stamping

Clock Synchronization

Choosing a Timer

gettimeofday/_ftime

QueryPerformanceCounter

CPU Cycle Counter

Normalized CPU Cycle Counter

MPI_Wtime()

High Precision Event Timers

POSIX* clock_gettime

Secure Loading of Dynamic Link Libraries* on Windows* OS

Intel® Trace Analyzer Reference

Graphical User Interface Reference

Welcome Page

Summary Page

Main Menu Bar

File Menu

Options Menu

Project Menu

Windows Menu

Help Menu

View Menu Bar

View

Charts

Navigate

Advanced

Tagging Specific Events

Filtering Events

Simulating Ideal Communication

Checking Application Imbalance

Aggregating Results

Aggregating Functions

Creating Command Line for Intel® VTune™ Profiler and Intel® Advisor

Layout

Comparison Menu

View Bars

Toolbar

Trace Map

Status Bar

Charts

Event Timeline

Context Menu

Filtering and Tagging

Qualitative Timeline

Context Menu

Filtering and Tagging

Quantitative Timeline

Context Menu

Filtering and Tagging

Counter Timeline

Context Menu

Filtering and Tagging

Function Profile

Flat Profile

Load Balance

Call Tree

Call Graph

Context Menu

Filtering and Tagging

Function Profile Settings

Message Profile

Context Menu

Filtering and Tagging

Aggregation

Message Profile Settings

Collective Operations Profile

Context Menu

Filtering and Tagging

Collective Operations Profile Settings

Performance Assistant

Common Chart Features

Dialogs

Process Aggregation

Comparison Mode

Function Aggregation

Comparison Mode

Function Group Color Editor

Filtering Dialog Box

Building Filter Expressions Using Graphical Interface

Building Filter Expressions Manually

Filter Expressions in Comparison Mode

Tagging Dialog Box

Idealization Dialog Box

Imbalance Diagram Dialog Box

Trace Merge Dialog Box

Details Dialog Box

Detailed Attributes of Function Events

Detailed Attributes of Message Events

Detailed Attributes of Collective Operation Events

Source View Dialog

Time Interval Selection

Configuration Dialogs

Load Configuration File Dialog

Edit Configuration File Dialog

Find Dialog Box

Command line for Intel® VTune™ Profiler and Intel® Advisor Dialog Box

OTF2 to STF Conversion Dialog Box

Configuration Assistant

Settings

Preferences

General Preferences

Tracefile Preferences

Event Timeline Settings

Qualitative Timeline Settings

Quantitative Timeline Settings

Counter Timeline Settings

Font Settings

Number Formatting Settings

Intel® Trace Analyzer Command Line Interface Reference

Filter Expression Grammar

otf2-to-stf Utility

Notices and Disclaimers

Appendix A Copyright and Licenses

Visible to Intel only — GUID: GUID-988AD4F7-B1D6-44E4-98BC-B92966C4E268

View Details

Handling of Communicator Names

By default, Intel® Trace Collector stores names for well-known communicators in the trace: COMM_WORLD, COMM_SELF_#0, COMM_SELF_#1 and so on. When new communicators are created, their names are composed of a prefix, a space and the name of the old communicator. For example, calling MPI_Comm_dup() on MPI_COMM_WORLD will lead to a communicator called DUP COMM_WORLD.

There are the following prefixes for MPI functions:

MPI Function	Prefix
MPI_Comm_create()	CREATE
MPI_Comm_dup()	DUP
MPI_Comm_split()	SPLIT
MPI_Cart_sub()	CART_SUB
MPI_Cart_create()	CART_CREATE
MPI_Graph_create()	GRAPH_CREATE
MPI_Intercomm_merge()	MERGE

MPI_Intercomm_merge() is special because the new communicator is derived from two communicators, not just one as in the other functions. The name of the new inter-communicator will be MERGE <old name 1>/<old name 2> if the two existing names are different, otherwise it will be just MERGE <old name>.

In addition to these automatically generated names, Intel Trace Collector also intercepts MPI_Comm_set_name() and then uses the name provided by the application. Only the last name set with this function is stored in the trace for each communicator. Derived communicators always use the name that is currently set in the old communicator when the new communicator is created.

Intel Trace Collector does not attempt to synchronize the names set for the same communicator in different processes, therefore the application has to set the same name in all processes to ensure that this name is really used by Intel Trace Collector.

Parent topic: Trace Your Applications

Level Two Title

Select Your Language

Using Intel.com Search

Quick Links

Recent Searches

Advanced Search

Only search in

Intel® Trace Analyzer and Collector User and Reference Guide

Handling of Communicator Names