Hi,
I'm currently working on a project that aims to perform
detailed measurements of internal MPI communication, particularly based
on the PML layer. For example, I would like to measure internal
point-to-point communication within collective operations.
I’ve seen that using the monitoring module (enabled via --mca pml_monitoring_enable 2), Open MPI already provides summaries of the underlying point-to-point operations after program execution. This is very helpful.
My goal is to access this information dynamically through the MPI_T interface and trace the data for different MPI functions at runtime.
However, I’m running into a limitation:
The coll_monitoring_messages_count PVAR only shows counts for collective operations.
The pml_monitoring_messages_count PVAR only seems to capture activity from explicit point-to-point operations (like MPI_Send/MPI_Recv), but not from collectives such as MPI_Bcast.
My question is:
Is it currently possible to observe the internal point-to-point operations triggered by collectives (as visible in the monitoring summary) through MPI_T performance variables?
I’d greatly appreciate any insights or recommendations.
Best regards
Anna
To unsubscribe from this group and stop receiving emails from it, send an email to users+un...@lists.open-mpi.org.