Ralph Castain
unread,Aug 24, 2025, 3:14:21 PM (13 days ago) Aug 24Sign in to reply to author
Sign in to forward
You do not have permission to delete messages in this group
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to pm...@googlegroups.com
Hi folks
After a 3-yr hiatus, PMIx is returning to the SC25 Birds-of-a-Feather schedule! We don’t have an exact date/time yet, but will pass that along once it is assigned.
Here is a preview of the announcement:
Since its introduction in 2014, the PMIx infrastructure initiative has grown into an international standard for integration between applications, tools, and system management environments. PMIx can now be found on clusters and cloud environments across the ecosystem, used by applications spanning a wide range of programming models and libraries. The initiative has been focused on supporting HPC application launch and wire-up; tools that wish to inspect, steer, and/or debug parallel applications; fault tolerance and cross-library coordination; and communication across container boundaries. While system vendors initially concentrated on supporting traditional wireup, they more recently have broadened their offerings to include advanced PMIx features such as event notification, group construction, and logging. In the interim, the community moved forward to further enrich application-to-runtime integration by adding support for monitoring and job control requests.
More recently, the PMIx community’s attention has turned to the final piece of the ecosystem: extending interactions between the application and runtime to include the system scheduler, thereby enabling support for dynamic on-demand changes to allocations. Several research programs have utilized prototype interfaces in their efforts to explore malleable operations by extending existing schedulers, or in some cases developing early versions of fully dynamic schedulers, that interact with applications via PMIx to request/negotiate allocation changes. The results have been reported in journals, conference proceedings, and workshops over the last few years.
Prior to the pandemic, this BOF had been held for 5 years at SC. In 2022, the BoF brought the community back together in person to discuss the PMIx Standard, the OpenPMIx implementation, and related projects. However, that means we now have three years of work to catch up on! We therefore extend an invitation to all community members, PMIx users, and system vendors to come get an update and help define the roadmap for the upcoming year.
This year’s BoF will briefly review in-progress proposed additions to the PMIx Standard, including support for I/O and storage and dynamic workflows; additions that have been formally adopted, including support for tools and debuggers; and updates and new features of the OpenPMIx implementation and PRRTE environment.
Our primary emphasis will then shift to discussion of current use-cases under consideration. This will include a look at the SLAC/NERSC “burst analysis” use-case and explorations on how PMIx/PRRTE can be used to help optimize that situation. We solicit the involvement of participants in identifying new uses-cases, and ask that you bring any questions, problems, and/or challenges to the meeting. In particular, we look forward to your help in defining priorities for the upcoming year, including creating working groups to investigate various aspects of the dynamic workload problem such as application registration and preemption mechanics, and site-level preemption policy. While PMIx doesn't set policy, we do need to understand the types of policies sites might utilize to encourage and execute preemption strategies so we can provide the required supporting infrastructure.
So come and help take dynamic environments mainstream...and for those who attend, there may even be the bonus of cookies!
Ralph