Variorum 0.7 released

Tapasya Patki

Jun 13, 2023, 5:55:39 PM
to powerstac...@googlegroups.com
Hi everyone:

We’re pleased to announce that Variorum v0.7 has been released. This version supports power management on all upcoming DOE exascale platforms (El Capitan, Aurora and Frontier) as well as the ARM Neoverse N1. It is a major release with several useful features, including a new GPU power capping API for AMD, NVIDIA and Intel GPUs and improved non-intrusive monitoring through powmon.

To get to know Variorum as well as our team better, please watch our recently recorded video that explains how Variorum fits into an end-to-end PowerStack that we are working towards: https://www.youtube.com/watch?v=rgJGgPERBao 

We are now working toward further enhancing support for multi-architecture builds (e.g., supporting power monitoring and capping where the CPU and GPU vendors differ or use different underlying mechanisms, requiring builds across devices). As always, we’re happy to answer questions and welcome any contributions or feedback. Release notes follow.

Sincerely,
Tapasya, on behalf of the LLNL Argo PowerStack Team.

 

Variorum 0.7.0

Repository: LLNL/variorum · Tag: v0.7.0 · Commit: 6ea2644 · Released by: slabasan

Version 0.7.0 is a major release.

New features

  • Enable heterogeneous build support for two architectures
  • Support for new architectures: ARM Neoverse N1 platform, Sapphire Rapids
  • Add power capping API for NVIDIA, AMD and Intel GPUs
  • Add get power limit for Intel GPUs

Documentation updates

  • Add links to YouTube video and recordings from ECP Tutorial 2023
  • Change knob to dial in documentation
  • Update Variorum Overview Figure
  • Upload new logo

Bug fixes

  • Fix Fortran error with gfortran > 10
  • Fix issues arising with -DENABLE_WARNINGS=On on the AMD CPU port
  • Fix AMD build errors: add missing headers, undefined variables, etc.
  • Fix inconsistent spacing issue in powmon
  • Fix multiplatform build (ARM+NVIDIA) errors/warnings
  • Remove the undefined variable in AMD debug build

Internal changes

  • Powmon overhaul
  • Fix for JSON example memory leak
  • Remove unnecessary call to obtain number of sockets from JSON example
  • Update filenames and low-level function names, prefix power_features.* with architecture name
  • Replace existing RSMI API call with the correct API to get GPU busy %
  • Resolve missing non-optional arguments in control examples

This release has 2 assets:

  • Source code (zip)
  • Source code (tar.gz)

Visit the release page to download them.

