Problem with different EvenSets in PAPI on Intel GPU

46 views
Skip to first unread message

Nikola Kušlaković

unread,
Aug 19, 2024, 1:57:04 PM8/19/24
to ptools-...@icl.utk.edu, John Mellor-Crummey
Hello,

I am a research intern working with the HPCToolkit team at Rice University under the mentorship of Professor John Mellor-Curmmey.
I am using PAPI to get hardware counters for Intel GPU (Ponte Vecchio architecture). I am not able to add events belonging to different measurement groups (e.g. ComputeBasic, GpuOffload, Memory1 etc.)
to different EventSets. My approach was:
  1. Initialize PAPI GPU component
  2. PAPI create EventSet1
  3. PAPI add named events to EventSet1 - events from ComputeBasic group
  4. PAPI create EventSet2 
  5. PAPI add named events to EventSet2 - events from Memory1 group
Using this approach an error occurs in step 5. Note that this error occurs before I do PAPI_start on any EventSet. 

I am running this on a machine (headroom) from the University of Oregon. I built PAPI and intel_gpu component from the master branch and followed instructions for setting up permissions on the system.

Help would be appreciated.

Best regards,
Nikola Kuslakovic

Heike Jagode

unread,
Aug 19, 2024, 2:35:09 PM8/19/24
to Nikola Kušlaković, ptools-...@icl.utk.edu, John Mellor-Crummey
What is the error you're getting?

Thanks,
Heike

--
You received this message because you are subscribed to the Google Groups "ptools-perfapi" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ptools-perfap...@icl.utk.edu.
To view this discussion on the web visit https://groups.google.com/a/icl.utk.edu/d/msgid/ptools-perfapi/CAHa59f4Oxc9mt26u5nOr5vAR9BQ5RLpCfZR%3DempU0gxidb%3D8%3DA%40mail.gmail.com.


--
__________________________________________
Heike Jagode, Ph.D., Research Associate Professor
Innovative Computing Laboratory (ICL)
University of Tennessee Knoxville
http://icl.utk.edu/~jagode/

Heike Jagode

unread,
Aug 19, 2024, 3:17:14 PM8/19/24
to Nikola Kušlaković, ptools-perfapi, John Mellor-Crummey
Are you able to use EventSet1 and EventSet2 with their respective events in isolation? Or does the error still occur when you use them separately?
Also, if possible, it would be helpful if you could share your test code.

Thanks,
Heike


On Mon, Aug 19, 2024 at 2:39 PM Nikola Kušlaković <nk...@rice.edu> wrote:
I am getting:

Error on PAPI_add_named_event <some_metric_name>, retVal -18

Nikola Kušlaković

unread,
Aug 19, 2024, 4:20:55 PM8/19/24
to Heike Jagode, ptools-perfapi, John Mellor-Crummey
Attached to this email is the demo code I created and Makefile to build it. This demo code only initializes the inte_gpu component and tries to register two named events from different groups (ComputeBasic, Memory1) to different EventSets.

The output is the following:
ZET_ENABLE_API_TRACING_EXP is deprecated. Use ZE_ENABLE_TRACING_LAYER instead
Eventset: 0
Eventset: 1
Error on PAPI_add_named_event Memory1.GPU_MEMORY_64B_TRANSACTION_READ_SQIDI0, retVal -18

Thanks,
Nikola
papi_demo.cxx
Makefile

Daniel Barry

unread,
Sep 17, 2024, 9:30:36 AM9/17/24
to ptools-perfapi, Nikola Kušlaković, ptools-perfapi, johnmc, Heike Jagode
Hi Nikola,

We have been looking into this, and we were able to reproduce and partially fix the issue. However, we have encountered other related issues, which are preventing full intel_gpu component functionality. I will post an update here as soon as these issues are all resolved.

Thank you,
Daniel

Daniel Barry

unread,
Oct 9, 2024, 2:55:02 PM10/9/24
to ptools-perfapi, Daniel Barry, Nikola Kušlaković, ptools-perfapi, johnmc, Heike Jagode
Hi Nikola,

We have now addressed these issues in the 'master' branch of the PAPI repository. Please let us know if there are any further issues.

Thank you for your patience.
Daniel

Heike Jagode

unread,
Oct 11, 2024, 11:44:04 AM10/11/24
to Nikola Kušlaković, nikola.kus...@gmail.com, Daniel Barry, ptools-perfapi, johnmc
Nikola,

Thank you for testing the new version.

While working on a fix for the initial issue, we too encountered several other issues along the way. In the PR that Daniel submitted, he addressed a number of these additional issues. Some of them may or may not be the ones you were referring to. Regardless, please keep us posted on anything else you encounter.

Thanks again,
Heike


On Wed, Oct 9, 2024 at 3:03 PM Nikola Kušlaković <nk...@rice.edu> wrote:
Thank you for fixing the issues. I will test the new version.

There are other potential bugs I have encountered. I am investigating those to make sure where and under what condition they occur. Perhaps, you will hear from me soon.

Best regards,
Nikola

Nikola Kušlaković

unread,
Oct 21, 2024, 9:05:06 AM10/21/24
to Daniel Barry, ptools-perfapi, johnmc, Heike Jagode
Thank you for fixing the issues. I will test the new version.

There are other potential bugs I have encountered. I am investigating those to make sure where and under what condition they occur. Perhaps, you will hear from me soon.

Best regards,
Nikola

Reply all
Reply to author
Forward
0 new messages