Prometheus JMX Exporter caching improvement

Яна Илиева

unread,

Dec 12, 2024, 11:02:02 AM12/12/24

to Prometheus Developers

Hello,

While using the Prometheus JMX Exporter as a Java agent in our application, we observed frequent GC clean ups and it turns out it is because the available heap space got exhausted by the caching of the Prometheus JMX Exporter.

For context - we have an exporter yaml file with about 40 rules defined and there are many JMX MBeans which we process in order to get the required metrics. Without caching enabled for the rules, the request can take a minimum of 1 minute to complete. In an attempt to reduce this time, we enabled caching and observed that the MatchedRulesCache object can take around 600 MB in a particular case. There is a realistic potential in out case that this object grows above 1 GB, which is a huge amount of space for this kind of process.

We identified that the major reason for the large size of MatchedRulesCache is that it contains duplicating String objects for the same MBean names. The object keeps all MBean names the rules were matched against previously, in order to avoid expensive pattern matching later. Although each rule caches the same set of MBean names, those are in fact separate objects in the heap.

The origin of the matter was found in https://github.com/prometheus/jmx_exporter/blob/a3dac9acee1464531cd87502579178a1fec1cc76/collector/src/main/java/io/prometheus/jmx/JmxCollector.java#L584

where for each rule with caching enabled, there is a new String object created, although such could already exist in memory from a previous iteration.

The duplication is by a factor of the number of rules with caching enabled.

I experimented with interning the String,

String matchName = (beanName + attributeName + ": " + matchBeanValue).intern();

so that the JVM's string pool is utilized and the String objects reused. The result in speed and heap space used by cache is described below:

Screenshot from 2024-12-12 14-53-26.png

What do you think about this and do you find this suggestion could have negative consequences in certain cases?

Kind regards,

Yana Ilieva

Doug Hoard

unread,

Dec 13, 2024, 1:44:47 AM12/13/24

to Prometheus Developers

Using String intern() can help in some scenarios and cause performance issues in other scenarios.

I made your change on my development branch and ran a quick test (./quick-test.sh) and didn't see any unit test or integration test issues.

I just started a full regression test and will have some results in the morning (US time.) (It takes a little over 3 hours.)

-Doug

Doug Hoard

unread,

Dec 13, 2024, 8:44:59 AM12/13/24

to Prometheus Developers

Yana,

The full integration test passed without issues. Can you create a PR of your findings and the change so we can document it and give credit?

-Doug

Яна Илиева

unread,

Dec 13, 2024, 8:54:31 AM12/13/24

to Prometheus Developers

Hello Doug,

Thank you for the quick reaction and the testing. I will proceed with opening the PR.

Kind regards,

Yana

Francisco Melo junior

unread,

Dec 18, 2024, 7:30:48 AM12/18/24

to yana.il...@gmail.com, Prometheus Developers

Hello I was wondering here what is the result of you disable string deduplication XX:+UseStringDeduplication, or was that the caching you did? It should reduce the footprint considerably if the duplication is the problem, not increase it.

-Francisco

On Dec 13, 2024, at 9:54 AM, Яна Илиева <yana.il...@gmail.com> wrote:

Hello Doug,

--
You received this message because you are subscribed to the Google Groups "Prometheus Developers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-devel...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/prometheus-developers/6c2d05ad-6674-424e-afe3-3289b370b286n%40googlegroups.com.

Doug Hoard

unread,

Dec 19, 2024, 2:19:40 PM12/19/24

to Prometheus Developers

Francisco,

Using "XX:+UseStringDeduplication" to enable String deduplication for the whole JVM process could have unwanted side effects, so performing the "intern()" of the cache keys directly was deemed a much safer option.

Francisco Melo junior

unread,

Dec 19, 2024, 6:57:27 PM12/19/24

to Doug Hoard, Prometheus Developers

Sure thank you.

Can you be more specific on the consequences?

Evidently there will be an overhead for the comparison for each string, if that’s what you mean. But if the focus is less footprint. And should be (almost) similar to intern(), because the options are done the same number of times.

Also it should only remove the duplicates but keep the strings all in the pool.

Unless the strings are used for as the actual keys in the caching so removing the duplicates will indeed be problematic because you will lose access to the value associated with it.

Sent from Earth

On Dec 19, 2024, at 3:19 PM, Doug Hoard <doug....@gmail.com> wrote:

Francisco,

To view this discussion visit https://groups.google.com/d/msgid/prometheus-developers/b8828a92-6422-4fb6-9103-e038e1c4b191n%40googlegroups.com.

Яна Илиева

unread,

Jan 23, 2025, 7:39:04 AM1/23/25

to Prometheus Developers

Hi,

Our intention was to minimize the scope of the desired effect to only those strings which are used in the caching mechanism of the JMX exporter. UseStringDeduplication is a global setting that affects the main application as well and this is undesirable in our case.

If there are other solutions to the problem at hand, I'll be glad to explore them, of course.

P.S.
Doug, when do you think this change will be available in a release, be it minor or major?

Doug Hoard

unread,

Jan 24, 2025, 6:48:10 PM1/24/25

to Prometheus Developers

Also, UseStringDuplication can’t be enabled from code.

@Yana I have a few PRs that I need find time to review before the next release.

I’m targeting a couple of weeks.

Reply all

Reply to author

Forward