1. Is process_resident_memory_bytes[link] the right metric to monitor and alert on for Prometheus memory usage, or should we use the k8s pod metric container_memory_working_set_bytes[link] of the Prometheus pod?
2. Can the process_resident_memory_bytes metric go above the VM's physical RAM, or will the OOM killer hit first? (Is any of the memory counted in this metric evictable by the kernel to avoid an OOM kill?)
3. Prometheus compaction may cause huge memory spikes (as mentioned above and in issue1 and issue2). Is there a way to avoid such huge spikes during compaction (e.g. by tuning Prometheus settings or by increasing the instance RAM)?