I am deploying a Kubernetes environment and hope to use Prometheus to monitor all of my pods. I will have hundreds of pods each generating its own set of metrics with a unique kubernetes_pod_name label. I am trying to figure out how the caution statement about creating too many time series due to multiple key-value label pairs (
https://prometheus.io/docs/practices/naming/#labels) is applicable in a Kubernetes environment. We do multiple deployments every day which will potentially create a thousand unique kubernetes_pod_name labels per day. This doesn't even include autoscaling impacts. I don't see a way to follow the caution statements advice in my environment.
Am I missing a workaround for this issue? I don't think there is any other option than to apply a unique label to each pod's set of metrics. What is the real impact with this scenario? Is storage space preallocated for each unique time series, will aggregation across this number of time series be too CPU intensive, will I need large amounts of RAM to handle the promql queries?
Thanks,