I would like to estimate how many servers I would need to monitor our network infrastructure. I would be using the snmp_exporter to poll the devices and expose the metrics to Prometheus.
Estimate of setup:
2,000 network devices
3,500 metrics per device
50 alarms that would cover just about ever metric/label
8 labels per metric
1 minute polling interval
I saw in the FAQ the benchmark data(525k samples/sec in 1.4M TS, 1650 targets) but am new to time series stuff and don't know how to translate my setup to compare to the benchmark. Is a timeseries in my case (total metrics) * labels?
I would also like to estimate disk space usage over a period of time to get an idea of how long I can keep data without aggregating. Is there a formula that people are using to estimate disk space(average or worst case)? If not would it be fair to monitor a single device and extrapolate out? Is there a metric exposed by the Prometheus metrics that I could use to see the disk utilization over time if I setup a single device?
--
You received this message because you are subscribed to the Google Groups "Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-users+unsubscribe@googlegroups.com.
To post to this group, send email to prometheus-users@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/bec1acdb-e225-44a5-887e-bd53504bedc8%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
There's some overhead for indexing time series, but after that, you can get an idea of average bytes-per-sample from the table in https://prometheus.io/docs/operating/storage/#chunk-encoding. It depends a bit on the exact shape of your data though.Depending on what kind of network devices you are monitoring, the network devices may actually be your bottleneck when you want a scrape interval of 1m. Some devices take *minutes* each to return all the requested metrics over SNMP (because production-grade network device manufacturers seemingly haven't seen pressure to improve their metrics situation yet). I don't have any experience with that myself, but have seen other people run into that problem multiple times.
--
You received this message because you are subscribed to the Google Groups "Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to prometheus-users+unsubscribe@googlegroups.com.
To post to this group, send email to prometheus-users@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/7d2661d1-9049-4711-b420-59ff15322df3%40googlegroups.com.