Hi,
I must admit, I am currently neither yet very familiar with opentsdb nor with influxdb in detail.
I am currently in the architect phase of a large farm of debian vm's which are going to host a bunch of llinux containers, each.
The VM's are spread across different hosting locations, differend layer2 subnets, etc. all without any common network infrastructure and no VPN.
For monitoring of certain metrics and logs I am considering to run influxdb (along with fluentd) locally as a container on each vm. This container would be the datastore for each other containers metric (running on the same VM as well as for the VM itself) and syslog data. It will store application log and performance data, system (OS) performance data and also be target of syslog. Each data source (on the vm itself or from other local containers) will use an alias to reach the vm-local fluentd/influxdb container - so the alias will always point to the local influxdb/fluentd instance and if the container moves, its data might so be spread across different influxdb instances. The main reason for selecting influxdb here is because influxdb is lightweight and easy to deploy and maintain compared to opentsdb. I do not want to analyze the data directly from there.
For graphing and central analysis, I do want to ship the data of all influxdb instances to a central opentsdb with a graphana frontend, backed by an existing large hbase cluster.
I just wanted to check, if this sounds reasonable and if there are any available solutions already to allow asynchronous pull (push is no option!) from influxdb data into openTSDB, 1:1. From what I have read so far abouth both, influxdb and opentsdb, the data model should be compatible with each other to do something like this. I am thinking about letting a central instance pull each influxdb instance data into the locally available opentsdb.
Any gotachs I have to be aware or anyone who can point me to available solutions / technical documentation / etc. so that I would not have to re-invent the wheel regarding this?
Or is my basic idea completely crap and if yes, why?
thx
Ralf