Hello again, beauty community!
I'm really fascinated with KairosDB and this new world of big data and data mining that I'm discovering. I've some very basic questions about bringing KairosDB for production that I hope can help more people in the future. Anyway, I'll write a tutorial about that when I get it, so hope you can help me helping you (in the future :p ). As I said before, a article about Spark-KairosDB integration is on the way.
I want to make a distributed database using Cassandra + KairosDB and I don't have any idea on how to do that. Currently I've the following setup:
- Two home computers running over Ubuntu 14.04.
- Each computer has KairosDB and Cassandra up and running. I already made some tests and everything is okay.
- To simplify the question I've cleaned the KairosDB keyspace from both computers (actually, deleted and created a new keyspace. It's valid to note that I've data for populate a database with more than 1 million point values).
Far I know, in order to create a distributed database, I should share the same database with all my nodes; that's right? Should I create a KairosDB instance only for reads and another for writes? If so, how can I achieve that? Should I configure Cassandra and KairosDB or just configuring Cassandra is okay?
Please, be patient and try to make this newbie understand the concepts behind a distributed KairosDB database. Of course, this will become a newbie #101 guide in the future :)
Thank you!
Fernando Paladini.