I'm looking at using Terrastore/ElasticSearch as a datastore for a
realtime news aggregation server. Everything I've read so far here
looks like it should be good for my needs, but I keep running across
this slide and it worries me:
http://nosql.mypopescu.com/post/1403577624/terrastore-sweet-spot
Does recommending it for "throw-away data" mean that it is not ideal
as a permanent datastore for some reason? Are there any drawbacks to
using Terrastore as a database replacement in a production environment
(versus, say, MongoDB)? Right now we are storing more than 16 GB of
news stories, and we plan to grow that significantly.

I also have a question on cluster configuration: our current
configuration has one master server that feeds the datastore, and
multiple slaves that are query-only. We have two physical locations,
which have high latency between them. Is there a way to configure
two independent clusters fed by the same master? (i.e., each physical
location's cluster should have a full set of data to avoid an
expensive trip to the other location.)
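
To make the intended topology concrete, here is a rough Python sketch of what I have in mind. The Cluster and Master classes are hypothetical in-memory stand-ins that I made up for illustration; they are not Terrastore or ElasticSearch APIs, and the site names and document fields are assumptions as well. The only point is that the master sends every write to both locations, so each cluster can answer queries from its own full copy.

```python
from typing import Dict, List


class Cluster:
    """Hypothetical stand-in for one location's cluster (not a real client API)."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.docs: Dict[str, dict] = {}  # in-memory placeholder for the datastore

    def put(self, key: str, doc: dict) -> None:
        # Store a private copy so the clusters stay independent of each other.
        self.docs[key] = dict(doc)

    def get(self, key: str) -> dict:
        # Queries are served locally, with no trip to the other location.
        return self.docs[key]


class Master:
    """Single feeder that sends every write to all clusters."""

    def __init__(self, clusters: List[Cluster]) -> None:
        self.clusters = clusters

    def feed(self, key: str, doc: dict) -> None:
        for cluster in self.clusters:
            cluster.put(key, doc)


# Two independent clusters, one per physical location, fed by the same master.
site_a = Cluster("site-a")
site_b = Cluster("site-b")
master = Master([site_a, site_b])
master.feed("story-1", {"title": "Example headline"})
```

With this layout a query at either location reads only from its local cluster. What I don't know is whether the datastore can do this fan-out itself, or whether the feeder has to write to each cluster separately as in the sketch.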