On Thursday, 19 November 2015 02:07:53 UTC+11, Darshan Shah wrote:
I have a 10 shard cluster with each shard being a 3 member replicaset.
Is it possible to setup a (possibly delayed) slave having all the data from all the shards?
Hi Darshan,
What version of MongoDB are you using? Are you currently having any issues running your reporting queries against the existing sharded cluster, or are you planning for future scaling or usage segregation?
A node can only be a member of a single replica set. If you want to sync all the data from a sharded cluster to a separate deployment you will need to look at a sync solution such as mongo-connector.
Given you have 10 shards, that sounds like a potentially significant challenge to replicate to a single server.
This can be used for running long queries for reporting purposes
Depending on your reporting requirements, there may be better ways to approach long queries. For example, common approaches include using pre-aggregated reports and incremental data updates to reduce unnecessary re-aggregation of data.
and possibly may also serve as a backup
Each of your replica sets already provides data redundancy and failover. However, having a full copy of the data does not provide a backup strategy in the event you need to restore data from a previous point in time. I would look into a more complete backup solution like MongoDB Cloud Manager, which can take cluster-wide snapshots based on data retention policies (i.e. how often to capture snapshots and how long to store daily/weekly/monthly snapshots).
Regards,
Stephen
I finally got around to checking out Mongo-Connector - it is very good.However, it is very heavy in sense that it takes quite some time to do the initial sync and requires a target MongoDb (possibly replicaset) instance.Is there any other way to get a continuous hot backup from MongoDb other than the File System Backup as mentioned in the MongoDb backup of sharded cluster with filesystem snapshots ?