Your oplog size is 10 megabytes, which is very much on the small side.
Depending on the size of your operations (i.e. inserts and updates)
this may only be able to hold a few operations before it "rolls over,"
which could cause this RS102 error. Did you set it to so small a size
on purpose?
As far as getting back to a healthy state, if there truly is no data
in any of the databases, you can probably just stop mongod on port
27100, remove the data files from the data path used by that mongod,
then restart it. It will reconnect to the other replica set members
and resync any data that there is.
- Dan
On May 4, 5:58 pm, Anil Lingamallu <
lsa...@gmail.com> wrote:
> I am hitting an issue when a creating a replicaset to be used as a Shrad. I
> started replicaset with 3 replicas. I am creating this replicaset for the
> first time. One of the replicas is in "Recovering" state forever. The error
> message shown when I type rs.status() on shell is "error RS102 too stale to
> catch up". I read that if oplogsize is less, I get this error when one of
> the replica comes backup after lot of writes. I do not understand how can
> be a replicaset be stale when there is no data (apart from Mongo initial
> data). Not sure if I am missing anything obvious. This does not happen all
> the times. Please clarify.
>
> The version I am using is 2.1.1-pre- downloaded fromhttp://
www.mongodb.org/display/DOCS/MongoDB+on+Azure.
>
> Attaching the zipped log files of three mongod instances.
>
> Here is output of *db.printReplicationInfo()*
>
> Shard1:RECOVERING> db.printReplicationInfo()
> configured oplog size: 10MB
> log length start to end: 0secs (0hrs)
> oplog first event time: Fri May 04 2012 13:59:32 GMT-0700 (Pacific
> Daylight Time)
> oplog last event time: Fri May 04 2012 13:59:32 GMT-0700 (Pacific
> Daylight Time)
> now: Fri May 04 2012 14:39:37 GMT-0700 (Pacific
> Daylight Time)
>
> Here is the output of *rs.status()*
> Logs.zip
> 229KViewDownload