MongoDB Replica Set SECONDARY Stuck in `ROLLBACK` State

Chris Wilcox

unread,

May 16, 2012, 4:04:37 PM5/16/12

to mongod...@googlegroups.com

During a recent automated update of our mongodb `PRIMARY`, when the `PRIMARY` stepped down it permanently went into a `ROLLBACK` state.

After several hours in the `ROLLBACK` state, there was still no rollback `.bson` file in the `rollback` directory in the mongodb database directory. That, and also this line in our log file: `[rsSync] replSet syncThread: 13410 replSet too much data to roll back`, seems to indicate that the `ROLLBACK` process failed.

I would like some help analyzing what exactly went wrong.

- It appears two different rollbacks occurred in our logs. Is that the case or was it one that took 3 hours?

- If the first rollback (at 19:00 hours) was successful, why didnt anything appears in ou `rollback` directory?

- Any guess as to the cause of all those warnings? Could that be related to the rollback failure?

- Did we lose 18 seconds of data due to the first `ROLLBACK`?

- Is there a generic solution to the "stuck in `ROLLBACK` state" problem? We ended up having to hose our entire DB and re-sync from primary.

The relevant log lines are:

# Primary coming back after restart...

Tue May 15 19:01:01 [initandlisten] MongoDB starting : pid=3684 port=27017 dbpath=/var/lib/mongodb 64-bit host=magnesium

Tue May 15 19:01:01 [initandlisten] db version v2.0.5, pdfile version 4.5

# ... init stuff

Tue May 15 19:01:01 [initandlisten] journal dir=/var/lib/mongodb/journal

Tue May 15 19:01:01 [initandlisten] recover : no journal files present, no recovery needed

# ... More init stuff

Tue May 15 19:01:03 [rsStart] trying to contact rs1arb1.c9w.co:27017

Tue May 15 19:01:03 [rsStart] trying to contact rs1m2.c9w.co:27017

Tue May 15 19:01:03 [rsStart] replSet STARTUP2

Tue May 15 19:01:03 [rsHealthPoll] replSet member rs1arb1.c9w.co:27017 is up

Tue May 15 19:01:03 [rsHealthPoll] replSet member rs1arb1.c9w.co:27017 is now in state ARBITER

Tue May 15 19:01:03 [rsSync] replSet SECONDARY

Tue May 15 19:01:05 [rsHealthPoll] replSet member rs1m2.c9w.co:27017 is up

Tue May 15 19:01:05 [rsHealthPoll] replSet member rs1m2.c9w.co:27017 is now in state PRIMARY

Tue May 15 19:01:09 [rsSync] replSet syncing to: rs1m2.c9w.co:27017

Tue May 15 19:01:09 [rsSync] replSet our last op time written: May 15 19:00:51:6

Tue May 15 19:01:09 [rsSync] replSet rollback 0

Tue May 15 19:01:09 [rsSync] replSet ROLLBACK

Tue May 15 19:01:09 [rsSync] replSet rollback 1

Tue May 15 19:01:09 [rsSync] replSet rollback 2 FindCommonPoint

Tue May 15 19:01:09 [rsSync] replSet info rollback our last optime: May 15 19:00:51:6

Tue May 15 19:01:09 [rsSync] replSet info rollback their last optime: May 15 19:01:09:19

Tue May 15 19:01:09 [rsSync] replSet info rollback diff in end of log times: -18 seconds

Tue May 15 19:01:10 [rsSync] replSet WARNING ignoring op on rollback no _id TODO : nimbus.system.indexes { ts: Timestamp 1337108400000|17, h: 1628369028235805797, op: "i", ns: "nimbus.system.indexes", o: { unique: true, name: "pascalquery_ns_key_start_ts_keyvals", key: { __ns__: 1, _key: 1, start_ts: 1, _keyval.a: 1, _keyval.b: 1, _keyval.c: 1, _keyval.d: 1, _keyval.e: 1, _keyval.f: 1, _keyval.g: 1, _keyval.h: 1 }, ns: "nimbus.wifi_daily_series", background: true } }

# ...

# Then for several minutes there are similar warnings

# ...

Tue May 15 19:03:52 [rsSync] replSet WARNING ignoring op on rollback no _id TODO : nimbus.system.indexes { ts: Timestamp 1337097600000|204, h: -3526710968279064473, op: "i", ns: "nimbus.system.indexes", o: { unique: true, name: "pascalquery_ns_key_start_ts_keyvals", key: { __ns__: 1, _key: 1, start_ts: 1, _keyval.a: 1, _keyval.b: 1, _keyval.c: 1, _keyval.d: 1, _keyval.e: 1, _keyval.f: 1, _keyval.g: 1, _keyval.h: 1 }, ns: "nimbus.wifi_daily_series", background: true } }

Tue May 15 19:03:54 [rsSync] replSet rollback found matching events at May 15 15:59:13:181

Tue May 15 19:03:54 [rsSync] replSet rollback findcommonpoint scanned : 6472020

Tue May 15 19:03:54 [rsSync] replSet replSet rollback 3 fixup

Then later for some reason another rollback occurs...

Tue May 15 22:14:24 [rsSync] replSet rollback re-get objects: 13410 replSet too much data to roll back

Tue May 15 22:14:26 [rsSync] replSet syncThread: 13410 replSet too much data to roll back

Tue May 15 22:14:37 [rsSync] replSet syncing to: rs1m2.c9w.co:27017

Tue May 15 22:14:37 [rsSync] replSet syncThread: 13106 nextSafe(): { $err: "capped cursor overrun during query: local.oplog.rs", code: 13338 }

Tue May 15 22:14:48 [rsSync] replSet syncing to: rs1m2.c9w.co:27017

Tue May 15 22:15:30 [rsSync] replSet our last op time written: May 15 19:00:51:6

Tue May 15 22:15:30 [rsSync] replSet rollback 0

Tue May 15 22:15:30 [rsSync] replSet rollback 1

Tue May 15 22:15:30 [rsSync] replSet rollback 2 FindCommonPoint

Tue May 15 22:15:30 [rsSync] replSet info rollback our last optime: May 15 19:00:51:6

Tue May 15 22:15:30 [rsSync] replSet info rollback their last optime: May 15 22:15:30:9

Tue May 15 22:15:30 [rsSync] replSet info rollback diff in end of log times: -11679 seconds

# More warnings matching the above warnings

Tue May 15 22:17:30 [rsSync] replSet rollback found matching events at May 15 15:59:13:181

Tue May 15 22:17:30 [rsSync] replSet rollback findcommonpoint scanned : 7628640

Tue May 15 22:17:30 [rsSync] replSet replSet rollback 3 fixup

The only useful information on rollbacks I've found are these notes which dont address the "stuck in rollback situation".

http://www.mongodb.org/display/DOCS/Replica+Sets+-+Rollbacks

http://www.snailinaturtleneck.com/blog/2011/01/19/how-to-use-replica-set-rollbacks/

Eliot Horowitz

unread,

May 16, 2012, 5:02:08 PM5/16/12

to mongod...@googlegroups.com

the key message is this:

Tue May 15 22:14:24 [rsSync] replSet rollback re-get objects:
13410 replSet too much data to roll back

that means the delta between that node and the current primary is too
large to do an automated rollback.
I would recommend creating a copy of the old primaries data, then
wiping and doing a full resync.
you can then go look at the data and see if you want to merge.

> --
> You received this message because you are subscribed to the Google
> Groups "mongodb-user" group.
> To post to this group, send email to mongod...@googlegroups.com
> To unsubscribe from this group, send email to
> mongodb-user...@googlegroups.com
> See also the IRC channel -- freenode.net#mongodb

Chris Wilcox

unread,

May 16, 2012, 5:20:39 PM5/16/12

to mongod...@googlegroups.com

Elliot, thanks for your response,

What I'm trying to figure out is why the second ROLLBACK occurred at all.

It looks like the first ROLLBACK succeeded and the new primary had only been 18 seconds behind the old primary. Is that your understanding of the logs as well?

> mongodb-user+unsubscribe@googlegroups.com

Eliot Horowitz

unread,

May 16, 2012, 5:22:56 PM5/16/12

to mongod...@googlegroups.com

Can you send the replica stats?

db.printReplicationInfo()

>> > mongodb-user...@googlegroups.com

>> > See also the IRC channel -- freenode.net#mongodb
>

> --
> You received this message because you are subscribed to the Google
> Groups "mongodb-user" group.
> To post to this group, send email to mongod...@googlegroups.com
> To unsubscribe from this group, send email to

> mongodb-user...@googlegroups.com

Chris Wilcox

unread,

May 16, 2012, 5:34:45 PM5/16/12

to mongod...@googlegroups.com

We've already done a full re-sync but I still have the data from when we were initially debugging:

PRIMARY> db.printReplicationInfo()

configured oplog size: 12525.780078125MB

log length start to end: 194770secs (54.1hrs)

oplog first event time: Sun May 13 2012 19:49:50 GMT+0000 (UTC)

oplog last event time: Wed May 16 2012 01:56:00 GMT+0000 (UTC)

now: Wed May 16 2012 01:56:00 GMT+0000 (UTC)

PRIMARY> rs.status()

{

"set" : "rs1",

"date" : ISODate("2012-05-16T03:09:48Z"),

"myState" : 1,

"members" : [

{

"_id" : 0,

"name" : "rs1m1.c9w.co:27017",

"health" : 1,

"state" : 9,

"stateStr" : "ROLLBACK",

"uptime" : 10657,

"optime" : {

"t" : 1337108451000,

"i" : 6

},

"optimeDate" : ISODate("2012-05-15T19:00:51Z"),

"lastHeartbeat" : ISODate("2012-05-16T03:09:47Z"),

"pingMs" : 0

},

{

"_id" : 1,

"name" : "rs1m2.c9w.co:27017",

"health" : 1,

"state" : 1,

"stateStr" : "PRIMARY",

"optime" : {

"t" : 1337137788000,

"i" : 56

},

"optimeDate" : ISODate("2012-05-16T03:09:48Z"),

"self" : true

},

{

"_id" : 2,

"name" : "rs1arb1.c9w.co:27017",

"health" : 1,

"state" : 7,

"stateStr" : "ARBITER",

"uptime" : 47329,

"optime" : {

"t" : 0,

"i" : 0

},

"optimeDate" : ISODate("1970-01-01T00:00:00Z"),

"lastHeartbeat" : ISODate("2012-05-16T03:09:48Z"),

"pingMs" : 0

}

],

"ok" : 1

}

>> > mongodb-user+unsubscribe@googlegroups.com

>> > See also the IRC channel -- freenode.net#mongodb
>
> --
> You received this message because you are subscribed to the Google
> Groups "mongodb-user" group.
> To post to this group, send email to mongod...@googlegroups.com
> To unsubscribe from this group, send email to

> mongodb-user+unsubscribe@googlegroups.com

Eliot Horowitz

unread,

May 16, 2012, 9:10:35 PM5/16/12

to mongod...@googlegroups.com

Can you send the log from the interim?
Is there any clock skew on those 2 machines?

>> >> > mongodb-user...@googlegroups.com

>> >> > See also the IRC channel -- freenode.net#mongodb
>> >
>> > --
>> > You received this message because you are subscribed to the Google
>> > Groups "mongodb-user" group.
>> > To post to this group, send email to mongod...@googlegroups.com
>> > To unsubscribe from this group, send email to

>> > mongodb-user...@googlegroups.com

>> > See also the IRC channel -- freenode.net#mongodb
>
> --
> You received this message because you are subscribed to the Google
> Groups "mongodb-user" group.
> To post to this group, send email to mongod...@googlegroups.com
> To unsubscribe from this group, send email to

> mongodb-user...@googlegroups.com

Reply all

Reply to author

Forward