Account Options

  1. Sign in
The old Google Groups will be going away soon, but your browser is incompatible with the new version.
Google Groups Home
« Groups Home
Is it making progress? and should we wait?
There are currently too many topics in this group that display first. To make this topic appear first, remove this option from another topic.
There was an error processing your request. Please try again.
flag
  1 message - Collapse all  -  Translate all to Translated (View all originals)
The group you are posting to is a Usenet group. Messages posted to this group will make your email address visible to anyone on the Internet.
Your reply message has not been sent.
Your post was successful
 
From:
To:
Cc:
Followup To:
Add Cc | Add Followup-to | Edit Subject
Subject:
Validation:
For verification purposes please type the characters you see in the picture below or the numbers you hear by clicking the accessibility icon. Listen and type the numbers you hear
 
Steve  
View profile  
 More options Nov 12 2012, 10:19 pm
From: Steve <fbkt...@gmail.com>
Date: Mon, 12 Nov 2012 19:19:02 -0800 (PST)
Local: Mon, Nov 12 2012 10:19 pm
Subject: [Rollback State] Is it making progress? and should we wait?

Hi,

There's situation in our mongodb cluster. In one of the replica set (say
machine A, B), B was down due to server failure and required repair.
Unfortunately, right before B was done repairing, A was down due to
segfault:

Mon Nov 12 16:37:24 [conn233105274]   Uncaught std::exception:
St9bad_alloc, terminating
Mon Nov 12 16:37:24 dbexit:
Mon Nov 12 16:37:24 Backtrace:
0x8ad399 0x8ad970 0x367ee0eb70 0x2279a90
 mongod(_ZN5mongo10abruptQuitEi+0x399) [0x8ad399]
 mongod(_ZN5mongo24abruptQuitWithAddrSignalEiP7siginfoPv+0x220) [0x8ad970]
 /lib64/libpthread.so.0 [0x367ee0eb70]
 [0x2279a90]

Then, we tried to restart A, while B was master. At that moment, A goes
into ROLLBACK state as expected.

From mongostat, we see "UNK" for A; "M" for B.

From the log, we see some:

Mon Nov 12 17:53:18 [replica set sync] replSet info rollback of
renameCollection is slow in this version of mongod
Mon Nov 12 17:53:18 [replica set sync] replSet WARNING ignoring op on
rollback no _id TODO : xs.system.indexes { ts: Timestamp 1351583022000|218,
h: 5274814664110145128, op: "i", ns: "xs.system.indexes", o: { ns:
"xs.tmp.mr.profile_tmp.mrs.profile_1351583022_86493_1139205_inc", key: { 0:
1 }, name: "0_1", v: 0 } }

However, for quiet sometime, we only see:

Mon Nov 12 18:41:04 [initandlisten] connection accepted from
10.28.120.169:44420 #154
Mon Nov 12 18:41:04 [conn154] end connection 10.28.120.169:44420
Mon Nov 12 18:41:16 [conn153] end connection 10.28.6.91:55265
Mon Nov 12 18:41:16 [initandlisten] connection accepted from
10.28.6.91:55271 #155

and now more ROLLBACK logs. And we have also checked the currentOp, and it
does show things like below

{
"opid" : "rs_c:1265572266",
"active" : false,
"waitingForLock" : false,
"op" : "none",
"ns" : "?xs.profile",
"query" : {
 },
"client_s" : "(NONE)",
"desc" : "replica set sync"

},

So, is the system still in the process of rolling back? or it's stuck?

*The DB cluster is v1.8.3

Thanks.


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
End of messages
« Back to Discussions « Newer topic     Older topic »