Account Options

  1. Sign in
The old Google Groups will be going away soon, but your browser is incompatible with the new version.
Google Groups Home
« Groups Home
OOM on secondary re-sync, bgsync buffer consumes all ram and swap
There are currently too many topics in this group that display first. To make this topic appear first, remove this option from another topic.
There was an error processing your request. Please try again.
flag
  4 messages - Collapse all  -  Translate all to Translated (View all originals)
The group you are posting to is a Usenet group. Messages posted to this group will make your email address visible to anyone on the Internet.
Your reply message has not been sent.
Your post was successful
 
From:
To:
Cc:
Followup To:
Add Cc | Add Followup-to | Edit Subject
Subject:
Validation:
For verification purposes please type the characters you see in the picture below or the numbers you hear by clicking the accessibility icon. Listen and type the numbers you hear
 
Brent Miller  
View profile  
 More options Oct 11 2012, 4:10 pm
From: Brent Miller <brentalanmil...@gmail.com>
Date: Thu, 11 Oct 2012 13:10:51 -0700 (PDT)
Local: Thurs, Oct 11 2012 4:10 pm
Subject: OOM on secondary re-sync, bgsync buffer consumes all ram and swap

We're running into this issue where we cannot re-sync our secondaries after
taking them off line for more than 10 minutes or so. When we take try to
bring a secondary back on-line it will consume all ram and swap and then
crash.

When we start mongod with -vv, the log files on the crashed secondary is
filled with:

Thu Oct 11 12:38:22 [rsBackgroundSync] bgsync buffer has 5171004922 bytes
Thu Oct 11 12:38:22 [rsBackgroundSync] bgsync buffer has 5171152019 bytes
Thu Oct 11 12:38:22 [rsBackgroundSync] bgsync buffer has 5171257666 bytes
Thu Oct 11 12:38:22 [rsBackgroundSync] bgsync buffer has 5171402953 bytes
Thu Oct 11 12:38:22 [rsBackgroundSync] bgsync buffer has 5171534548 bytes
Thu Oct 11 12:38:22 [rsBackgroundSync] bgsync buffer has 5171644793 bytes
Thu Oct 11 12:38:22 [rsBackgroundSync] bgsync buffer has 5171751196 bytes
Thu Oct 11 12:38:22 [rsBackgroundSync] bgsync buffer has 5171896415 bytes
Thu Oct 11 12:38:22 [rsBackgroundSync] bgsync buffer has 5172027296 bytes
Thu Oct 11 12:38:23 [rsBackgroundSync] bgsync buffer has 5172138255 bytes

The primary is running 2.2.0, Ubuntu 12.04 with 48G ram. We have two
secondaries that are running 2.2.0, Ubuntu 12.04, but only have 4G of ram.
(We see the same behavior when running 2.2.1rc0 too.)

Is there a reason why the bgsync buffer is growing so large? Is there a way
to minimize this?

The rs.config() is as follows:
{
"_id" : "apm",
"version" : 27,
"members" : [
{
"_id" : 0,
"host" : "apc:27017",
"priority" : 2

},

{
"_id" : 3,
"host" : "apm-dev:27017"
},

{
"_id" : 4,
"host" : "apm:27017"

}
]
}

Thanks,
Brent

 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Dwight Merriman  
View profile  
 More options Oct 11 2012, 10:50 pm
From: Dwight Merriman <dwi...@10gen.com>
Date: Thu, 11 Oct 2012 19:50:09 -0700 (PDT)
Local: Thurs, Oct 11 2012 10:50 pm
Subject: Re: OOM on secondary re-sync, bgsync buffer consumes all ram and swap

are you running 64 bit or 32 bit?
did you configure --oplogsize on the command line (the default behaves well
usually so the question is mainly if it is non-default)?
if you can post rs.status() that would be helpful.


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Brent Miller  
View profile  
 More options Oct 12 2012, 2:35 pm
From: Brent Miller <brentalanmil...@gmail.com>
Date: Fri, 12 Oct 2012 11:35:10 -0700 (PDT)
Local: Fri, Oct 12 2012 2:35 pm
Subject: Re: OOM on secondary re-sync, bgsync buffer consumes all ram and swap

All machines are 64bit, and yeah, we've configured the oplog manually to
32G (oplogSize = 32768) This was due to the fact that when we first started
testing, we only partitioned out 256G of disk for Mongo, and found that the
oplog ended up being too small to add new secondaries. (It might be worth
noting that we have a fairly write-heavy load and we're making a fair
number of updates to documents that are on the order of a megabyte.)

In order to keep the replica set up, I've had to demote one of the
secondaries to be an arbiter and just disabled the other, so I don't know
if rs.status() would be any help at the moment. I can start a re-sync later
on today if it will be helpful. However, once we get the secondaries
through their inital sync, their oplog will trail the primary anywhere from
1 - 60 seconds.

Thanks,
Brent


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Dwight Merriman  
View profile  
 More options Oct 13 2012, 10:20 am
From: Dwight Merriman <dwi...@10gen.com>
Date: Sat, 13 Oct 2012 07:20:40 -0700 (PDT)
Local: Sat, Oct 13 2012 10:20 am
Subject: Re: OOM on secondary re-sync, bgsync buffer consumes all ram and swap

please run this in the shell and post the result?

db.printReplicationInfo()


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
End of messages
« Back to Discussions « Newer topic     Older topic »