We keep getting the dreaded "error RS102 too stale to catch up" - and
it's driving me nuts. Installation is backing a wiki site so it's
* Why did the SECONDARYs keep their status as SECONDARY - isn't
that misleading? According to their optime, they hadn't taken any
updates from the PRIMARY for almost 3 days! (yes, I need to fix our
monitoring)
* The SECONDARY's status changed to RECOVERING with errmsg "error
RS102 too stale to catch up" only after restarting the PRIMARY - why
is that? Shouldn't the secondaries have detected their stale condition
without me having to kick it?
* Any ideas on how I can further debug the stoppage in the first
place? Do I need to do a custom build of mongod with some debugging
options enabled?
Cheers
- Paul Harvey
--
You received this message because you are subscribed to the Google Groups "mongodb-user" group.
To post to this group, send email to mongod...@googlegroups.com.
To unsubscribe from this group, send email to mongodb-user...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/mongodb-user?hl=en.
1.2GB, 1.7GB, 2.2GB. We get very few updates, except during the bulk-
> How big is your oplog? You can run db.printReplicationInfo() to get the
> details from each node.
load process - in the 5 hours leading up to the "stoppage" there were
only 82 save operations. Surely an > 1GB oplog is big enough to cope
with 16 saves per hour? :)
Fair enough.
> No, it needs manual intervention instead of putting lots of load on the
> other replicas. When it gets so stale that is require a resync or such then
> it is a admin tasks since different solution affect the other replicas in
> other ways.
Thanks, I'll set that up now.
> You can run with -vvvvv to get more logging.
No, the db names are filtered to replace such characters with
> Does you database name have "." in it? What operating system/fs are you
> using?
underscores.
Ubuntu 9.04 (ext3) and 10.04 LTS (ext4). Filesystems have ~50% space
free.
- Paul
--
let me begin by saying, I'm not using $where clauses most of the time -
though I'm sure there are more that can be removed.
I didn't see any docco suggesting that there might be a limit to the
number of $where clauses that could be run before mongodb crashes, so I
have focused more on completeness (we're still not able to generate the
full set of foswiki queries into MongoDB).
I have specifically coded $where clauses when there are multiple and
clauses on the same key (yup, users do that), when I need to query
multiple documents (getSisterDB type stuff) or when there's a mismatch
between how foswiki (and its perl oriented evaluation) and
javascript/mongo think about things, and various other little corners of
weirdness on both sides.
the remainder is when the generator gets confused (or when someone
changes the parse tree its generated from and I've not yet updated
enough code to make a 'good' mongodb query.
Its worth noting that using _only_ $where queries is already so much of
a speed improvement for foswiki, that converting those to native queries
is almost insignificant - if it wheren't for these crashes.
(I do love the $where: '(1)' querys that I'm generating :( )
I guess the question from my perspective (and wrt to getting the foswiki
on mongodb project functioning reliably) is should I focus on completing
the foswiki query to mongodb converter, or should I drop that, and work
on this memory leak in monogdb? Both are show stoppers for us :/ as
MongoDB's query language is simply not expressive enough to avoid $where
in the complex queries.
Sven
- --
________________________________________
Professional Wiki Innovation and Support
Enterprise Support Contracts for Foswiki
Sven Dowideit http://fosiki.com
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
iEYEARECAAYFAk3a+roACgkQPAwzu0QrW+mR5wCfbn/G4lKFnsw81Xz8G8WcMuD4
cDUAnjY8CprPuhdd0MQrww2b63aQ9Tda
=6lvy
-----END PGP SIGNATURE-----
mmm, wow -
Nat, what are suggesting? that the MonogDB JS engine out of memory gets
reset if we close and reopen the connection?
if that really is an option, can't we just make our fastCGI backends
have a shorter life?
Sven
On 24/05/11 14:04, Nat wrote:
> Paul,
>
> If you restart only apache but not mongod, did it help at all?
>
> --
> You received this message because you are subscribed to the Google
> Groups "mongodb-user" group.
> To post to this group, send email to mongod...@googlegroups.com.
> To unsubscribe from this group, send email to
> mongodb-user...@googlegroups.com.
> For more options, visit this group at
> http://groups.google.com/group/mongodb-user?hl=en.
- --
________________________________________
Professional Wiki Innovation and Support
Enterprise Support Contracts for Foswiki
Sven Dowideit http://fosiki.com
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
iEYEARECAAYFAk3bOocACgkQPAwzu0QrW+lQHgCgnP+rtue4fsybcKCVdr0Li0b7
M14AoKdYm2RktQUBThkzKgiyom8XOjgY
=TaRq
-----END PGP SIGNATURE-----