Hello
We're running MongoDB 1.8.1 in our production environment, with 8
shards, 1 config database, 1 mongos, and no replication. After we wrote
some data to all the shards (there were a few chunks per shard), the
mongos process crashed with the following errors:
Received signal 11
Backtrace: 0x52e235 0x2aaaab6c6040 0x2aaaaaf8e58c 0x621403 0x69afab 0x576ba6 0x5774b6 0x575630 0x575886 0x583914 0x586464 0x66c41a 0x6362c9 0x66432c 0x6761c7 0x57ea3c 0x69ec30 0x2aaaaacd33ba 0x2aaaab778fcd
bin/mongos(_ZN5mongo17printStackAndExitEi+0x75)[0x52e235]
/lib/libc.so.6[0x2aaaab6c6040]
/usr/lib/libstdc++.so.6(_ZNSs6assignERKSs+0x1c)[0x2aaaaaf8e58c]
bin/mongos(_ZN5mongo5Shard5resetERKSs+0x93)[0x621403]
bin/mongos[0x69afab]
bin/mongos(_ZN5boost6detail8function17function_invoker4IPFbRN5mongo12DBClientBaseERKSsbiEbS5_S7_biE6invokeERNS1_15function_bufferES5_S7_bi+0x16)[0x576ba6]
bin/mongos(_ZN5mongo17ClientConnections13checkVersionsERKSs+0x1c6)[0x5774b6]
bin/mongos(_ZN5mongo15ShardConnection5_initEv+0x2d0)[0x575630]
bin/mongos(_ZN5mongo15ShardConnectionC1ERKSsS2_+0x76)[0x575886]
bin/mongos(_ZN5mongo15ClusteredCursor5queryERKSsiNS_7BSONObjEi+0x124)[0x583914]
bin/mongos(_ZN5mongo27SerialServerClusteredCursor4moreEv+0x134)[0x586464]
bin/mongos(_ZN5mongo19ShardedClientCursor13sendNextBatchERNS_7RequestEi+0x8a)[0x66c41a]
bin/mongos(_ZN5mongo13ShardStrategy7queryOpERNS_7RequestE+0xe39)[0x6362c9]
bin/mongos(_ZN5mongo7Request7processEi+0x29c)[0x66432c]
bin/mongos(_ZN5mongo21ShardedMessageHandler7processERNS_7MessageEPNS_21AbstractMessagingPortEPNS_9LastErrorE+0x77)[0x6761c7]
bin/mongos(_ZN5mongo3pms9threadRunEPNS_13MessagingPortE+0x34c)[0x57ea3c]
bin/mongos(thread_proxy+0x80)[0x69ec30]
/lib/libpthread.so.0[0x2aaaaacd33ba]
/lib/libc.so.6(clone+0x6d)[0x2aaaab778fcd]
===
Fri May 13 19:05:17 ERROR: couldn't unset sharding : std::bad_alloc
Fri May 13 19:05:17 ERROR: couldn't unset sharding : std::bad_alloc
Fri May 13 19:05:17 ERROR: couldn't unset sharding : std::bad_alloc
The mongos crashed at 19:05:17, right after the last of the
"couldn't unset sharding" errors above. The mongos.log file also
contains a few other errors, similar to the following one from earlier
the same day:
Fri May 13 09:53:40 [WriteBackListener] ~ScopedDBConnection: _conn != null
Fri May 13 09:53:40 [WriteBackListener] ERROR: error processing writeback: 10429 setShardVersion failed host[someip.compute-1.amazonaws.com:27022] { oldVersion: Timestamp 1633000|1, ns: "ubervu.mentions", newVersion: Timestamp 19000|1, globalVersion: Timestamp 19000|0, errmsg: "you already have a newer version of collection 'ubervu.mentions'", ok: 0.0 }
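In case it helps anyone reading the backtrace, here is a rough Python
sketch that splits each stack frame line into module, symbol, offset and
address. It is not part of our setup. The names FRAME_RE, Frame and
parse_frame are made up for illustration, and it assumes every frame
sits on a single line in the module(symbol+0xOFFSET)[0xADDRESS] layout
seen in the trace.

```python
import re
from typing import NamedTuple, Optional

# Hypothetical helper, not part of MongoDB: matches frame lines shaped like
#   module(symbol+0xOFFSET)[0xADDRESS]
# The "(symbol+0xOFFSET)" part is optional, e.g. "bin/mongos[0x69afab]".
FRAME_RE = re.compile(
    r"^(?P<module>[^\s(\[]+)"
    r"(?:\((?P<symbol>[^+)]*)(?:\+(?P<offset>0x[0-9a-fA-F]+))?\))?"
    r"\[(?P<address>0x[0-9a-fA-F]+)\]$"
)


class Frame(NamedTuple):
    module: str
    symbol: Optional[str]   # mangled name, or None if the frame has no symbol
    offset: Optional[int]   # byte offset inside the symbol, if present
    address: int            # absolute return address


def parse_frame(line: str) -> Optional[Frame]:
    """Return a Frame for a stack frame line, or None for any other line."""
    match = FRAME_RE.match(line.strip())
    if match is None:
        return None
    offset = match.group("offset")
    return Frame(
        module=match.group("module"),
        symbol=match.group("symbol") or None,
        offset=int(offset, 16) if offset else None,
        address=int(match.group("address"), 16),
    )


if __name__ == "__main__":
    sample = "bin/mongos(_ZN5mongo5Shard5resetERKSs+0x93)[0x621403]"
    print(parse_frame(sample))
```

The symbol field is still a mangled C++ name. It can be fed to a
demangler such as c++filt from binutils, if that is installed. The frame
just below the libstdc++ string assignment, for example, is
mongo::Shard::reset taking a const std::string reference.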
Any ideas why this might have happened?
Thank you,
Mihnea @ uberVU