2014-11-12T15:39:09.836-0600 [repl writer worker 1] ERROR: writer worker caught exception: :: caused by :: 11000 insertDocument :: caused by :: 11000 E11000 duplicate key error index: somedatabase.colname.$field_1_morefield_1 dup key: { : "NAME/141119", : "YOLO" } on: { ts: Timestamp 1415827255000|1, h: -7641148999367123021, v: 2, op: "i", ns: "somedatabase.colname", o: { _id: ObjectId('5463cf37e7c5240e5f096dea'), morefield: "YOLO", field: "NAME/141119", server: [ "SERVERFORME038" ] } }
2014-11-12T15:39:09.836-0600 [repl writer worker 1] Fatal Assertion 16360
2014-11-12T15:39:09.836-0600 [repl writer worker 1]
***aborting after fassert() failure
Yes they elected a new primary but it won't let me bring up the crashed node as a secondary. Any time I start up that node (with the data left alone) it just crashes again. It only lets me bring up the node if I get rid of all the data files for the database that was throwing the error, which seems like a terrible solution and one I should only use if there are no other options, because then I lose all the data from that node that didn't get to sync.
--
You received this message because you are subscribed to the Google Groups "mongodb-user"
group.
For other MongoDB technical support options, see: http://www.mongodb.org/about/support/.
---
You received this message because you are subscribed to a topic in the Google Groups "mongodb-user" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/mongodb-user/tWquUQCvEX4/unsubscribe.
To unsubscribe from this group and all its topics, send an email to mongodb-user...@googlegroups.com.
To post to this group, send email to mongod...@googlegroups.com.
Visit this group at http://groups.google.com/group/mongodb-user.
To view this discussion on the web visit https://groups.google.com/d/msgid/mongodb-user/214a1805-3213-471b-9211-15700eefde6f%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
I should add that in my replica set, I have a primary, secondary and arbiter. So only 2 nodes have data.
This was the original cause of the crash when it was the primary node.
# database
dbpath = /opt/mongodata
fork = true
# logfile
logpath = /var/log/mongodb/mongod.log
logappend = true
# net
rest = true
httpinterface = true
# Apply special variable options if they exist
# replication
replSet = ats-Production
# authentication
keyFile = /etc/mongodb.key
# authorization
auth = true
2014-11-12T15:14:36.957-0600 [rsHealthPoll] replset info MONGOSERVER2:27017 thinks that we are down
2014-11-12T15:14:36.957-0600 [rsHealthPoll] replSet member MONGOSERVER2:27017 is now in state PRIMARY
2014-11-12T15:14:36.958-0600 [rsMgr] stepping down; another primary was elected more recently
2014-11-12T15:14:36.964-0600 [rsMgr] replSet relinquishing primary state
2014-11-12T15:14:36.964-0600 [rsMgr] replSet SECONDARY
2014-11-12T15:14:36.964-0600 [rsMgr] replSet closing client sockets after relinquishing primary
2014-11-12T15:14:38.235-0600 [rsBackgroundSync] replSet our last op time fetched: Nov 12 15:14:36:9
2014-11-12T15:14:38.235-0600 [rsBackgroundSync] replset source's GTE: Nov 12 15:14:38:1
2014-11-12T15:14:38.256-0600 [rsBackgroundSync] replSet rollback 0
2014-11-12T15:14:38.256-0600 [rsBackgroundSync] replSet ROLLBACK
2014-11-12T15:14:38.256-0600 [rsBackgroundSync] replSet rollback 1
2014-11-12T15:14:38.256-0600 [rsBackgroundSync] replSet rollback 2 FindCommonPoint
2014-11-12T15:14:38.265-0600 [rsBackgroundSync] replSet info rollback our last optime: Nov 12 15:14:36:9
2014-11-12T15:14:38.265-0600 [rsBackgroundSync] replSet info rollback their last optime: Nov 12 15:14:38:2
2014-11-12T15:14:38.265-0600 [rsBackgroundSync] replSet info rollback diff in end of log times: -2 seconds
2014-11-12T15:14:38.270-0600 [rsBackgroundSync] replSet rollback found matching events at Nov 12 15:14:24:9
2014-11-12T15:14:38.270-0600 [rsBackgroundSync] replSet rollback findcommonpoint scanned : 146
2014-11-12T15:14:38.270-0600 [rsBackgroundSync] replSet replSet rollback 3 fixup
2014-11-12T15:14:38.289-0600 [rsBackgroundSync] replSet rollback 3.5
2014-11-12T15:14:38.289-0600 [rsBackgroundSync] replSet rollback 4 n:77
2014-11-12T15:14:38.289-0600 [rsBackgroundSync] replSet minvalid=Nov 12 15:14:38 5463cdbe:2
2014-11-12T15:14:38.289-0600 [rsBackgroundSync] replSet rollback 4.6
2014-11-12T15:14:38.289-0600 [rsBackgroundSync] replSet rollback 4.7
2014-11-12T15:14:38.342-0600 [rsBackgroundSync] replSet exception in rollback ns: somedatabase.colname { _id: ObjectId('53fdcaa3051819f15fea2025') } 11000 E11000 duplicate key error index: somedatabase.colname.$field_1_morefield_1 dup key: { : null, : null } ndeletes:0
2014-11-12T15:14:38.347-0600 [rsBackgroundSync] replSet rollback 5 d:44 u:55
2014-11-12T15:14:38.351-0600 [rsBackgroundSync] replSet rollback 6
2014-11-12T15:14:38.355-0600 [rsBackgroundSync] replSet rollback 7
2014-11-12T15:14:38.359-0600 [rsBackgroundSync] replSet issues during syncRollback, see log
2014-11-12T15:14:38.359-0600 [rsBackgroundSync] replSet RECOVERING
2014-11-12T15:14:38.359-0600 [rsBackgroundSync] replSet syncing to: MONGOSERVER2:27017
2014-11-12T15:14:38.360-0600 [rsBackgroundSync] replset setting syncSourceFeedback to MONGOSERVER2:27017
2014-11-12T15:20:55.480-0600 [repl writer worker 1] ERROR: writer worker caught exception: :: caused by :: 11000 insertDocument :: caused by :: 11000 E11000 duplicate key error index: somedatabase.colname.$field_1_morefield_1 dup key: { : "NAME/141119", : "YOLO" } on: { ts: Timestamp 1415827255000|1, h: -7641148999367123021, v: 2, op: "i", ns: "somedatabase.colname", o: { _id: ObjectId('5463cf37e7c5240e5f096dea'), exchange: "YOLO", helixSymbol: "NAME/141119", orders: [ "SERVERFORME38" ] } }
2014-11-12T15:20:55.480-0600 [repl writer worker 1] Fatal Assertion 16360
2014-11-12T15:20:55.480-0600 [repl writer worker 1]
***aborting after fassert() failure
I'm not sure what you are referring to as data you didn't get to sync. The error is about a record that is a duplicate, which means it doesn't belong, i.e. it shouldn't exist on this node.
What was the original cause of the crash of this machine when it was primary? How was it configured: all default options?
Asya
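To see which existing record the failing oplog insert collides with, it can help to pull the index name and key values out of the E11000 line and turn them into a query. Below is a minimal Python sketch of that idea, not an official tool. The helper name `parse_dup_key` is made up for illustration, and it assumes the index still has its auto-generated name (`field_1_morefield_1`), so the field names can be recovered from it.

```python
import re

# Matches the duplicate-key part of a MongoDB 2.6-style log line, e.g.
#   E11000 duplicate key error index: db.coll.$idx_name dup key: { : "a", : "b" }
E11000_RE = re.compile(
    r"E11000 duplicate key error index: "
    r"(?P<ns>\S+)\.\$(?P<index>\S+)\s+dup key: \{(?P<key>.*?)\}"
)


def parse_dup_key(line):
    """Return namespace, index name and a query filter for one E11000 log line.

    Returns None when the line contains no E11000 message.
    """
    match = E11000_RE.search(line)
    if match is None:
        return None

    # ASSUMPTION: the index kept its auto-generated name
    # (<field>_<direction>_<field>_<direction>...), so the field names
    # can be read back out of the index name.
    pairs = re.findall(r"(.+?)_(-?1)(?:_|$)", match.group("index"))
    fields = [name for name, _direction in pairs]

    # Key values are logged without field names: ` : "NAME/141119", : "YOLO" `.
    raw_values = re.findall(r':\s*("[^"]*"|[^,\s]+)', match.group("key"))
    values = []
    for raw in raw_values:
        if raw.startswith('"'):
            values.append(raw[1:-1])  # quoted string
        elif raw == "null":
            values.append(None)       # field is null or missing
        else:
            values.append(raw)        # other types are left as raw text

    return {
        "ns": match.group("ns"),
        "index": match.group("index"),
        "filter": dict(zip(fields, values)),
    }


if __name__ == "__main__":
    sample = (
        "E11000 duplicate key error index: "
        "somedatabase.colname.$field_1_morefield_1 "
        'dup key: { : "NAME/141119", : "YOLO" }'
    )
    print(parse_dup_key(sample))
```

The resulting filter can then be run with `db.colname.find(...)` in the mongo shell on each data-bearing member, to compare the `_id` of the document already stored there with the `_id` in the oplog entry.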
On Thursday, November 13, 2014, rhea wrote:
Yes they elected a new primary but it won't let me bring up the crashed node as a secondary. Any time I start up that node (with the data left alone) it just crashes again. It only lets me bring up the node if I get rid of all the data files for the database that was throwing the error, which seems like a terrible solution and one I should only use if there are no other options, because then I lose all the data from that node that didn't get to sync.
On Nov 12, 2014 6:00 PM, "Ekin Koc" wrote:
Did your secondaries elect a new primary and keep the set online? I believe the crashed node, now joining as a secondary, has some data that is incompatible with the rest of your replica set, and it crashes while replicating that. Not sure how though. If the rest of your set is running, you might wanna consider resyncing your old primary.
On Thursday, November 13, 2014 1:10:32 AM UTC+2, rhea ghosh wrote:
I have a 3 node replica set and I'm running MongoDB 2.6.1.
My primary crashed this afternoon and won't come up because of this error.
2014-11-12T15:39:09.836-0600 [repl writer worker 1] ERROR: writer worker caught exception: :: caused by :: 11000 insertDocument :: caused by :: 11000 E11000 duplicate key error index: somedatabase.colname.$field_1_morefield_1 dup key: { : "NAME/141119", : "YOLO" } on: { ts: Timestamp 1415827255000|1, h: -7641148999367123021, v: 2, op: "i", ns: "somedatabase.colname", o: { _id: ObjectId('5463cf37e7c5240e5f096dea'), morefield: "YOLO", field: "NAME/141119", server: [ "SERVERFORME038" ] } }
2014-11-12T15:39:09.836-0600 [repl writer worker 1] Fatal Assertion 16360
2014-11-12T15:39:09.836-0600 [repl writer worker 1]
***aborting after fassert() failure
I didn't think an index key error would cause a primary to crash and not be able to come up. How can I fix this and how do I keep as much of my data as possible?
--Secondary
2014-11-12T15:14:14.532-0600 [rsHealthPoll] DBClientCursor::init call() failed
2014-11-12T15:14:14.542-0600 [rsHealthPoll] can't authenticate to MONGOSERVER1:27017 (10.0.240.250) failed as internal user, error: DBClientBase::findN: transport error: MONGOSERVER1:27017 ns: local.$cmd query: { getnonce: 1 }
--Arbiter
2014-11-12T15:14:22.484-0600 [rsHealthPoll] DBClientCursor::init call() failed
2014-11-12T15:14:22.485-0600 [rsHealthPoll] can't authenticate to MONGOSERVER1:27017 (10.0.240.250) failed as internal user, error: DBClientBase::findN: transport error: MONGOSERVER1:27017 ns: local.$cmd query: { getnonce: 1 }
2014-11-12T15:14:24.544-0600 [conn529946] MONGOSERVER2:27017 is trying to elect itself but MONGOSERVER1:27017 is already primary
2014-11-12T15:14:26.983-0600 [conn529947] authenticate db: local { authenticate: 1, nonce: "xxx", user: "__system", key: "xxx" }
2014-11-12T15:14:30.984-0600 [conn529947] MONGOSERVER2:27017 is trying to elect itself but MONGOSERVER1:27017 is already primary
2014-11-12T15:14:32.485-0600 [rsHealthPoll] DBClientCursor::init call() failed
2014-11-12T15:14:32.486-0600 [rsHealthPoll] replSet info MONGOSERVER1:27017 is down (or slow to respond):
2014-11-12T15:14:32.486-0600 [rsHealthPoll] replSet member MONGOSERVER1:27017 is now in state DOWN
2014-11-12T15:14:36.545-0600 [conn529947] replSet info voting yea for MONGOSERVER2:27017 (1)
2014-11-12T15:14:36.941-0600 [rsHealthPoll] replSet member MONGOSERVER1:27017 is up
2014-11-12T15:14:36.941-0600 [rsHealthPoll] replSet member MONGOSERVER1:27017 is now in state SECONDARY
$ ulimit -a
core file size (blocks, -c) unlimited
data seg size (kbytes, -d) unlimited
scheduling priority (-e) 0
file size (blocks, -f) unlimited
pending signals (-i) 2066228
max locked memory (kbytes, -l) 64
max memory size (kbytes, -m) unlimited
open files (-n) 32768
pipe size (512 bytes, -p) 8
POSIX message queues (bytes, -q) 819200
real-time priority (-r) 0
stack size (kbytes, -s) 10240
cpu time (seconds, -t) unlimited
max user processes (-u) 1024
virtual memory (kbytes, -v) unlimited
file locks (-x) unlimited
Ok, that helps. I'll focus my efforts looking into potential networking issues instead of eating time looking into the OS/local hardware.
Thanks so much!