Hi all -- we're trying to import 3 million rows of test data into
mongo (1.8.2). Each line in the JSON file looks very similar to this:
{ "_id" : { "$oid" : "4e711ac4d413190fcc0000c2" }, "popularids" : [],
"listing_ids" : [], "email_optin" : true, "sms_age_verification" :
null, "created_at" : { "$date" : 1316035268000 }, "sms_alert" : false,
"updated_at" : { "$date" : 1316035268000 }, "categoryids" : [ 5118494,
5118936, 5118495, 5118487, 5118492, 5118924, 5118993, 5118498,
5118497, 5118521 ], "email_processed" : false, "mobile_number" : null,
"email_alert" : true, "last_name" : null, "storeid" : 2395871,
"email_age_verification" : true, "remind_time" : null, "sms_optin" :
true, "brandids" : [ -2099999884, -2099997271, -2099998575 ],
"email" : "te...@test.com", "newsletter_signup" : true, "first_name" :
null, "remind_timezone" : null, "sms_processed" : false }
We import the file via:
mongoimport -d (database) -c performance_test --file test_data.json
And we're seeing between 8 and 20 records per second for the import!
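To put that rate in perspective, here is a rough back-of-the-envelope sketch. It is plain arithmetic on the numbers above, assuming the rate stays steady for the whole import; the function name is just for illustration. At 8 to 20 records per second, 3 million rows would take roughly 42 to 104 hours.

```python
# Rough estimate only: assumes the observed import rate stays constant.
TOTAL_ROWS = 3_000_000  # size of the test data set mentioned above


def hours_to_import(rows: int, rows_per_second: float) -> float:
    """Return the wall-clock hours needed to import `rows` at a steady rate."""
    return rows / rows_per_second / 3600


if __name__ == "__main__":
    # The two ends of the observed range (records per second).
    for rate in (8, 20):
        print(f"{rate} rows/s -> {hours_to_import(TOTAL_ROWS, rate):.1f} hours")
```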
This collection has no indexes on it:
.getIndexes()
[ ]
staging:PRIMARY>
and mongostat doesn't show anything unusual going on:
insert query update delete getmore command flushes mapped vsize   res faults locked % idx miss % qr|qw ar|aw netIn netOut conn     set repl     time
    *0    *0     *0     *0       0     1|0       0  14.2g 28.5g 1.07g      0        0          0   0|0   0|0   62b     1k    5 staging  SEC 19:51:37
     2     5      0      0       0       2       0  14.2g   29g  373m      0        0          0   5|1   9|1    1k     1k   29 staging    M 19:51:37
    *0    *0     *0     *0       0     3|0       0  14.2g 28.5g 1.07g      0        0          0   0|0   0|0  326b     1k    5 staging  SEC 19:51:38
     0     0      0      0       0       2       0  14.2g   29g  373m      0        0          0   7|1   8|1  431b     1k   29 staging    M 19:51:38
    *0    *0     *0     *0       0     1|0       0  14.2g 28.5g 1.07g      0        0          0   0|0   0|0   62b     1k    5 staging  SEC 19:51:39
     1     1      0      0       0       2       0  14.2g   29g  373m      0        0          0   2|1   4|1    1k     1k   29 staging    M 19:51:39
    *0    *0     *0     *0       0     3|0       0  14.2g 28.5g 1.07g      0        0          0   0|0   0|0  326b     1k    5 staging  SEC 19:51:40
     4     8      0      0       0       2       0  14.2g   29g  373m      0      0.1          0   4|1   5|1    3k     1k   29 staging    M 19:51:40
    *0    *0     *0     *0       0     1|0       0  14.2g 28.5g 1.07g      0        0          0   0|0   0|0   62b     1k    5 staging  SEC 19:51:41
     1     8      0      0       0       2       0  14.2g   29g  373m      0        0          0   6|1   8|1    1k     1k   29 staging    M 19:51:41
insert query update delete getmore command flushes mapped vsize   res faults locked % idx miss % qr|qw ar|aw netIn netOut conn     set repl     time
    *0    *0     *0     *0       0     3|0       0  14.2g 28.5g 1.07g      0        0          0   0|0   0|0  326b     1k    5 staging  SEC 19:51:42
     0     0      0      0       0       2       0  14.2g   29g  373m      0        0          0   7|1   8|1  194b     1k   29 staging    M 19:51:42
    *0    *0     *0     *0       0     2|0       0  14.2g 28.5g 1.07g      0        0          0   0|0   0|0  194b     1k    5 staging  SEC 19:51:43
     3     1      0      0       0       2       0  14.2g   29g  373m      0        0          0   7|1   7|1    2k     1k   29 staging    M 19:51
What could be causing such a dramatic slowdown? I'm used to
20-50k records per second. Thanks in advance!