iterating across sharded collection

progolferyo

unread,

Apr 27, 2012, 12:45:03 PM4/27/12

to mongodb-user

We have a sharded cluster, 3 shards, each with a replica set.
Standard stuff. I am trying to write a script that simply iterates
over the entire collection.

So I connect to the mongos server, and iterate across the entire
collection for the purposes of testing, im just counting how many
items i iterate over. My java code looks like this:

DB db = m.getDB("main_db");
DBCollection coll = db.getCollection("users");
DBCursor cur = coll.find();
int counter = 0;

while(cur.hasNext()) {
cur.next();
counter++;
}

So the issue is that i have about 30M items in the collection and when
I run the script, counter finishes with around 11M items, which means
it quit after the first shard. If I run the script again, shortly
after the first time I ran it, I sometimes get to 20M items, which
looks like it got to the second shard but not the third. I have yet
to get all the way to the third shard.

So question is how do I actually iterate over an entire collection
with the guarantee that I'm hitting all shards in the cluster?

And am I doing something wrong in my implementation, it seems pretty
straightforward.

Greg Studer

unread,

Apr 27, 2012, 7:01:04 PM4/27/12

to mongodb-user

There's a known issue where your cursors may be timing out - have you
added the NoTimeout option to the query to mongodb? How to do so
depends on your driver.

progolferyo

unread,

May 2, 2012, 2:05:54 PM5/2/12

to mongod...@googlegroups.com

I've tried it in multiple languages and it still happens. I'm using the Java driver, which seems to have no way to set the cursor timeout, unless I'm mistaken.

Should I just use another language?

Brendan W. McAdams

unread,

May 2, 2012, 2:30:15 PM5/2/12

to mongod...@googlegroups.com

With the java driver, invoke

cursor.setOption(Bytes.QUERYOPTION_NOTIMEOUT)

--
You received this message because you are subscribed to the Google Groups "mongodb-user" group.
To view this discussion on the web visit https://groups.google.com/d/msg/mongodb-user/-/ufaFz8bwwlsJ.
To post to this group, send email to mongod...@googlegroups.com.
To unsubscribe from this group, send email to mongodb-user...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/mongodb-user?hl=en.

Reply all

Reply to author

Forward