legacy mapreduce library only running one shard

60 views
Skip to first unread message

Jason Collins

unread,
Dec 20, 2012, 4:34:52 PM12/20/12
to google-a...@googlegroups.com
We are using the "legacy" (non-PipelineAPI) version of the mapreduce library: http://code.google.com/p/appengine-mapreduce/

The issue is that we can only ever get one shard processing, even for kinds that have >150,000 entities. We have tried different shard_count configurations, e.g, 4, 16, 128, but always only one shard processing entire dataset, which is very slow.

I feel like I've missed a step (e.g., creating an index or something).

Crossing my fingers that someone knows an offhand answer.

Thanks,
j

Jason Collins

unread,
Jan 3, 2013, 4:31:20 PM1/3/13
to google-a...@googlegroups.com
I found the bug that leads to this in the mapreduce framework. The patch to fix is attached to the issue: http://code.google.com/p/appengine-mapreduce/issues/detail?id=154

Johan Euphrosine

unread,
Jan 5, 2013, 10:14:15 AM1/5/13
to Google App Engine .
Thanks for the report (and the patch!).

--
You received this message because you are subscribed to the Google Groups "Google App Engine" group.
To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/_anjmdcsBXgJ.

To post to this group, send email to google-a...@googlegroups.com.
To unsubscribe from this group, send email to google-appengi...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.



--
Johan Euphrosine (proppy)
Developer Programs Engineer
Google Developer Relations

Xiaolong Cheng

unread,
Jan 16, 2018, 5:25:49 PM1/16/18
to Google App Engine
This is suddenly happening to me, since Jan 2018.  May I know what had been done to fix the issue? The original link is broken.
Reply all
Reply to author
Forward
0 new messages