Hi all,
The appengine-mapreduce repository now contains a new version of App
Engine MapReduce for Java. It finally includes support for shuffling
and reducing. It also has reliability improvements and integrates
with appengine-pipeline. See
http://code.google.com/p/appengine-mapreduce/wiki/GettingStartedInJava
if you want to start using it.
This is an early release; the code is still somewhat rough and has not
been tested extensively, but it's superior to the previous version in
many ways already. Some features are missing (such as a Hadoop
compatibility layer and an overview page that shows past and current
jobs); we'll publish improvements as we make them.
Please let us know through this list and the issue tracker what bugs
you run into and what features you'd like to see.
For documentation, see the above link and the Javadoc. More
documentation will be coming over the next months.
See
http://code.google.com/p/appengine-mapreduce/wiki/LargeDatasets
for limitations on dataset size and what to do if you need to process
larger datasets.
Happy hacking,
Christian.