Following the example provided at http://dumbotics.com/2009/06/16/integration-with-java-code/#comments I've created a class MyCustomPartitioner that extends the hadoop HashPartitioner, compiled it correctly and then put it in a jar MyCustomPartitioner.jar. The problem is that I can't make hadoop find the new partitioner; running the program with:
dumbo start [python program, -hadoop, -input, -output, etc as usual] -partitioner MyCustomPartitioner -libjar MyCustomPartitioner.jar
I get the following error:
-partitioner : class not found : MyCustomPartitioner
Any suggestion? Thanks.
I'm using dumbo 0.21, (oracle) java 1.7.0_17 and hadoop 1.1.2