We are trying to query our mongo collection in hive, as it described in example.
It works with relatively small collections,
but when we issuing queries to our biggest collection it always fails on stage Map 1 with errors:
java.lang.OutOfMemoryError: GC overhead limit exceeded
java.lang.OutOfMemoryError: Java heap space
How can we configure mongo-hadoop to increase maps count and get results for our queries?
select count(*) from mongodb;
select * from mongodb where id=1;
Also tried to set memory option with no effect:
set mapred.child.java.opts="-Xmx12g -XX:+UseConcMarkSweepGC";
Hive 1.2.1.2.3.0.0-2557
mongo-hadoop 1.3.3 / mongo-hadoop 1.5.0