Dear all,
I am planing to do extensive data mining in Mahout with data that are collected in mongodb.
In the meantime, I found out that I can use hadoop mongodb connector to run hadoop jobs seemlessly on top of mongodb without moving gigabytes of data to hdfs.
I wonder if you may help pointing out what is the most popular way to do extensive hadoop mining with mongodb data? I really mean in production.
Should I definitively stick with the official hadoop mongo connector?
Thanks very much,
Peter