2014-03-11 13:22:54,881 INFO org.apache.hadoop.hive.ql.exec.MapOperator: 7 forwarding 10000 rows 2014-03-11 13:22:54,881 INFO org.apache.hadoop.hive.ql.exec.TableScanOperator: 3 forwarding 10000 rows 2014-03-11 13:22:54,881 INFO org.apache.hadoop.hive.ql.exec.SelectOperator: 2 forwarding 10000 rows 2014-03-11 13:22:54,881 INFO ExecMapper: ExecMapper: processing 10000 rows: used memory = 144388608 2014-03-11 13:22:55,162 INFO org.apache.hadoop.hive.ql.exec.MapOperator: 7 forwarding 100000 rows 2014-03-11 13:22:55,163 INFO org.apache.hadoop.hive.ql.exec.TableScanOperator: 3 forwarding 100000 rows 2014-03-11 13:22:55,163 INFO org.apache.hadoop.hive.ql.exec.SelectOperator: 2 forwarding 100000 rows 2014-03-11 13:22:55,163 INFO ExecMapper: ExecMapper: processing 100000 rows: used memory = 152418384 2014-03-11 13:22:56,001 INFO com.mongodb.hadoop.mapred.input.BSONFileRecordReader: closing bson file split. 2014-03-11 13:22:56,001 INFO com.mongodb.hadoop.mapred.input.BSONFileRecordReader: closing bson file split. 2014-03-11 13:22:56,003 INFO org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader: Processing file hdfs://localhost:8020/user/hive/warehouse/jb.db/visit/visits.bson 2014-03-11 13:22:56,005 ERROR com.mongodb.hadoop.mapred.input.BSONFileRecordReader: Error reading key/value from bson file: BSONDecoder doesn't understand type : 57 name: 4091990.531827d6e2076 2014-03-11 13:22:56,005 INFO com.mongodb.hadoop.mapred.input.BSONFileRecordReader: closing bson file split. 2014-03-11 13:22:56,005 INFO com.mongodb.hadoop.mapred.input.BSONFileRecordReader: closing bson file split. 2014-03-11 13:22:56,007 INFO org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader: Processing file hdfs://localhost:8020/user/hive/warehouse/jb.db/visit/visits.bson 2014-03-11 13:22:56,008 ERROR com.mongodb.hadoop.mapred.input.BSONFileRecordReader: Error reading key/value from bson file: null 2014-03-11 13:22:56,008 INFO com.mongodb.hadoop.mapred.input.BSONFileRecordReader: closing bson file split. 2014-03-11 13:22:56,008 INFO com.mongodb.hadoop.mapred.input.BSONFileRecordReader: closing bson file split. 2014-03-11 13:22:56,010 INFO org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader: Processing file hdfs://localhost:8020/user/hive/warehouse/jb.db/visit/visits.bson 2014-03-11 13:22:56,011 ERROR com.mongodb.hadoop.mapred.input.BSONFileRecordReader: Error reading key/value from bson file: BSONDecoder doesn't understand type : 117 name: id 2014-03-11 13:22:56,011 INFO com.mongodb.hadoop.mapred.input.BSONFileRecordReader: closing bson file split. 2014-03-11 13:22:56,011 INFO com.mongodb.hadoop.mapred.input.BSONFileRecordReader: closing bson file split. 2014-03-11 13:22:56,011 INFO org.apache.hadoop.hive.ql.exec.MapOperator: 7 finished. closing... 2014-03-11 13:22:56,011 INFO org.apache.hadoop.hive.ql.exec.MapOperator: 7 forwarded 541194 rows 2014-03-11 13:22:56,011 INFO org.apache.hadoop.hive.ql.exec.MapOperator: DESERIALIZE_ERRORS:0 2014-03-11 13:22:56,011 INFO org.apache.hadoop.hive.ql.exec.TableScanOperator: 3 finished. closing... 2014-03-11 13:22:56,011 INFO org.apache.hadoop.hive.ql.exec.TableScanOperator: 3 forwarded 541194 rows 2014-03-11 13:22:56,011 INFO org.apache.hadoop.hive.ql.exec.SelectOperator: 2 finished. closing... 2014-03-11 13:22:56,012 INFO org.apache.hadoop.hive.ql.exec.SelectOperator: 2 forwarded 541194 rows 2014-03-11 13:22:56,012 INFO org.apache.hadoop.hive.ql.exec.GroupByOperator: 1 finished. closing...
Below is my table definition in Hive:
CREATE EXTERNAL TABLE test(
visitId STRING,
browserId STRING,
softUserId STRING,
userId STRING,
matchType BOOLEAN,
ts TIMESTAMP
)
ROW FORMAT SERDE 'com.mongodb.hadoop.hive.BSONSerDe'
WITH SERDEPROPERTIES('mongo.columns.mapping'='{"visitid":"_id", "browserid":"bid", "softuserid":"uid0", "userid":"uid", "matchtype":"um"}')
STORED AS INPUTFORMAT 'com.mongodb.hadoop.mapred.BSONFileInputFormat'
OUTPUTFORMAT 'com.mongodb.hadoop.hive.output.HiveBSONFileOutputFormat';The number of rows in MongoDB is about 3.2 million and in Hive I see only .5 million rows.
I am using 1.0.3 version of Hadoop and 1.2 version of mongo hadoop project. The mongo java driver is 2.11.3.
Any ideas what is causing this?
Thanks
-Mayur
--
--
You received this message because you are subscribed to the Google
Groups "mongodb-user" group.
To post to this group, send email to mongod...@googlegroups.com
To unsubscribe from this group, send email to
mongodb-user...@googlegroups.com
See also the IRC channel -- freenode.net#mongodb
---
You received this message because you are subscribed to the Google Groups "mongodb-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to mongodb-user...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
For other MongoDB technical support options, see: http://www.mongodb.org/about/support/.
---
You received this message because you are subscribed to the Google Groups "mongodb-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to mongodb-user...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/mongodb-user/6891fadb-1521-4a46-8f97-9da5f37f2429%40googlegroups.com.To post to this group, send email to mongod...@googlegroups.com.
Visit this group at http://groups.google.com/group/mongodb-user.
You received this message because you are subscribed to a topic in the Google Groups "mongodb-user" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/mongodb-user/g2dxZmEGqiM/unsubscribe.
To unsubscribe from this group and all its topics, send an email to mongodb-user...@googlegroups.com.
To post to this group, send email to mongod...@googlegroups.com.
Visit this group at http://groups.google.com/group/mongodb-user.
To view this discussion on the web visit https://groups.google.com/d/msgid/mongodb-user/CAH7JHGeM6oXUy2wuY_W88%2BPxd%3DzNHYUa0_WSoJMP7wC2oAn4SA%40mail.gmail.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/mongodb-user/CABeCSNUHuWVe49snRbVgj1KqmpMV_98pWam%3D39R5VND5V%3D7V1w%40mail.gmail.com.
--
You received this message because you are subscribed to the Google Groups "mongodb-user"
group.
For other MongoDB technical support options, see: http://www.mongodb.org/about/support/.
---
You received this message because you are subscribed to a topic in the Google Groups "mongodb-user" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/mongodb-user/g2dxZmEGqiM/unsubscribe.
To unsubscribe from this group and all its topics, send an email to mongodb-user...@googlegroups.com.
To post to this group, send email to mongod...@googlegroups.com.
Visit this group at http://groups.google.com/group/mongodb-user.
To view this discussion on the web visit https://groups.google.com/d/msgid/mongodb-user/d2090f23-645c-4a49-9b10-3c3f580e7ff9%40googlegroups.com.
Mayur
I have the same problem with you.Bsonfileinputformat not work in hive.Have you resolved?
--
You received this message because you are subscribed to the Google Groups "mongodb-user"
group.
For other MongoDB technical support options, see: http://www.mongodb.org/about/support/.
---
You received this message because you are subscribed to a topic in the Google Groups "mongodb-user" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/mongodb-user/g2dxZmEGqiM/unsubscribe.
To unsubscribe from this group and all its topics, send an email to mongodb-user...@googlegroups.com.
To post to this group, send email to mongod...@googlegroups.com.
Visit this group at http://groups.google.com/group/mongodb-user.
To view this discussion on the web visit https://groups.google.com/d/msgid/mongodb-user/dc802837-d7ee-4dd9-a027-9a55ece7901b%40googlegroups.com.