Pig Protobuf Error


Samir Madhavan

Jun 6, 2013, 4:50:53 AM
to elephant...@googlegroups.com
Hi,

I'm trying to read protobuf data using Pig, but I'm getting the following error and am not able to narrow down the problem.

grunt> raw = LOAD '/prt/LogCopy.bin' USING Logs;
2013-06-06 13:16:52,555 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1200: Pig script failed to parse: 
<line 2, column 6> pig script failed to validate: java.lang.RuntimeException: could not instantiate 'com.twitter.elephantbird.pig.piggybank.ProtobufBytesToTuple' with arguments '[com.protobuf.LogProtos.Logs]'
2013-06-06 13:16:52,555 [main] WARN  org.apache.pig.tools.grunt.Grunt - There is no log file to write to.
2013-06-06 13:16:52,555 [main] ERROR org.apache.pig.tools.grunt.Grunt - Failed to parse: Pig script failed to parse: 
<line 2, column 6> pig script failed to validate: java.lang.RuntimeException: could not instantiate 'com.twitter.elephantbird.pig.piggybank.ProtobufBytesToTuple' with arguments '[com.protobuf.LogProtos.Logs]'
at org.apache.pig.parser.QueryParserDriver.parse(QueryParserDriver.java:191)
at org.apache.pig.PigServer$Graph.validateQuery(PigServer.java:1572)
at org.apache.pig.PigServer$Graph.registerQuery(PigServer.java:1545)
at org.apache.pig.PigServer.registerQuery(PigServer.java:518)
at org.apache.pig.tools.grunt.GruntParser.processPig(GruntParser.java:991)
at org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:412)
at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:194)
at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:170)
at org.apache.pig.tools.grunt.Grunt.run(Grunt.java:69)
at org.apache.pig.Main.run(Main.java:538)
at org.apache.pig.Main.main(Main.java:157)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:601)
at org.apache.hadoop.util.RunJar.main(RunJar.java:208)
Caused by: 
<line 2, column 6> pig script failed to validate: java.lang.RuntimeException: could not instantiate 'com.twitter.elephantbird.pig.piggybank.ProtobufBytesToTuple' with arguments '[com.protobuf.LogProtos.Logs]'
at org.apache.pig.parser.LogicalPlanBuilder.buildLoadOp(LogicalPlanBuilder.java:835)
at org.apache.pig.parser.LogicalPlanGenerator.load_clause(LogicalPlanGenerator.java:3236)
at org.apache.pig.parser.LogicalPlanGenerator.op_clause(LogicalPlanGenerator.java:1315)
at org.apache.pig.parser.LogicalPlanGenerator.general_statement(LogicalPlanGenerator.java:799)
at org.apache.pig.parser.LogicalPlanGenerator.statement(LogicalPlanGenerator.java:517)
at org.apache.pig.parser.LogicalPlanGenerator.query(LogicalPlanGenerator.java:392)
at org.apache.pig.parser.QueryParserDriver.parse(QueryParserDriver.java:184)
... 15 more
Caused by: java.lang.RuntimeException: could not instantiate 'com.twitter.elephantbird.pig.piggybank.ProtobufBytesToTuple' with arguments '[com.protobuf.LogProtos.Logs]'
at org.apache.pig.impl.PigContext.instantiateFuncFromSpec(PigContext.java:618)
at org.apache.pig.parser.LogicalPlanBuilder.buildLoadOp(LogicalPlanBuilder.java:823)
... 21 more
Caused by: java.lang.reflect.InvocationTargetException
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57)
at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
at java.lang.reflect.Constructor.newInstance(Constructor.java:525)
at org.apache.pig.impl.PigContext.instantiateFuncFromSpec(PigContext.java:586)
... 22 more
Caused by: java.lang.NoClassDefFoundError: Could not initialize class com.protobuf.LogProtos
at java.lang.Class.forName0(Native Method)
at java.lang.Class.forName(Class.java:188)
at com.twitter.elephantbird.util.Protobufs.getInnerClass(Protobufs.java:92)
at com.twitter.elephantbird.util.Protobufs.getInnerProtobufClass(Protobufs.java:87)
at com.twitter.elephantbird.util.Protobufs.getProtobufClass(Protobufs.java:69)
at com.twitter.elephantbird.util.Protobufs.getProtobufClass(Protobufs.java:55)
at com.twitter.elephantbird.pig.util.PigUtil.getProtobufClass(PigUtil.java:55)
at com.twitter.elephantbird.pig.util.PigUtil.getProtobufTypeRef(PigUtil.java:89)
at com.twitter.elephantbird.pig.piggybank.ProtobufBytesToTuple.<init>(ProtobufBytesToTuple.java:37)
... 27 more

Regards,
samir

Dmitriy Ryaboy

Jun 6, 2013, 9:07:13 AM
to elephant...@googlegroups.com
It sounds like your class com.protobuf.LogProtos is either missing from the classpath or was generated with the wrong version of the protoc binary.
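
A quick way to tell those two apart, outside Pig, is a small probe like the sketch below (a hypothetical helper, not part of elephant-bird; run it with the same jars on the classpath that you REGISTER in grunt, passing the class name from your error):

  // ProtoClassCheck.java -- distinguishes "not on the classpath" from
  // "found, but its static initializer failed", which is the usual
  // symptom of a protoc / protobuf-java version mismatch.
  public class ProtoClassCheck {
    public static void main(String[] args) {
      String name = args.length > 0 ? args[0] : "com.protobuf.LogProtos";
      try {
        Class<?> c = Class.forName(name);  // forName() runs the static initializer
        System.out.println("loaded OK: " + c.getName());
      } catch (ClassNotFoundException e) {
        System.out.println("not on the classpath: " + name);
      } catch (LinkageError e) {
        System.out.println("found, but failed to initialize: " + e);
      }
    }
  }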


--
Dmitriy V Ryaboy
Twitter Analytics
http://twitter.com/squarecog

Samir Madhavan

Jun 6, 2013, 12:09:46 PM
to elephant...@googlegroups.com
Thanks Dmitriy, I'll check that out. 

I was also trying to use the default AddressBook example, but I get the following error. The data is in a binary file. Am I passing in the wrong data, or is something else going wrong?

Sorry if the question sounds naive, but I'm new to protobufs.

raw = LOAD 'AddressBook.bin' USING AddressBook;
2013-06-06 20:52:56,475 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1200: Pig script failed to parse: 
<line 2, column 6> pig script failed to validate: java.lang.ClassCastException: com.twitter.elephantbird.pig.piggybank.ProtobufBytesToTuple cannot be cast to org.apache.pig.LoadFunc
2013-06-06 20:52:56,475 [main] WARN  org.apache.pig.tools.grunt.Grunt - There is no log file to write to.
2013-06-06 20:52:56,475 [main] ERROR org.apache.pig.tools.grunt.Grunt - Failed to parse: Pig script failed to parse: 
<line 2, column 6> pig script failed to validate: java.lang.ClassCastException: com.twitter.elephantbird.pig.piggybank.ProtobufBytesToTuple cannot be cast to org.apache.pig.LoadFunc
at org.apache.pig.parser.QueryParserDriver.parse(QueryParserDriver.java:191)
at org.apache.pig.PigServer$Graph.validateQuery(PigServer.java:1572)
at org.apache.pig.PigServer$Graph.registerQuery(PigServer.java:1545)
at org.apache.pig.PigServer.registerQuery(PigServer.java:518)
at org.apache.pig.tools.grunt.GruntParser.processPig(GruntParser.java:991)
at org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:412)
at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:194)
at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:170)
at org.apache.pig.tools.grunt.Grunt.run(Grunt.java:69)
at org.apache.pig.Main.run(Main.java:538)
at org.apache.pig.Main.main(Main.java:157)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:601)
at org.apache.hadoop.util.RunJar.main(RunJar.java:208)
Caused by: 
<line 2, column 6> pig script failed to validate: java.lang.ClassCastException: com.twitter.elephantbird.pig.piggybank.ProtobufBytesToTuple cannot be cast to org.apache.pig.LoadFunc
at org.apache.pig.parser.LogicalPlanBuilder.buildLoadOp(LogicalPlanBuilder.java:835)
at org.apache.pig.parser.LogicalPlanGenerator.load_clause(LogicalPlanGenerator.java:3236)
at org.apache.pig.parser.LogicalPlanGenerator.op_clause(LogicalPlanGenerator.java:1315)
at org.apache.pig.parser.LogicalPlanGenerator.general_statement(LogicalPlanGenerator.java:799)
at org.apache.pig.parser.LogicalPlanGenerator.statement(LogicalPlanGenerator.java:517)
at org.apache.pig.parser.LogicalPlanGenerator.query(LogicalPlanGenerator.java:392)
at org.apache.pig.parser.QueryParserDriver.parse(QueryParserDriver.java:184)
... 15 more
Caused by: java.lang.ClassCastException: com.twitter.elephantbird.pig.piggybank.ProtobufBytesToTuple cannot be cast to org.apache.pig.LoadFunc
at org.apache.pig.parser.LogicalPlanBuilder.buildLoadOp(LogicalPlanBuilder.java:823)
... 21 more


Raghu Angadi

Jun 6, 2013, 12:18:51 PM
to elephant...@googlegroups.com
Oh, to load you need to use the protobuf loader:

a = load 'input.lzo' using com.twitter.elephantbird.pig.load.ProtobufPigLoader('class_name');

What is the format of your input? If it is a sequence file, there is another loader for that.

Raghu.

Samir Madhavan

Jun 6, 2013, 12:42:05 PM
to elephant...@googlegroups.com
Thanks a lot. I was misled by the README at https://github.com/daggerrz/Pig-Protobuf

jobillouis joseph

Jun 7, 2013, 3:32:42 AM
to elephant...@googlegroups.com
Hi,

I'm facing an issue after the load. Following the example code, I get the error below. I also tried removing the phone number and keeping just person.name; the MapReduce job starts, but it doesn't map to any fields in the data. I've added the log after the following block, and attached the data I'm using.


grunt> raw = LOAD 'AddressBook.data' USING com.twitter.elephantbird.pig.load.ProtobufPigLoader('com.twitter.elephantbird.examples.proto.AddressBookProtos.AddressBook');
grunt> person_phone_numbers = foreach raw generate name, FLATTEN(phone.phone_tuple.number) as phone_number;
2013-06-07 12:32:51,881 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1128: Cannot find field phone_tuple in name:chararray,id:int,email:chararray,phone:bag{phone_tuple:tuple(number:chararray,type:chararray)}
2013-06-07 12:32:51,881 [main] WARN  org.apache.pig.tools.grunt.Grunt - There is no log file to write to.
2013-06-07 12:32:51,881 [main] ERROR org.apache.pig.tools.grunt.Grunt - Failed to parse: Pig script failed to parse: 
<line 22, column 23> pig script failed to validate: org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1128: Cannot find field phone_tuple in name:chararray,id:int,email:chararray,phone:bag{phone_tuple:tuple(number:chararray,type:chararray)}
at org.apache.pig.parser.QueryParserDriver.parse(QueryParserDriver.java:191)

grunt> person_phone_numbers = foreach raw generate person.name;
grunt> dump person_phone_numbers;
2013-06-07 12:34:36,338 [main] INFO  org.apache.pig.tools.pigstats.ScriptState - Pig features used in the script: UNKNOWN
2013-06-07 12:34:36,342 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MRCompiler - File concatenation threshold: 100 optimistic? false
2013-06-07 12:34:36,343 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size before optimization: 1
2013-06-07 12:34:36,343 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size after optimization: 1
2013-06-07 12:34:36,344 [main] INFO  org.apache.pig.tools.pigstats.ScriptState - Pig script settings are added to the job
2013-06-07 12:34:36,344 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
2013-06-07 12:34:36,344 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - Using reducer estimator: org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.InputSizeReducerEstimator
2013-06-07 12:34:36,347 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.InputSizeReducerEstimator - BytesPerReducer=1000000000 maxReducers=999 totalInputFileSize=81
2013-06-07 12:34:36,347 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - Setting Parallelism to 1
2013-06-07 12:34:36,573 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - creating jar file Job5349291827032820997.jar
2013-06-07 12:34:39,371 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - jar file Job5349291827032820997.jar created
2013-06-07 12:34:39,378 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - Setting up single store job
2013-06-07 12:34:39,379 [main] INFO  org.apache.pig.data.SchemaTupleFrontend - Key [pig.schematuple] is false, will not generate code.
2013-06-07 12:34:39,379 [main] INFO  org.apache.pig.data.SchemaTupleFrontend - Starting process to move generated code to distributed cacche
2013-06-07 12:34:39,379 [main] INFO  org.apache.pig.data.SchemaTupleFrontend - Setting key [pig.schematuple.classes] with classes to deserialize []
2013-06-07 12:34:39,388 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 1 map-reduce job(s) waiting for submission.
2013-06-07 12:34:39,405 [JobControl] WARN  org.apache.hadoop.mapred.JobClient - Use GenericOptionsParser for parsing the arguments. Applications should implement Tool for the same.
2013-06-07 12:34:39,632 [JobControl] INFO  org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to process : 1
2013-06-07 12:34:39,632 [JobControl] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths (combined) to process : 0
2013-06-07 12:34:39,889 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 0% complete
2013-06-07 12:34:40,419 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - HadoopJobId: job_201306062036_0012
2013-06-07 12:34:40,419 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Processing aliases person_phone_numbers,raw
2013-06-07 12:34:40,419 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - detailed locations: M: raw[21,6],person_phone_numbers[22,23] C:  R: 
2013-06-07 12:34:40,419 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - More information at: http://localhost:50030/jobdetails.jsp?jobid=job_201306062036_0012
2013-06-07 12:34:49,996 [main] WARN  mapreduce.Counters - Group org.apache.hadoop.mapred.Task$Counter is deprecated. Use org.apache.hadoop.mapreduce.TaskCounter instead
2013-06-07 12:34:49,998 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 100% complete
2013-06-07 12:34:49,999 [main] INFO  org.apache.pig.tools.pigstats.SimplePigStats - Script Statistics: 

HadoopVersion PigVersion UserId StartedAt FinishedAt Features
2.0.0-cdh4.3.0 0.11.0-cdh4.3.0 hduser 2013-06-07 12:34:36 2013-06-07 12:34:49 UNKNOWN

Success!

Job Stats (time in seconds):
JobId Maps Reduces MaxMapTime MinMapTIme AvgMapTime MedianMapTime MaxReduceTime MinReduceTime AvgReduceTime MedianReducetime Alias Feature Outputs
job_201306062036_0012 0 0 0 0 0 0 0 0 0 0 person_phone_numbers,raw MAP_ONLY hdfs://localhost:8020/tmp/temp976511903/tmp-690803741,

Input(s):
Successfully read 0 records from: "hdfs://localhost:8020/user/hduser/input/AddressBook.data"

Output(s):
Successfully stored 0 records in: "hdfs://localhost:8020/tmp/temp976511903/tmp-690803741"

Counters:
Total records written : 0
Total bytes written : 0
Spillable Memory Manager spill count : 0
Total bags proactively spilled: 0
Total records proactively spilled: 0

Job DAG:
job_201306062036_0012


2013-06-07 12:34:50,016 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Success!
2013-06-07 12:34:50,017 [main] INFO  org.apache.pig.data.SchemaTupleBackend - Key [pig.schematuple] was not set... will not generate code.
2013-06-07 12:34:50,029 [main] INFO  org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to process : 0
2013-06-07 12:34:50,029 [main] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 0


Attachment: AddressBook.data

Raghu Angadi

Jun 7, 2013, 11:21:53 AM
to elephant...@googlegroups.com
First issue: could you try 'FLATTEN(phone.number)' (without phone_tuple)? Here 'phone' is the array (a 'bag' in Pig), and 'phone_tuple' is just the name of the tuple type. It is similar to accessing a Java array:

  // if you have
  PhoneInfo[] phones = new PhoneInfo[10]; ...
  // you would access the number of the first phone as:
  phones[0].number  // not phones[0].PhoneInfo.number

Second issue: your input AddressBook.data is not in the right format. How did you create it? It needs to be base64 lines or elephant-bird's 'binary format', or it could be a Hadoop sequence file (loaded with another loader). I could give you an example if you like.

Raghu.

jobillouis joseph

Jun 8, 2013, 8:45:58 AM
to elephant...@googlegroups.com
Thanks Raghu. For the second issue, it would be great if you could give me an example.

ramgo...@gmail.com

Jun 9, 2013, 7:16:15 AM
to elephant...@googlegroups.com
It would be great to have the example; I'm facing the same issue. I'm able to create the data using the standard protobuf tutorial, which produces a file similar to the one above. I also modified the code to generate a base64-encoded file, but it's not mapping. I've attached the Python code with this post, just for reference.
Attachment: add_person2.py

Samir Madhavan

Jun 9, 2013, 10:14:50 AM
to elephant...@googlegroups.com
Raghu, could you elaborate on the "elephant bird binary format"? How do we store the protobuf messages in that format?

Also, just curious: won't base64 line encoding increase the size of the messages?

Raghu Angadi

Jun 9, 2013, 2:35:50 PM
to elephant...@googlegroups.com, ramgo...@gmail.com, samir.m...@gmail.com

First I will describe the simplest method, base64 encoding, since it does not have to deal with binary data. I am attaching a sample file with two 'Person' records.
  • What's in the input file:
    $ lzop -dc  persons.lzo | cat
    CgtKYWNrIERvcnNleRAUGgxqYWNrQHR3aXR0ZXIiEAoMNDE1LTU1NS0xMjM0EAAiEAoMNDE1LTU1NS01Njc4EAE=
    CgpKb2huIFNtaXRoEB4aDGpvaG5AdHdpdHRlciIQCgw2NTAtNTU1LTEyMzQQACIQCgw2NTAtNTU1LTU2NzgQAQ==

  • pig script to load:
    $ pig -x local
    > register [ elephant-bird jars ]
    > a = load 'persons.lzo' using ProtobufPigLoader('com.twitter.data.proto.tutorial.AddressBookProtos.Person');
    > dump a;

  • Output of 'dump a':
    (Jack Dorsey,20,jack@twitter,{(415-555-1234,MOBILE),(415-555-5678,HOME)})
    (John Smith,30,john@twitter,{(650-555-1234,MOBILE),(650-555-5678,HOME)})
This should get you going. You could create these base64-encoded files in various ways, starting from the binary protobufs. The reason you can't just have a series of binary protobufs without any delimiter is that these input formats don't always process a file from beginning to end: a split needs to start at an arbitrary position in the input file and be able to find the start of the next protobuf. The newline in a base64-encoded file serves as that delimiter.
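
For example, here is a minimal Java sketch of one way to write such a file (this assumes the generated Person class from the protobuf tutorial, as used above, and Java 8's java.util.Base64; any base64 codec works):

  import java.io.PrintWriter;
  import java.util.Base64;
  import com.twitter.data.proto.tutorial.AddressBookProtos.Person;

  public class WriteBase64Persons {
    public static void main(String[] args) throws Exception {
      try (PrintWriter out = new PrintWriter("persons.txt")) {
        Person p = Person.newBuilder()
            .setName("Jack Dorsey").setId(20).setEmail("jack@twitter")
            .build();
        // one record per line: base64 of the serialized message
        out.println(Base64.getEncoder().encodeToString(p.toByteArray()));
      }
      // lzop persons.txt afterwards to get something like persons.lzo
    }
  }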

Yes, base64 increases the size, by roughly 30% (every 3 bytes of binary become 4 ASCII characters). There are three other options in elephant-bird: the block format (written with ProtobufBlockWriter), Hadoop sequence files, and RCFile.

Can you tell us whether you already use any Hadoop formats (i.e., how you store and process data in Hadoop)? And describe the higher-level problem a bit more (what you have and what you need to do).

I need to add examples like this to the EB wiki.

Raghu.

Attachment: persons.lzo

Samir Madhavan

Jun 10, 2013, 10:34:30 AM
to Raghu Angadi, ramgo...@gmail.com, elephant...@googlegroups.com

Thanks Raghu, this is really helpful.

We are basically doing some analytics on application server data.
We plan to store the protobuf messages in an MQ system and then transfer them to Hadoop. We wanted something compact, which puts less stress on bandwidth and keeps computation fast.

After going through elephant-bird, and based on our understanding of it, we plan to structure the data pipeline as follows.

1. Read the data from the MQ on an hourly basis into a file on the local file system. While reading, we'll convert the protobuf messages to the elephant-bird block format using ProtobufBlockWriter (see the sketch after this list).

2. Once the data file is populated, we'll lzop it.

3. Transfer the data to Hadoop

4. Run the lzo indexer on it

5. Pig will take care of the rest; we'll query it using the elephant-bird library.
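
For step 1, here's a minimal sketch of what we have in mind, using the tutorial's Person message as a stand-in for our log protobuf (assuming ProtobufBlockWriter's (OutputStream, message class) constructor and write()/finish(), as in the elephant-bird examples):

  import java.io.FileOutputStream;
  import java.io.OutputStream;
  import com.twitter.data.proto.tutorial.AddressBookProtos.Person;
  import com.twitter.elephantbird.util.ProtobufBlockWriter;

  public class BlockWriterSketch {
    public static void main(String[] args) throws Exception {
      try (OutputStream out = new FileOutputStream("logs.block")) {
        ProtobufBlockWriter<Person> writer =
            new ProtobufBlockWriter<Person>(out, Person.class);
        // in practice this loop would drain an hour's worth of MQ messages
        writer.write(Person.newBuilder()
            .setName("Jack Dorsey").setId(20).setEmail("jack@twitter").build());
        writer.finish();  // flush the last partial block
      }
      // steps 2-4: lzop logs.block, copy it to HDFS, run the LZO indexer
    }
  }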

I hope this data pipeline makes sense.

P.S. We are using Kafka for the MQ. We could have written a MapReduce job to pull the data from the MQ, but our data is relatively small and our deliverables are due soon. Camus is great for Avro; something similar for protobuf seems needed. We have approached their community, and I think someone is working on a plugin for protobufs too.

Raghu Angadi

Jun 10, 2013, 11:32:36 AM
to Samir Madhavan, ramgo...@gmail.com, elephant...@googlegroups.com
Hi Samir,

The scheme looks fine. A couple of notes:

While compressing the file, the output stream can create the index inline (set the 'elephantbird.lzo.output.index' Hadoop conf to true). This avoids the extra indexing step.

If you are willing to spend more CPU while compressing, you can increase the LZO compression level: set 'io.compression.codec.lzo.compression.level' to 7.
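
For reference, a sketch of both knobs applied to a Hadoop Configuration (the keys and values are exactly the ones above; the wrapper class is just for illustration):

  import org.apache.hadoop.conf.Configuration;

  public class LzoConf {
    public static Configuration lzoConf() {
      Configuration conf = new Configuration();
      // write the LZO index inline while compressing (skips the indexer step)
      conf.setBoolean("elephantbird.lzo.output.index", true);
      // higher level: more CPU at write time, smaller output
      conf.setInt("io.compression.codec.lzo.compression.level", 7);
      return conf;
    }
  }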

If your protobuf is relatively flat (many fields at the top level), the RCFile option gives the best space savings and access speed.

Raghu.

Samir Madhavan

Jun 10, 2013, 11:39:26 AM
to Raghu Angadi, elephant...@googlegroups.com, Samir Madhavan

Thanks Raghu for all the info :)
