I'm trying to run this code:
Main.java:
public static void main(String[] args) {
...
..
Properties properties = new Properties();
AppProps.setApplicationJarClass(properties, Main.class);
HadoopFlowConnector flowConnector = new HadoopFlowConnector(properties);
Scheme sourceScheme = new queries.ParquetTupleScheme(new Fields("a", "b", "c"));
Tap inTap = new Hfs(sourceScheme, inPath);
...
...
...
}
And I'm getting this error:
java.lang.NoClassDefFoundError: cascading/scheme/Scheme
Here is what I have tried so far:
1)
When I replace this:
Scheme sourceScheme = new ParquetTupleScheme(new Fields("a", "b", "c"));
with this:
Scheme sourceScheme = null;
The error goes away
2)
When I create my own class that extends Scheme<JobConf, RecordReader, OutputCollector, Object[], Object[]>, like ParquetTupleScheme does,
the error goes away
3)
When I try to check whether this error is specific to parquet-cascading with:
Object a = new PigCombiner();
the error goes away
I'm using:
cascading 2.5.1
parquet-cascading 1.3.0
hadoop-core 1.2.1
What am I doing wrong?
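A NoClassDefFoundError means the class was available at compile time but the class loader could not find it at run time, so one way to narrow this down is to ask the running JVM where (or whether) it can load each class involved. The following is only a minimal diagnostic sketch: the class name ClassOrigin and the method describe are hypothetical helpers invented for illustration, not part of Cascading or Parquet. It would need to run with the same classpath as the failing job for the output to be meaningful.

```java
import java.security.CodeSource;

// Hypothetical diagnostic helper: reports whether a class can be loaded
// and which jar or directory it was loaded from.
final class ClassOrigin {

    /** Returns a one-line description of where the named class comes from. */
    static String describe(String className) {
        try {
            // initialize=false: only locate and load the class, do not run static initializers
            Class<?> cls = Class.forName(className, false, ClassOrigin.class.getClassLoader());
            CodeSource source = cls.getProtectionDomain().getCodeSource();
            // Classes loaded by the JDK bootstrap loader have no CodeSource
            String location = (source == null || source.getLocation() == null)
                    ? "bootstrap/JDK"
                    : source.getLocation().toString();
            return className + " -> " + location;
        } catch (ClassNotFoundException e) {
            return className + " -> NOT LOADABLE (" + e + ")";
        } catch (LinkageError e) {
            // NoClassDefFoundError is a LinkageError: the class was found
            // but something it depends on could not be resolved
            return className + " -> NOT LOADABLE (" + e + ")";
        }
    }

    public static void main(String[] args) {
        // Default to the two classes named in the question; pass other names as arguments
        String[] names = args.length > 0
                ? args
                : new String[] {"cascading.scheme.Scheme", "parquet.cascading.ParquetTupleScheme"};
        for (String name : names) {
            System.out.println(describe(name));
        }
    }
}
```

If cascading.scheme.Scheme shows up as not loadable, or the two classes resolve to jars from unexpected locations, that points at the run-time classpath rather than at the code in Main.java.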
--
You received this message because you are subscribed to the Google Groups "cascading-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cascading-use...@googlegroups.com.
To post to this group, send email to cascadi...@googlegroups.com.
Visit this group at http://groups.google.com/group/cascading-user.
To view this discussion on the web visit https://groups.google.com/d/msgid/cascading-user/30119812-263f-409e-951c-88d6966fbb00%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
Exception in thread "main" java.lang.reflect.InvocationTargetException
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
at com.twitter.scalding.Job$.apply(Job.scala:49)
at com.twitter.scalding.Tool.getJob(Tool.scala:51)
at com.twitter.scalding.Tool.run(Tool.scala:71)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
at JobRunner$.main(JobRunner.scala:28)
at RawLogsJobRunner$delayedInit$body.apply(RawLogsJobRunner.scala:21)
at scala.Function0$class.apply$mcV$sp(Function0.scala:40)
at scala.runtime.AbstractFunction0.apply$mcV$sp(AbstractFunction0.scala:12)
at scala.App$$anonfun$main$1.apply(App.scala:71)
at scala.App$$anonfun$main$1.apply(App.scala:71)
at scala.collection.immutable.List.foreach(List.scala:318)
at scala.collection.generic.TraversableForwarder$class.foreach(TraversableForwarder.scala:32)
at scala.App$class.main(App.scala:71)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.RunJar.main(RunJar.java:208)
Caused by: java.lang.NoSuchMethodError: parquet.cascading.ParquetTupleScheme.<init>(Lcascading/tuple/Fields;Lcascading/tuple/Fields;Ljava/lang/String;)V
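The descriptor in this NoSuchMethodError decodes to a constructor ParquetTupleScheme(Fields, Fields, String), which the ParquetTupleScheme class loaded at run time does not have. That normally indicates that the jar on the run-time classpath is a different version from the one the job was compiled against. A rough way to confirm this is to list the constructors the loaded class actually exposes. This is a sketch only: ConstructorLister and constructorsOf are hypothetical names made up for this example, and it has to be run with the cluster's classpath to show what the job really sees.

```java
import java.lang.reflect.Constructor;
import java.util.ArrayList;
import java.util.List;

// Hypothetical diagnostic helper: lists the public constructors of a class
// as they exist in whatever jar the current classpath resolves it from.
final class ConstructorLister {

    /** Returns the public constructor signatures of the named class. */
    static List<String> constructorsOf(String className) {
        Class<?> cls;
        try {
            // initialize=false: load the class without running static initializers
            cls = Class.forName(className, false, ConstructorLister.class.getClassLoader());
        } catch (ClassNotFoundException e) {
            throw new IllegalArgumentException("Class not on classpath: " + className, e);
        }
        List<String> signatures = new ArrayList<String>();
        for (Constructor<?> ctor : cls.getConstructors()) {
            // toString() yields e.g. "public java.lang.StringBuilder(java.lang.String)"
            signatures.add(ctor.toString());
        }
        return signatures;
    }

    public static void main(String[] args) {
        // Defaults to the class from the stack trace; pass another name as the first argument
        String name = args.length > 0 ? args[0] : "parquet.cascading.ParquetTupleScheme";
        try {
            for (String signature : constructorsOf(name)) {
                System.out.println(signature);
            }
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

If the three-argument constructor is missing from the printed list, the parquet-cascading jar being picked up at run time is not the version the job was built with.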
echo "Fixing CDH lib/Parquet"
echo "----------------------"
cd /opt/cloudera/parcels/CDH-4.5.0-1.cdh4.5.0.p0.30/lib/parquet
wget http://central.maven.org/maven2/com/twitter/parquet-cascading/1.5.0/parquet-cascading-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-column/1.5.0/parquet-column-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-common/1.5.0/parquet-common-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-encoding/1.5.0/parquet-encoding-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-format/1.5.0/parquet-format-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-generator/1.5.0/parquet-generator-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-hadoop/1.5.0/parquet-hadoop-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-hadoop-bundle/1.5.0/parquet-hadoop-bundle-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-jackson/1.5.0/parquet-jackson-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-pig/1.5.0/parquet-pig-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-pig-bundle/1.5.0/parquet-pig-bundle-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-hive/1.5.0/parquet-hive-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-thrift/1.5.0/parquet-thrift-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-avro/1.5.0/parquet-avro-1.5.0.jar
mkdir BACKUP
mv -f *-cdh4.5.0.jar BACKUP/
echo "Fixing CDH lib/hadoop"
echo "---------------------"
cd /opt/cloudera/parcels/CDH-4.5.0-1.cdh4.5.0.p0.30/lib/hadoop/
mkdir BACKUP
mv original-parquet-* BACKUP/
mv parquet-* BACKUP/
wget http://central.maven.org/maven2/com/twitter/parquet-cascading/1.5.0/parquet-cascading-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-column/1.5.0/parquet-column-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-common/1.5.0/parquet-common-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-encoding/1.5.0/parquet-encoding-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-format/1.5.0/parquet-format-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-generator/1.5.0/parquet-generator-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-hadoop/1.5.0/parquet-hadoop-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-hadoop-bundle/1.5.0/parquet-hadoop-bundle-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-jackson/1.5.0/parquet-jackson-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-pig/1.5.0/parquet-pig-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-pig-bundle/1.5.0/parquet-pig-bundle-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-hive/1.5.0/parquet-hive-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-thrift/1.5.0/parquet-thrift-1.5.0.jar
wget http://central.maven.org/maven2/com/twitter/parquet-avro/1.5.0/parquet-avro-1.5.0.jar
2014-07-02 15:49:49,128 WARN org.apache.hadoop.mapred.Child: Error running child
java.io.EOFException
at java.io.DataInputStream.readFully(DataInputStream.java:180)
at java.io.DataInputStream.readUTF(DataInputStream.java:592)
at java.io.DataInputStream.readUTF(DataInputStream.java:547)
at parquet.hadoop.ParquetInputSplit.readFields(ParquetInputSplit.java:177)
at parquet.hadoop.mapred.DeprecatedParquetInputFormat$ParquetInputSplitWrapper.readFields(DeprecatedParquetInputFormat.java:196)
at cascading.tap.hadoop.io.MultiInputSplit.readFields(MultiInputSplit.java:151)
at org.apache.hadoop.io.serializer.WritableSerialization$WritableDeserializer.deserialize(WritableSerialization.java:73)
at org.apache.hadoop.io.serializer.WritableSerialization$WritableDeserializer.deserialize(WritableSerialization.java:44)
at org.apache.hadoop.mapred.MapTask.getSplitDetails(MapTask.java:356)
at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:388)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:332)
at org.apache.hadoop.mapred.Child$4.run(Child.java:268)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1408)
at org.apache.hadoop.mapred.Child.main(Child.java:262)
2014-07-02 15:49:49,132 INFO org.apache.hadoop.mapred.Task: Runnning cleanup for the task