trying to run scalding on hadoop

282 views
Skip to first unread message

Marton Trencseni

unread,
Nov 10, 2012, 6:25:49 AM11/10/12
to cascadi...@googlegroups.com
Hi guys, great job with Scalding.
I've installed Scala, Scalding, and can run the tutorials fine in "local" mode. I'm now trying to run it on the Hadoop cluster.
I've extracted the appropriate command from the scald.rb script to be:

HADOOP_CLASSPATH=/usr/share/java/hadoop-lzo-0.4.15.jar:/usr/local/scalding/target/scalding-assembly-0.8.2-SNAPSHOT.jar:/tmp/Tutorial0.jar hadoop jar /usr/local/scalding/target/scalding-assembly-0.8.2-SNAPSHOT.jar -libjars /tmp/Tutorial0.jar -Dmapred.reduce.tasks=20 -Dmapred.min.split.size=2000000000 Tutorial0 --hdfs

I don't have hadoop-lzo-0.4.15.jar but everything else is in order, but executing this on my Hadoop cluster (2.0.0-mr1-cdh4.1.2) I get, after minutes of waiting:

SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/usr/lib/zookeeper/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/usr/local/scalding/target/scalding-assembly-0.8.2-SNAPSHOT.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
Exception in thread "main" java.lang.UnsupportedOperationException: Not implemented by the DistributedFileSystem FileSystem implementation
    at org.apache.hadoop.fs.FileSystem.getScheme(FileSystem.java:200)
    at org.apache.hadoop.fs.FileSystem.loadFileSystems(FileSystem.java:2186)
    at org.apache.hadoop.fs.FileSystem.getFileSystemClass(FileSystem.java:2196)
    at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2213)
    at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:80)
    at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2252)
    at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2234)
    at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:300)
    at org.apache.hadoop.fs.FileSystem.getLocal(FileSystem.java:271)
    at org.apache.hadoop.util.GenericOptionsParser.validateFiles(GenericOptionsParser.java:383)
    at org.apache.hadoop.util.GenericOptionsParser.processGeneralOptions(GenericOptionsParser.java:281)
    at org.apache.hadoop.util.GenericOptionsParser.parseGeneralOptions(GenericOptionsParser.java:422)
    at org.apache.hadoop.util.GenericOptionsParser.<init>(GenericOptionsParser.java:168)
    at org.apache.hadoop.util.GenericOptionsParser.<init>(GenericOptionsParser.java:151)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:64)
    at com.twitter.scalding.Tool$.main(Tool.scala:128)
    at com.twitter.scalding.Tool.main(Tool.scala)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.apache.hadoop.util.RunJar.main(RunJar.java:208)

What's the problem, and what is it doing for minutes before it throws this exception?

Oscar Boykin

unread,
Nov 12, 2012, 2:29:53 PM11/12/12
to cascadi...@googlegroups.com
I'm not sure if Cascading is supported on Hadoop 2.0, is it Chris?

Am I reading that version number correctly?



--
You received this message because you are subscribed to the Google Groups "cascading-user" group.
To view this discussion on the web visit https://groups.google.com/d/msg/cascading-user/-/OEPGRrQkMqUJ.
To post to this group, send email to cascadi...@googlegroups.com.
To unsubscribe from this group, send email to cascading-use...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/cascading-user?hl=en.



--
Oscar Boykin :: @posco :: https://twitter.com/intent/user?screen_name=posco

Chris K Wensel

unread,
Nov 12, 2012, 7:29:33 PM11/12/12
to cascadi...@googlegroups.com
> I'm not sure if Cascading is supported on Hadoop 2.0, is it Chris?
>
> Am I reading that version number correctly?
>

as there is no Hadoop 2.0, that would be negatory..

http://www.cascading.org/support/compatibility/

we only support stable Apache releases or those on the list above.

ckw

--
Chris K Wensel
ch...@concurrentinc.com
http://concurrentinc.com

Sam Ritchie

unread,
Nov 12, 2012, 7:58:36 PM11/12/12
to cascading-user
It's in alpha: http://hadoop.apache.org/releases.html#9+October%2C+2012%3A+Release+2.0.2-alpha+available


--
You received this message because you are subscribed to the Google Groups "cascading-user" group.
To post to this group, send email to cascadi...@googlegroups.com.
To unsubscribe from this group, send email to cascading-use...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/cascading-user?hl=en.




--
Sam Ritchie, Twitter Inc
@sritchie

(Too brief? Here's why! http://emailcharter.org)

Chris K Wensel

unread,
Nov 12, 2012, 8:10:17 PM11/12/12
to cascadi...@googlegroups.com
Reply all
Reply to author
Forward
0 new messages