Scalding-user group

505 views
Skip to first unread message

Zafar Gilani

unread,
Jan 15, 2014, 8:46:58 AM1/15/14
to scaldi...@googlegroups.com
Hi,

Is there a scalding-user group? Such as one that answers questions related to twitter/scalding tutorial (https://github.com/twitter/scalding/wiki/Getting-Started)?

I'm getting the following error while trying to do: "scripts/scald.rb --local WorldCountJob.scala --input inputFile.txt --output outputFile.tsv".
The suggestion "scald.rb --clean" doesn't help.

can not find /home/zafar/.sbt/boot/scala-2.9.3/lib/scala-library.jar appending SBT_VERSION [0.12.0] to SBT_HOME
../scripts/scald.rb:146: warning: already initialized constant SBT_HOME
../scripts/scald.rb:147: warning: already initialized constant SCALA_LIB_DIR
downloading hadoop-core-1.1.2.jar from http://repo1.maven.org/maven2/org/apache/hadoop/hadoop-core/1.1.2/hadoop-core-1.1.2.jar...
SocketError in downloading hadoop-core-1.1.2.jar: getaddrinfo: Name or service not known
downloading commons-codec-1.8.jar from http://repo1.maven.org/maven2/commons-codec/commons-codec/1.8/commons-codec-1.8.jar...
SocketError in downloading commons-codec-1.8.jar: getaddrinfo: Name or service not known
downloading commons-configuration-1.9.jar from http://repo1.maven.org/maven2/commons-configuration/commons-configuration/1.9/commons-configuration-1.9.jar...
SocketError in downloading commons-configuration-1.9.jar: getaddrinfo: Name or service not known
downloading jackson-asl-0.9.5.jar from http://repo1.maven.org/maven2/org/codehaus/jackson/jackson-asl/0.9.5/jackson-asl-0.9.5.jar...
SocketError in downloading jackson-asl-0.9.5.jar: getaddrinfo: Name or service not known
downloading jackson-mapper-asl-1.9.13.jar from http://repo1.maven.org/maven2/org/codehaus/jackson/jackson-mapper-asl/1.9.13/jackson-mapper-asl-1.9.13.jar...
SocketError in downloading jackson-mapper-asl-1.9.13.jar: getaddrinfo: Name or service not known
downloading slf4j-log4j12-1.6.6.jar from http://repo1.maven.org/maven2/org/slf4j/slf4j-log4j12/1.6.6/slf4j-log4j12-1.6.6.jar...
SocketError in downloading slf4j-log4j12-1.6.6.jar: getaddrinfo: Name or service not known
downloading log4j-1.2.15.jar from http://repo1.maven.org/maven2/log4j/log4j/1.2.15/log4j-1.2.15.jar...
SocketError in downloading log4j-1.2.15.jar: getaddrinfo: Name or service not known
downloading commons-httpclient-3.1.jar from http://repo1.maven.org/maven2/commons-httpclient/commons-httpclient/3.1/commons-httpclient-3.1.jar...
SocketError in downloading commons-httpclient-3.1.jar: getaddrinfo: Name or service not known
downloading commons-cli-1.2.jar from http://repo1.maven.org/maven2/commons-cli/commons-cli/1.2/commons-cli-1.2.jar...
SocketError in downloading commons-cli-1.2.jar: getaddrinfo: Name or service not known
downloading commons-logging-1.1.1.jar from http://repo1.maven.org/maven2/commons-logging/commons-logging/1.1.1/commons-logging-1.1.1.jar...
SocketError in downloading commons-logging-1.1.1.jar: getaddrinfo: Name or service not known
downloading zookeeper-3.3.4.jar from http://repo1.maven.org/maven2/org/apache/zookeeper/zookeeper/3.3.4/zookeeper-3.3.4.jar...
SocketError in downloading zookeeper-3.3.4.jar: getaddrinfo: Name or service not known
compiling WorldCountJob.scala
scalac -classpath /tmp/maven/hadoop-core-1.1.2.jar:/tmp/maven/commons-codec-1.8.jar:/tmp/maven/commons-configuration-1.9.jar:/tmp/maven/jackson-asl-0.9.5.jar:/tmp/maven/jackson-mapper-asl-1.9.13.jar:/tmp/maven/slf4j-log4j12-1.6.6.jar:/tmp/maven/log4j-1.2.15.jar:/tmp/maven/commons-httpclient-3.1.jar:/tmp/maven/commons-cli-1.2.jar:/tmp/maven/commons-logging-1.1.1.jar:/tmp/maven/zookeeper-3.3.4.jar:/home/zafar/.sbt/0.12.0/boot/scala-2.9.3/lib/scala-library.jar:/home/zafar/Documents/scalding-work/scalding/scalding-core/target/scala-2.9.3/scalding-core-assembly-0.9.0rc4.jar:/usr/local/jdk1.7.0_45/jre/lib/rt.jar:/usr/local/aspectj1.7/lib/aspectjrt.jar:/usr/local/commons-logging-1.1.3/commons-logging-1.1.3.jar:/usr/local/jdk1.7.0_45/jre/lib/rt.jar:/usr/local/aspectj1.7/lib/aspectjrt.jar:/usr/local/commons-logging-1.1.3/commons-logging-1.1.3.jar:/usr/local/jdk1.7.0_45/jre/lib/rt.jar:/usr/local/aspectj1.7/lib/aspectjrt.jar:/usr/local/commons-logging-1.1.3/commons-logging-1.1.3.jar:/usr/local/jdk1.7.0_45/jre/lib/rt.jar:/usr/local/aspectj1.7/lib/aspectjrt.jar:/usr/local/commons-logging-1.1.3/commons-logging-1.1.3.jar: -d /tmp/script-build WorldCountJob.scala
Error: Could not find or load main class scala.tools.nsc.Main
[SUGGESTION]: Try scald.rb --clean, you may have corrupt jars lying around

Thanks,
Zafar

Koert Kuipers

unread,
Jan 15, 2014, 12:59:43 PM1/15/14
to Zafar Gilani, scaldi...@googlegroups.com
currently cascading user group is used for scalding related questions


--
You received this message because you are subscribed to the Google Groups "Scalding Development" group.
To unsubscribe from this group and stop receiving emails from it, send an email to scalding-dev...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Oscar Boykin

unread,
Jan 15, 2014, 2:35:12 PM1/15/14
to Zafar Gilani, scaldi...@googlegroups.com
Looks like your networking is not set up right:

 getaddrinfo: Name or service not known

see it repeated below.

--
You received this message because you are subscribed to the Google Groups "Scalding Development" group.
To unsubscribe from this group and stop receiving emails from it, send an email to scalding-dev...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.



--
Oscar Boykin :: @posco :: http://twitter.com/posco

Zafar Gilani

unread,
Jan 15, 2014, 2:49:21 PM1/15/14
to Oscar Boykin, scaldi...@googlegroups.com
That is the strange part. I can wget the resource from any URL and all the other things work fine too. An unrelated but similar error is documented here. Apparently, adding Google DNS solved it for somebody. However, I don't think I am facing that issue. Any other ideas?

Zafar Gilani

unread,
Jan 16, 2014, 9:38:16 AM1/16/14
to scaldi...@googlegroups.com, Oscar Boykin
OK so a system restart sorted out the getaddrinfo error (still not sure why because wget and everything else worked fine). However, I still get the "Could not find or load main class scala.tools.nsc.Main" error message. Suggestion "../scripts/scald.rb --clean" didn't resolve anything.

$ ../scripts/scald.rb --local WorldCountJob.scala --input inputFile.txt --output outputFile.tsv

can not find /home/zafar/.sbt/boot/scala-2.9.3/lib/scala-library.jar appending SBT_VERSION [0.12.0] to SBT_HOME
../scripts/scald.rb:146: warning: already initialized constant SBT_HOME
../scripts/scald.rb:147: warning: already initialized constant SCALA_LIB_DIR
downloading hadoop-core-1.1.2.jar from http://repo1.maven.org/maven2/org/apache/hadoop/hadoop-core/1.1.2/hadoop-core-1.1.2.jar...
Successfully downloaded hadoop-core-1.1.2.jar!
Successfully downloaded commons-codec-1.8.jar!
Successfully downloaded commons-configuration-1.9.jar!
Successfully downloaded jackson-asl-0.9.5.jar!
Successfully downloaded jackson-mapper-asl-1.9.13.jar!
Successfully downloaded slf4j-log4j12-1.6.6.jar!
Successfully downloaded log4j-1.2.15.jar!
Successfully downloaded commons-httpclient-3.1.jar!
Successfully downloaded commons-cli-1.2.jar!
Successfully downloaded commons-logging-1.1.1.jar!
Successfully downloaded zookeeper-3.3.4.jar!
compiling WorldCountJob.scala
scalac -classpath /tmp/maven/hadoop-core-1.1.2.jar:/tmp/maven/commons-codec-1.8.jar:/tmp/maven/commons-configuration-1.9.jar:/tmp/maven/jackson-asl-0.9.5.jar:/tmp/maven/jackson-mapper-asl-1.9.13.jar:/tmp/maven/slf4j-log4j12-1.6.6.jar:/tmp/maven/log4j-1.2.15.jar:/tmp/maven/commons-httpclient-3.1.jar:/tmp/maven/commons-cli-1.2.jar:/tmp/maven/commons-logging-1.1.1.jar:/tmp/maven/zookeeper-3.3.4.jar:/home/zafar/.sbt/0.12.0/boot/scala-2.9.3/lib/scala-library.jar:/home/zafar/Documents/scalding-work/scalding/scalding-core/target/scala-2.9.3/scalding-core-assembly-0.9.0rc4.jar:/usr/local/jdk1.7.0_45/jre/lib/rt.jar:/usr/local/aspectj1.7/lib/aspectjrt.jar:/usr/local/commons-logging-1.1.3/commons-logging-1.1.3.jar: -d /tmp/script-build WorldCountJob.scala

Error: Could not find or load main class scala.tools.nsc.Main
[SUGGESTION]: Try scald.rb --clean, you may have corrupt jars lying around


see it repeated below.

To unsubscribe from this group and stop receiving emails from it, send an email to scalding-dev+unsubscribe@googlegroups.com.

For more options, visit https://groups.google.com/groups/opt_out.

Zafar Gilani

unread,
Jan 17, 2014, 10:30:57 AM1/17/14
to scaldi...@googlegroups.com, Oscar Boykin
Here's a problem that I earlier overlooked:

can not find /home/zafar/.sbt/boot/scala-2.9.3/lib/scala-library.jar appending SBT_VERSION [0.12.0] to SBT_HOME

sbt (sbt update) creates scala-2.10.2 under ~/.sbt/boot and the scald.rb script complains about not finding scala-2.9.3 under ~/.sbt/boot. I cleaned up sbt environment, installed scala-2.9.3 (had scala-2.10.3 before), pointed all the scala environment variables to scala-2.9.3 and re-ran. It still produced the same error (sbt still points to scala-2.10.2). After no luck, I manually created ~/.sbt/boot/scala-2.9.3. It then threw the following error.

$ ../scripts/scald.rb --local WordCountJob.scala --input data/inputWordCount.txt --output data/outputWordCount.tsv
Exception in thread "main" java.lang.NoClassDefFoundError: scala/reflect/internal/Trees
    at java.lang.ClassLoader.defineClass1(Native Method)
    at java.lang.ClassLoader.defineClass(ClassLoader.java:800)
    at java.security.SecureClassLoader.defineClass(SecureClassLoader.java:142)
    at java.net.URLClassLoader.defineClass(URLClassLoader.java:449)
    at java.net.URLClassLoader.access$100(URLClassLoader.java:71)
    at java.net.URLClassLoader$1.run(URLClassLoader.java:361)
    at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
    at java.security.AccessController.doPrivileged(Native Method)
    at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
    at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
    at java.lang.ClassLoader.defineClass1(Native Method)
    at java.lang.ClassLoader.defineClass(ClassLoader.java:800)
    at java.security.SecureClassLoader.defineClass(SecureClassLoader.java:142)
    at java.net.URLClassLoader.defineClass(URLClassLoader.java:449)
    at java.net.URLClassLoader.access$100(URLClassLoader.java:71)
    at java.net.URLClassLoader$1.run(URLClassLoader.java:361)
    at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
    at java.security.AccessController.doPrivileged(Native Method)
    at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
    at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
    at java.lang.Class.getDeclaredMethods0(Native Method)
    at java.lang.Class.privateGetDeclaredMethods(Class.java:2531)
    at java.lang.Class.getMethod0(Class.java:2774)
    at java.lang.Class.getMethod(Class.java:1663)
    at sun.launcher.LauncherHelper.getMainMethod(LauncherHelper.java:494)
    at sun.launcher.LauncherHelper.checkAndLoadMain(LauncherHelper.java:486)
Caused by: java.lang.ClassNotFoundException: scala.reflect.internal.Trees
    at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
    at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
    at java.security.AccessController.doPrivileged(Native Method)
    at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
    at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
    ... 30 more

[SUGGESTION]: Try scald.rb --clean, you may have corrupt jars lying around


Oscar Boykin

unread,
Jan 17, 2014, 4:03:56 PM1/17/14
to Zafar Gilani, scaldi...@googlegroups.com
So, scald.rb is a bit brittle. You don't need to use that class, and I think largely it is becoming a distraction.

If you want to hack it into working, great. Otherwise, I think it depends on having a specific sbt + scala installation.

Can I suggest one of the sbt only based examples:

for instance?

Zafar Gilani

unread,
Jan 20, 2014, 7:27:17 AM1/20/14
to scaldi...@googlegroups.com, Zafar Gilani
So, the sbt only example works but not before requiring some work. Can you please put the following execution command under "Running on your own Hadoop cluster":

hadoop jar target/scalding-example-project-0.0.4.jar com.snowplowanalytics.hadoop.scalding.WordCountJob --local --input data/inputWC.txt --output outputWC.txt
see it repeated below.

To unsubscribe from this group and stop receiving emails from it, send an email to scalding-dev...@googlegroups.com.

For more options, visit https://groups.google.com/groups/opt_out.



--
Oscar Boykin :: @posco :: http://twitter.com/posco
Reply all
Reply to author
Forward
0 new messages