Hi All,
I have just set up RHadoop and I am trying to run a very simple map reduce job.
Sys.setenv(HADOOP_HOME="/opt/mapr/hadoop/hadoop-0.20.2/")
Sys.setenv(HADOOP_CMD="/opt/mapr/hadoop/hadoop-0.20.2/bin/hadoop")
Sys.setenv(HADOOP_CONF="/opt/mapr/hadoop/conf")
Sys.setenv(HADOOP_STREAMING="/opt/mapr/hadoop/hadoop-0.20.2/contrib/streaming/hadoop-0.20.2-dev-streaming.jar")
Sys.getenv("HADOOP_CMD")
library(rmr2);library(rhdfs);library(plyrmr); library(jsonlite);library(functional);library(ravro);
hdfs.init()
# help(hadoop.settings)
hdfs.delete("/user/user1/testInds")
ints = to.dfs(1:100,"/user/user1A/testInds")
I am getting the following error.
packageJobJar: [/tmp/hadoop-user1/hadoop-unjar4761534670667064596/] [] /tmp/streamjob6026865890801465662.jar tmpDir=null
15/03/25 05:04:59 INFO fs.JobTrackerWatcher: Current running JobTracker is: hdp0029/
10.106.128.39:900115/03/25 05:04:59 INFO mapred.FileInputFormat: Total input paths to process : 1
15/03/25 05:04:59 INFO mapred.JobClient: Creating job's output directory at maprfs:/tmp/file8392754a6419
15/03/25 05:04:59 INFO mapred.JobClient: Creating job's user history location directory at maprfs:/tmp/file8392754a6419/_logs
15/03/25 05:04:59 INFO mapred.JobClient: user1, realuser: null
15/03/25 05:04:59 INFO streaming.StreamJob: getLocalDirs(): [/tmp/mapr-hadoop/mapred/local]
15/03/25 05:04:59 INFO streaming.StreamJob: Running job: job_201503181218_4375
15/03/25 05:04:59 INFO streaming.StreamJob: To kill this job, run:
15/03/25 05:04:59 INFO streaming.StreamJob: /opt/mapr/hadoop/hadoop-0.20.2/bin/../bin/hadoop job -Dmapred.job.tracker=maprfs:/// -kill job_201503181218_4375
15/03/25 05:05:00 INFO streaming.StreamJob: map 0% reduce 0%
15/03/25 05:05:22 INFO streaming.StreamJob: map 100% reduce 100%
15/03/25 05:05:22 INFO streaming.StreamJob: To kill this job, run:
15/03/25 05:05:22 INFO streaming.StreamJob: /opt/mapr/hadoop/hadoop-0.20.2/bin/../bin/hadoop job -Dmapred.job.tracker=maprfs:/// -kill job_201503181218_4375
15/03/25 05:05:22 ERROR streaming.StreamJob: Job not successful. Error: NA
15/03/25 05:05:22 INFO streaming.StreamJob: killJob...
Streaming Command Failed!
Error in mr(map = map, reduce = reduce, combine = combine, vectorized.reduce, :
hadoop streaming failed with error code 1
Deleted maprfs:/tmp/file8392352a63e7
The log file indicates the following issues:
stderr logs
log4j:WARN No appenders could be found for logger (org.apache.hadoop.util.NativeCodeLoader).
log4j:WARN Please initialize the log4j system properly.
java.io.IOException: /tmp/mapr-hadoop/mapred/local/taskTracker/user1/jobcache/job_201503181218_4375/attempt_201503181218_4375_m_000000_3/work/./Rscript is not a file or does not have read permissions
at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:202)
at org.apache.hadoop.streaming.PipeMapper.configure(PipeMapper.java:66)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:88)
at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:64)
at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:117)
at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:34)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:88)
at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:64)
at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:117)
at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:439)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:353)
at org.apache.hadoop.mapred.Child$4.run(Child.java:282)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1151)
at org.apache.hadoop.mapred.Child.main(Child.java:271)
2015-03-25 05:05:17,2238 ERROR Client fs/client/fileclient/cc/writebuf.cc:154 Thread: 32459 FlushWrite failed: File part-00000, error: Stale File handle(116), pfid 2049.2904.1785684, off 0, fid 2049.2904.1785684
Can anyone provide assistance?
Thanks,
Konstantinos