Rhipe Broken Pipe IOException

26 views
Skip to first unread message

Ravi Puranik

unread,
Mar 27, 2015, 3:14:46 PM3/27/15
to rh...@googlegroups.com
Hi Folks,

I am trying to run a simple R M/R job (my first with Rhipe) and am getting the following exception (excerpt from the jobtracker below).

I do have the env variables setup as requested in the installation instructions:

Prerequisite 4: Environment Variables

In order for RHIPE to compile and work correctly, it is good to ensure that the following environment variables are set appropriately:
  • PKG_CONFIG_PATH: make sure this contains a path to where the protobuf*.pc files are located.
  • LD_LIBRARY_PATH: points to the path where R.so and Rhipe.so are located (typically something like __PREFIX__/lib64/R/lib).
  • HADOOP: points to the Hadoop installation directory. It is expected that$HADOOP/bin contains the Hadoop shell executable hadoop.

I am using a single node hadoop env with CDH4.7 and using Rhipe_0.74.0 (also tried with Rhipe_0.73.1 with the same results).

Has anyone encountered a similar issue?

Regards,
Ravi


syslog logs

2015-03-27 09:12:38,125 WARN mapreduce.Counters: Group org.apache.hadoop.mapred.Task$Counter is deprecated. Use org.apache.hadoop.mapreduce.TaskCounter instead
2015-03-27 09:12:40,173 INFO org.apache.hadoop.mapred.TaskRunner: Creating symlink: /mapred/local/taskTracker/distcache/2555781393405466156_-365566873_1543590495/localhost.localdomain/tmp/rhipe-temp-params-424222f73ab50857404d53c521f8e37d <- /mapred/local/taskTracker/cloudera/jobcache/job_201503262000_0005/attempt_201503262000_0005_m_000001_1/work/rhipe-temp-params-424222f73ab50857404d53c521f8e37d
2015-03-27 09:12:40,198 INFO org.apache.hadoop.filecache.TrackerDistributedCacheManager: Creating symlink: /mapred/local/taskTracker/cloudera/jobcache/job_201503262000_0005/jars/.job.jar.crc <- /mapred/local/taskTracker/cloudera/jobcache/job_201503262000_0005/attempt_201503262000_0005_m_000001_1/work/.job.jar.crc
2015-03-27 09:12:40,226 INFO org.apache.hadoop.filecache.TrackerDistributedCacheManager: Creating symlink: /mapred/local/taskTracker/cloudera/jobcache/job_201503262000_0005/jars/job.jar <- /mapred/local/taskTracker/cloudera/jobcache/job_201503262000_0005/attempt_201503262000_0005_m_000001_1/work/job.jar
2015-03-27 09:12:40,551 WARN org.apache.hadoop.conf.Configuration: session.id is deprecated. Instead, use dfs.metrics.session-id
2015-03-27 09:12:40,562 INFO org.apache.hadoop.metrics.jvm.JvmMetrics: Initializing JVM Metrics with processName=MAP, sessionId=
2015-03-27 09:12:42,219 INFO org.apache.hadoop.util.ProcessTree: setsid exited with exit code 0
2015-03-27 09:12:42,254 INFO org.apache.hadoop.mapred.Task:  Using ResourceCalculatorPlugin : org.apache.hadoop.util.LinuxResourceCalculatorPlugin@7df1bd98
2015-03-27 09:12:42,793 INFO org.apache.hadoop.mapred.MapTask: Processing split: hdfs://localhost.localdomain:8020/user/cloudera/access_log:134217728+38239156
2015-03-27 09:12:42,811 INFO org.apache.hadoop.mapred.MapTask: Map output collector class = org.apache.hadoop.mapred.MapTask$MapOutputBuffer
2015-03-27 09:12:42,819 INFO org.apache.hadoop.mapred.MapTask: io.sort.mb = 50
2015-03-27 09:12:43,027 INFO org.apache.hadoop.mapred.MapTask: data buffer = 39845888/49807360
2015-03-27 09:12:45,270 INFO org.apache.hadoop.mapred.MapTask: record buffer = 131072/163840
2015-03-27 09:12:45,622 INFO org.godhuli.rhipe.RHMRHelper: Mapper:Started external program:/home/cloudera/rhipe-runner.sh
2015-03-27 09:12:45,623 INFO org.godhuli.rhipe.RHMRHelper: Mapper:Started Error Thread
2015-03-27 09:12:45,639 INFO org.godhuli.rhipe.RHMRHelper: Mapper:Started Output Thread
2015-03-27 09:12:45,732 INFO org.godhuli.rhipe.RHMRHelper: Mapper:MROutputThread done
2015-03-27 09:12:45,732 INFO org.godhuli.rhipe.RHMRHelper: Mapper:MRErrorThread done
2015-03-27 09:12:45,737 INFO org.godhuli.rhipe.RHMRMapper: QUIIIITING:127
2015-03-27 09:12:45,749 INFO org.apache.hadoop.mapred.TaskLogsTruncater: Initializing logs' truncater with mapRetainSize=-1 and reduceRetainSize=-1
2015-03-27 09:12:45,758 ERROR org.apache.hadoop.security.UserGroupInformation: PriviledgedActionException as:cloudera (auth:SIMPLE) cause:java.io.IOException: java.io.IOException: Broken pipe
2015-03-27 09:12:45,758 WARN org.apache.hadoop.mapred.Child: Error running child
java.io.IOException: java.io.IOException: Broken pipe
at org.godhuli.rhipe.RHMRMapper.map(RHMRMapper.java:113)
at org.godhuli.rhipe.RHMRMapper.run(RHMRMapper.java:56)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:672)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:330)
at org.apache.hadoop.mapred.Child$4.run(Child.java:268)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1438)
at org.apache.hadoop.mapred.Child.main(Child.java:262)
Caused by: java.io.IOException: Broken pipe
at java.io.FileOutputStream.writeBytes(Native Method)
at java.io.FileOutputStream.write(FileOutputStream.java:282)
at java.io.BufferedOutputStream.write(BufferedOutputStream.java:105)
at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:65)
at java.io.BufferedOutputStream.write(BufferedOutputStream.java:109)
at java.io.DataOutputStream.write(DataOutputStream.java:90)
at org.godhuli.rhipe.RHBytesWritable.write(RHBytesWritable.java:121)
at org.godhuli.rhipe.RHMRHelper.write(RHMRHelper.java:305)
at org.godhuli.rhipe.RHMRMapper.map(RHMRMapper.java:107)
... 8 more
Reply all
Reply to author
Forward
0 new messages