I don’t have a reproducible example, just a question. I am running a MapReduce job (with a combiner). I first ran it with the stock Hadoop configuration and then again using the backend parameters option to set the number of reducers. I am aware that setting the number of mappers/reducers through the backend parameters is not recommended and that this option may not be available in future versions of rmr.
The problem is that I could never get the job to use more than one reducer: both runs used a single reducer. My map step outputs three unique keys, so I tried to force three reducers, which didn’t work. I know that three is not a large number of reduce input groups, but I expected Hadoop to use three reducers. Even with a larger number of reduce input groups I couldn’t get it to use more than one.
What am I doing wrong? Is this more of a Hadoop question than an rmr one?
I am using rmr version 2.0.2 on a virtual cluster with five nodes (8 cores, 16 GB RAM). There is no load on the cluster other than my job.
Thanks in advance!
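
For context on the "three keys, so three reducers" expectation: as far as I know, stock Hadoop does not derive the number of reduce tasks from the number of distinct keys. It is a job setting (`mapred.reduce.tasks` in Hadoop 1.x, which defaults to 1 unless the cluster or job overrides it), and the default hash partitioner then assigns each key to one of those tasks. The sketch below only illustrates that principle. It assumes plain string keys hashed the way Java's `String.hashCode()` does; the function names are hypothetical, and the real rmr/streaming key serialization will hash differently.

```python
def java_string_hash(s: str) -> int:
    """Mimic Java's String.hashCode() for ASCII strings (signed 32-bit)."""
    h = 0
    for ch in s:
        h = (31 * h + ord(ch)) & 0xFFFFFFFF  # keep 32 bits, like Java int overflow
    return h - 0x100000000 if h >= 0x80000000 else h


def partition(key: str, num_reduce_tasks: int) -> int:
    """Hash partitioning: (hash & Integer.MAX_VALUE) % numReduceTasks."""
    return (java_string_hash(key) & 0x7FFFFFFF) % num_reduce_tasks


def reducers_used(keys, num_reduce_tasks: int) -> set:
    """Return the set of reduce task indices that receive at least one key."""
    return {partition(k, num_reduce_tasks) for k in keys}


if __name__ == "__main__":
    # With one reduce task configured, every key goes to task 0.
    print(reducers_used(["a", "b", "c"], 1))
    # With three reduce tasks, three keys may spread out ...
    print(reducers_used(["a", "b", "c"], 3))
    # ... or may all collide on the same task, depending on their hashes.
    print(reducers_used(["a", "d", "g"], 3))
```

Under these assumptions, the number of reducers that receive data can be smaller than the number configured, but never larger, so a job that runs with one reduce task configured will use one reducer regardless of how many keys the map step emits.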
2013-05-03 15:10:50,769 INFO org.apache.hadoop.mapred.Child: JVM: jvm_201303121101_0397_m_205155044 pid: 31422
2013-05-03 15:10:51,070 INFO org.apache.hadoop.filecache.TrackerDistributedCacheManager: Creating symlink: /tmp/mapr-hadoop/mapred/local/taskTracker/asingh2/jobcache/job_201303121101_0397/jars/rmr-local-env18e23ac907ef <- /tmp/mapr-hadoop/mapred/local/taskTracker/asingh2/jobcache/job_201303121101_0397/attempt_201303121101_0397_m_000000_1/work/rmr-local-env18e23ac907ef
2013-05-03 15:10:51,071 INFO org.apache.hadoop.filecache.TrackerDistributedCacheManager: Creating symlink: /tmp/mapr-hadoop/mapred/local/taskTracker/asingh2/jobcache/job_201303121101_0397/jars/job.jar <- /tmp/mapr-hadoop/mapred/local/taskTracker/asingh2/jobcache/job_201303121101_0397/attempt_201303121101_0397_m_000000_1/work/job.jar
2013-05-03 15:10:51,072 INFO org.apache.hadoop.filecache.TrackerDistributedCacheManager: Creating symlink: /tmp/mapr-hadoop/mapred/local/taskTracker/asingh2/jobcache/job_201303121101_0397/jars/rmr-streaming-map18e24735a88c <- /tmp/mapr-hadoop/mapred/local/taskTracker/asingh2/jobcache/job_201303121101_0397/attempt_201303121101_0397_m_000000_1/work/rmr-streaming-map18e24735a88c
2013-05-03 15:10:51,073 INFO org.apache.hadoop.filecache.TrackerDistributedCacheManager: Creating symlink: /tmp/mapr-hadoop/mapred/local/taskTracker/asingh2/jobcache/job_201303121101_0397/jars/.job.jar.crc <- /tmp/mapr-hadoop/mapred/local/taskTracker/asingh2/jobcache/job_201303121101_0397/attempt_201303121101_0397_m_000000_1/work/.job.jar.crc
2013-05-03 15:10:51,074 INFO org.apache.hadoop.filecache.TrackerDistributedCacheManager: Creating symlink: /tmp/mapr-hadoop/mapred/local/taskTracker/asingh2/jobcache/job_201303121101_0397/jars/rmr-global-env18e213a8df24 <- /tmp/mapr-hadoop/mapred/local/taskTracker/asingh2/jobcache/job_201303121101_0397/attempt_201303121101_0397_m_000000_1/work/rmr-global-env18e213a8df24
2013-05-03 15:10:51,104 INFO org.apache.hadoop.mapred.Child: Starting task attempt_201303121101_0397_m_000000_1
2013-05-03 15:10:51,105 INFO org.apache.hadoop.metrics.jvm.JvmMetrics: Initializing JVM Metrics with processName=MAP, sessionId=
2013-05-03 15:10:51,139 INFO org.apache.hadoop.mapreduce.util.ProcessTree: setsid exited with exit code 0
2013-05-03 15:10:51,146 WARN org.apache.hadoop.mapreduce.util.ProcfsBasedProcessTree: /proc/<pid>/status does not have information about swap space used(VmSwap). Can not track swap usage of a task.
2013-05-03 15:10:51,146 INFO org.apache.hadoop.mapred.Task: Using ResourceCalculatorPlugin : org.apache.hadoop.mapreduce.util.LinuxResourceCalculatorPlugin@4c53ab04
2013-05-03 15:10:51,493 WARN org.apache.hadoop.io.compress.snappy.LoadSnappy: Snappy native library not loaded
2013-05-03 15:10:51,599 ERROR org.apache.hadoop.streaming.PipeMapRed: configuration exception
java.io.FileNotFoundException: File /tmp/mapr-hadoop/mapred/local/taskTracker/asingh2/jobcache/job_201303121101_0397/attempt_201303121101_0397_m_000000_1/work/Rscript does not exist.
at org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:395)
at org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:257)
at org.apache.hadoop.fs.FileUtil.chmod(FileUtil.java:738)
at org.apache.hadoop.fs.FileUtil.chmod(FileUtil.java:713)
at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:185)
at org.apache.hadoop.streaming.PipeMapper.configure(PipeMapper.java:66)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:88)
at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:64)
at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:117)
at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:34)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:88)
at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:64)
at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:117)
at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:413)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:348)
at org.apache.hadoop.mapred.Child$4.run(Child.java:270)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1127)
at org.apache.hadoop.mapred.Child.main(Child.java:264)
2013-05-03 15:10:51,612 INFO org.apache.hadoop.mapred.TaskLogsTruncater: Initializing logs' truncater with mapRetainSize=-1 and reduceRetainSize=-1
2013-05-03 15:10:51,626 WARN org.apache.hadoop.mapred.Child: Error running child
java.lang.RuntimeException: Error in configuring object
at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:93)
at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:64)
at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:117)
at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:413)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:348)
at org.apache.hadoop.mapred.Child$4.run(Child.java:270)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1127)
at org.apache.hadoop.mapred.Child.main(Child.java:264)
Caused by: java.lang.reflect.InvocationTargetException
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:88)
... 9 more
Caused by: java.lang.RuntimeException: Error in configuring object
at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:93)
at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:64)
at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:117)
at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:34)
... 14 more
Caused by: java.lang.reflect.InvocationTargetException
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:88)
... 17 more
Caused by: java.lang.RuntimeException: configuration exception
at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:228)
at org.apache.hadoop.streaming.PipeMapper.configure(PipeMapper.java:66)
... 22 more
Caused by: java.io.FileNotFoundException: File /tmp/mapr-hadoop/mapred/local/taskTracker/asingh2/jobcache/job_201303121101_0397/attempt_201303121101_0397_m_000000_1/work/Rscript does not exist.
at org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:395)
at org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:257)
at org.apache.hadoop.fs.FileUtil.chmod(FileUtil.java:738)
at org.apache.hadoop.fs.FileUtil.chmod(FileUtil.java:713)
at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:185)
... 23 more
2013-05-03 15:10:51,631 INFO org.apache.hadoop.mapred.Task: Runnning cleanup for the task