Antonio
Thank you very much for the quick response.Sorry, I did not provide enough details earlier. Attached are the R script and the input file.
This is what we installed.
-bash-4.1$ R --version
Oracle Distribution of R version 3.1.1 (--) -- "Sock it to Me"
Copyright (C) The R Foundation for Statistical Computing
Platform: x86_64-unknown-linux-gnu (64-bit)
-bash-4.1$ rpm -qa | sort | grep 'R'
libRmath-3.1.1-2.el6.x86_64
libRmath-devel-3.1.1-2.el6.x86_64
perl-URI-1.40-2.el6.noarch
R-3.1.1-2.el6.x86_64
R-core-3.1.1-2.el6.x86_64
R-devel-3.1.1-2.el6.x86_64
git clone git://
github.com/RevolutionAnalytics/rmr2.gitPackage: rmr2
Type: Package
Title: R and Hadoop Streaming Connector
Version: 3.3.1
Date: 2014-12-2
Author: Revolution Analytics
Depends: R (>= 2.6.0), methods
Imports: Rcpp, RJSONIO (>= 0.8-2), digest, functional, reshape2, stringr, plyr, caTools (>= 1.16)
Suggests: quickcheck (>= 3.0.0), ravro, rhdfs, testthat
Collate: basic.R extras.R hdfs.R keyval.R IO.R local.R mapreduce.R parse-url.R quickcheck-rmr.R streaming.R
Maintainer: Revolution Analytics <
rha...@revolutionanalytics.com>
Description: Supports the map reduce programming model on top of hadoop streaming
License: Apache License (== 2.0)
ByteCompile: TRUE
BuildVignettes: FALSE
-bash-4.1$ hadoop version
Hadoop 2.5.0-cdh5.3.0
I tried without reducer(s).
with
stocks.txt (fields separated by comma) as input
and
mapreduce(input = input,
input.format = "text",
map = s.map,
combine = NULL)it works (invokes 2 mappers)
with
stocks.csv (fields separated by comma) as input...
and
mapreduce(input = input,
input.format = "csv",
map = s.map,
combine = NULL)I get this error
15/02/25 11:03:49 INFO mapreduce.Job: Task Id : attempt_1424375261738_0109_m_000001_0, Status : FAILED
Error: java.lang.RuntimeException: PipeMapRed.waitOutputThreads(): subprocess failed with code 1
at org.apache.hadoop.streaming.PipeMapRed.waitOutputThreads(PipeMapRed.java:320)
at org.apache.hadoop.streaming.PipeMapRed.mapRedFinished(PipeMapRed.java:533)
at org.apache.hadoop.streaming.PipeMapper.close(PipeMapper.java:130)
at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:61)
at org.apache.hadoop.streaming.PipeMapRunner.run(PipeMapRunner.java:34)
at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:450)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:343)
at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:168)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1642)
at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:163)
---
with
stocks.txt (fields separated by comma) as input and