a)
/ex1.1
13/03/20 14:32:29 INFO jvm.JvmMetrics: Cannot initialize JVM Metrics with processName=JobTracker, sessionId= - already initialized
13/03/20 14:32:29 WARN mapred.JobClient: Use GenericOptionsParser for parsing the arguments. Applications should implement Tool for the same.
13/03/20 14:32:29 INFO input.FileInputFormat: Total input paths to process : 1
13/03/20 14:32:29 WARN conf.Configuration: dfs.df.interval is deprecated. Instead, use fs.df.interval
13/03/20 14:32:29 WARN conf.Configuration: dfs.max.objects is deprecated. Instead, use dfs.namenode.max.objects
13/03/20 14:32:29 WARN conf.Configuration: hadoop.native.lib is deprecated. Instead, use io.native.lib.available
13/03/20 14:32:29 WARN conf.Configuration: dfs.data.dir is deprecated. Instead, use dfs.datanode.data.dir
13/03/20 14:32:29 WARN conf.Configuration: dfs.name.dir is deprecated. Instead, use dfs.namenode.name.dir
13/03/20 14:32:29 WARN conf.Configuration: fs.default.name is deprecated. Instead, use fs.defaultFS
13/03/20 14:32:29 WARN conf.Configuration: fs.checkpoint.dir is deprecated. Instead, use dfs.namenode.checkpoint.dir
13/03/20 14:32:29 WARN conf.Configuration: dfs.block.size is deprecated. Instead, use dfs.blocksize
13/03/20 14:32:29 WARN conf.Configuration: dfs.access.time.precision is deprecated. Instead, use dfs.namenode.accesstime.precision
13/03/20 14:32:29 WARN conf.Configuration: dfs.replication.min is deprecated. Instead, use dfs.namenode.replication.min
13/03/20 14:32:29 WARN conf.Configuration: dfs.name.edits.dir is deprecated. Instead, use dfs.namenode.edits.dir
13/03/20 14:32:29 WARN conf.Configuration: dfs.replication.considerLoad is deprecated. Instead, use dfs.namenode.replication.considerLoad
13/03/20 14:32:29 WARN conf.Configuration: dfs.balance.bandwidthPerSec is deprecated. Instead, use dfs.datanode.balance.bandwidthPerSec
13/03/20 14:32:29 WARN conf.Configuration: dfs.safemode.threshold.pct is deprecated. Instead, use dfs.namenode.safemode.threshold-pct
13/03/20 14:32:29 WARN conf.Configuration: dfs.http.address is deprecated. Instead, use dfs.namenode.http-address
13/03/20 14:32:29 WARN conf.Configuration: dfs.name.dir.restore is deprecated. Instead, use dfs.namenode.name.dir.restore
13/03/20 14:32:29 WARN conf.Configuration: dfs.https.client.keystore.resource is deprecated. Instead, use dfs.client.https.keystore.resource
13/03/20 14:32:29 WARN conf.Configuration: dfs.backup.address is deprecated. Instead, use dfs.namenode.backup.address
13/03/20 14:32:29 WARN conf.Configuration: dfs.backup.http.address is deprecated. Instead, use dfs.namenode.backup.http-address
13/03/20 14:32:29 WARN conf.Configuration: dfs.permissions is deprecated. Instead, use dfs.permissions.enabled
13/03/20 14:32:29 WARN conf.Configuration: dfs.safemode.extension is deprecated. Instead, use dfs.namenode.safemode.extension
13/03/20 14:32:29 WARN conf.Configuration: dfs.datanode.max.xcievers is deprecated. Instead, use dfs.datanode.max.transfer.threads
13/03/20 14:32:29 WARN conf.Configuration: dfs.https.need.client.auth is deprecated. Instead, use dfs.client.https.need-auth
13/03/20 14:32:29 WARN conf.Configuration: dfs.https.address is deprecated. Instead, use dfs.namenode.https-address
13/03/20 14:32:29 WARN conf.Configuration: dfs.replication.interval is deprecated. Instead, use dfs.namenode.replication.interval
13/03/20 14:32:29 WARN conf.Configuration: fs.checkpoint.edits.dir is deprecated. Instead, use dfs.namenode.checkpoint.edits.dir
13/03/20 14:32:29 WARN conf.Configuration: dfs.write.packet.size is deprecated. Instead, use dfs.client-write-packet-size
13/03/20 14:32:29 WARN conf.Configuration: dfs.permissions.supergroup is deprecated. Instead, use dfs.permissions.superusergroup
13/03/20 14:32:29 WARN conf.Configuration: topology.script.number.args is deprecated. Instead, use net.topology.script.number.args
13/03/20 14:32:29 WARN conf.Configuration: dfs.secondary.http.address is deprecated. Instead, use dfs.namenode.secondary.http-address
13/03/20 14:32:29 WARN conf.Configuration: fs.checkpoint.period is deprecated. Instead, use dfs.namenode.checkpoint.period
13/03/20 14:32:29 WARN conf.Configuration: topology.node.switch.mapping.impl is deprecated. Instead, use net.topology.node.switch.mapping.impl
13/03/20 14:32:29 WARN conf.Configuration: io.bytes.per.checksum is deprecated. Instead, use dfs.bytes-per-checksum
13/03/20 14:32:30 INFO filecache.TrackerDistributedCacheManager: Creating rhipe-temp-params-96dffe81e6cd9d4d3a8532b1307641ae in /tmp/hadoop-root/mapred/local/archive/-193089761312830351_133345639_118032155/file/tmp-work-8333888260353126556 with rwxr-xr-x
13/03/20 14:32:30 INFO filecache.TrackerDistributedCacheManager: Cached /tmp/rhipe-temp-params-96dffe81e6cd9d4d3a8532b1307641ae#rhipe-temp-params-96dffe81e6cd9d4d3a8532b1307641ae as /tmp/hadoop-root/mapred/local/archive/-193089761312830351_133345639_118032155/file/tmp/rhipe-temp-params-96dffe81e6cd9d4d3a8532b1307641ae
13/03/20 14:32:30 INFO filecache.TrackerDistributedCacheManager: Cached /tmp/rhipe-temp-params-96dffe81e6cd9d4d3a8532b1307641ae#rhipe-temp-params-96dffe81e6cd9d4d3a8532b1307641ae as /tmp/hadoop-root/mapred/local/archive/-193089761312830351_133345639_118032155/file/tmp/rhipe-temp-params-96dffe81e6cd9d4d3a8532b1307641ae
13/03/20 14:32:30 WARN mapred.LocalJobRunner: LocalJobRunner does not support symlinking into current working dir.
13/03/20 14:32:30 INFO mapred.TaskRunner: Creating symlink: /tmp/hadoop-root/mapred/local/archive/-193089761312830351_133345639_118032155/file/tmp/rhipe-temp-params-96dffe81e6cd9d4d3a8532b1307641ae <- /tmp/hadoop-root/mapred/local/localRunner/rhipe-temp-params-96dffe81e6cd9d4d3a8532b1307641ae
13/03/20 14:32:30 INFO filecache.TrackerDistributedCacheManager: Creating symlink: /tmp/hadoop-root/mapred/staging/root1083697648/.staging/job_local_0001/.job.splitmetainfo.crc <- /tmp/hadoop-root/mapred/local/localRunner/.job.splitmetainfo.crc
13/03/20 14:32:30 INFO filecache.TrackerDistributedCacheManager: Creating symlink: /tmp/hadoop-root/mapred/staging/root1083697648/.staging/job_local_0001/.job.xml.crc <- /tmp/hadoop-root/mapred/local/localRunner/.job.xml.crc
13/03/20 14:32:30 INFO filecache.TrackerDistributedCacheManager: Creating symlink: /tmp/hadoop-root/mapred/staging/root1083697648/.staging/job_local_0001/job.xml <- /tmp/hadoop-root/mapred/local/localRunner/job.xml
13/03/20 14:32:30 INFO filecache.TrackerDistributedCacheManager: Creating symlink: /tmp/hadoop-root/mapred/staging/root1083697648/.staging/job_local_0001/job.split <- /tmp/hadoop-root/mapred/local/localRunner/job.split
13/03/20 14:32:30 INFO filecache.TrackerDistributedCacheManager: Creating symlink: /tmp/hadoop-root/mapred/staging/root1083697648/.staging/job_local_0001/.job.split.crc <- /tmp/hadoop-root/mapred/local/localRunner/.job.split.crc
13/03/20 14:32:30 INFO filecache.TrackerDistributedCacheManager: Creating symlink: /tmp/hadoop-root/mapred/staging/root1083697648/.staging/job_local_0001/.job.jar.crc <- /tmp/hadoop-root/mapred/local/localRunner/.job.jar.crc
13/03/20 14:32:30 INFO filecache.TrackerDistributedCacheManager: Creating symlink: /tmp/hadoop-root/mapred/staging/root1083697648/.staging/job_local_0001/job.jar <- /tmp/hadoop-root/mapred/local/localRunner/job.jar
13/03/20 14:32:30 INFO filecache.TrackerDistributedCacheManager: Creating symlink: /tmp/hadoop-root/mapred/staging/root1083697648/.staging/job_local_0001/job.splitmetainfo <- /tmp/hadoop-root/mapred/local/localRunner/job.splitmetainfo
13/03/20 14:32:30 WARN conf.Configuration: fs.default.name is deprecated. Instead, use fs.defaultFS
13/03/20 14:32:30 WARN conf.Configuration: io.bytes.per.checksum is deprecated. Instead, use dfs.bytes-per-checksum
Error in .jcall("RJavaTools", "Ljava/lang/Object;", "invokeMethod", cl, :
java.lang.ArrayIndexOutOfBoundsException: 1
> 13/03/20 14:32:30 INFO mapred.LocalJobRunner: OutputCommitter set in config null
13/03/20 14:32:30 INFO mapred.LocalJobRunner: OutputCommitter is org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter
13/03/20 14:32:30 WARN mapreduce.Counters: Group org.apache.hadoop.mapred.Task$Counter is deprecated. Use org.apache.hadoop.mapreduce.TaskCounter instead
13/03/20 14:32:31 INFO util.ProcessTree: setsid exited with exit code 0
13/03/20 14:32:31 INFO mapred.Task: Using ResourceCalculatorPlugin : org.apache.hadoop.util.LinuxResourceCalculatorPlugin@4987b287
mapred.input.file == file:/bySpecies/part_1
13/03/20 14:32:31 INFO rhipe.RHMRHelper: Mapper:Started external program:R CMD /home/user1/software/amal/R-2.15.1/library/Rhipe/bin/RhipeMapReduce --slave --silent --vanilla
13/03/20 14:32:31 INFO rhipe.RHMRHelper: Mapper:Started Error Thread
13/03/20 14:32:31 INFO rhipe.RHMRHelper: Mapper:Started Output Thread
13/03/20 14:32:32 WARN rhipe.RHMRHelper: Mapper:java.lang.RuntimeException:
R ERROR BEGIN (map.setup):
=============
Error in readChar(con, 5L, useBytes = TRUE) : cannot open the connection
R ERROR END
===========
at org.godhuli.rhipe.RHMRHelper$MRErrorThread.run(RHMRHelper.java:391)
13/03/20 14:32:32 INFO rhipe.RHMRHelper: Mapper:MROutputThread done
13/03/20 14:32:32 WARN mapred.LocalJobRunner: job_local_0001
java.io.IOException: MROutput/MRErrThread failed:java.lang.RuntimeException:
R ERROR BEGIN (map.setup):
=============
Error in readChar(con, 5L, useBytes = TRUE) : cannot open the connection
R ERROR END
===========
at org.godhuli.rhipe.RHMRHelper$MRErrorThread.run(RHMRHelper.java:391)
at org.godhuli.rhipe.RHMRHelper.checkOuterrThreadsThrowable(RHMRHelper.java:232)
at org.godhuli.rhipe.RHMRMapper.run(RHMRMapper.java:68)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:645)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:325)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:263)
13/03/20 14:32:32 INFO filecache.TrackerDistributedCacheManager: Deleted path /tmp/hadoop-root/mapred/local/archive/-193089761312830351_133345639_118032155/file/tmp/rhipe-temp-params-96dffe81e6cd9d4d3a8532b1307641ae
13/03/20 14:32:37 INFO mapred.LocalJobRunner:
b) I will attach the job.xml shortly.
c)
> rhoptions()$HADOOP.TMP
[1] "/tmp"