I can't get rmr2 to work on Hadoop. It seems to be doing several things incorrectly:
1) During installation, it searches the entire filesystem for some HBase files, even where that includes a mounted NFS share. Why is this necessary, and is there a way to stop it from doing that? I found that I can hit Ctrl-C to kill the search (if I didn't, it would be searching through 20 TB of backups).
2) After getting it installed and setting up the R environment, it tries to use "hadoop <cmd>". This generates deprecation messages, and my client will probably think that something is wrong. Hadoop now has two commands for this: yarn and hdfs. Is there a place to configure this so that rmr2 uses those commands instead?
3) rmr.options(backend = "hadoop") returns NULL. I really don't know why that doesn't work. Maybe it has to do with the issues above, or maybe something is wrong elsewhere.
4) I downloaded the GitHub release and built it with R CMD build. That seems to be the version.
About my configuration:
Hadoop 2.2.0 (not the latest, but next to latest)
CentOS 6.4
Hadoop/HBase/Pig/Derby/Hive installed in /opt/hadoop
Java is at jdk1.7.0_25
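For context, rmr2 reads the HADOOP_CMD and HADOOP_STREAMING environment variables to find the Hadoop launcher and the streaming jar. A minimal sketch of that setup for the layout above follows. The variable name HADOOP_DIR is only illustrative, and both paths are assumptions based on the stock Hadoop 2.2.0 tarball layout under /opt/hadoop, so they may need adjusting:

```shell
# Hypothetical helper variable; /opt/hadoop is the install prefix listed above.
HADOOP_DIR=/opt/hadoop

# rmr2 looks up the Hadoop launcher through HADOOP_CMD.
# Assumed location: bin/hadoop under the install prefix.
export HADOOP_CMD="$HADOOP_DIR/bin/hadoop"

# rmr2 looks up the streaming jar through HADOOP_STREAMING.
# Assumed location: the default path in the 2.2.0 binary tarball.
export HADOOP_STREAMING="$HADOOP_DIR/share/hadoop/tools/lib/hadoop-streaming-2.2.0.jar"

# Show the values that an R session started from this shell will inherit.
echo "HADOOP_CMD=$HADOOP_CMD"
echo "HADOOP_STREAMING=$HADOOP_STREAMING"
```

These exports only affect R sessions launched from the same shell. The same two variables can also be set from inside R with Sys.setenv() before library(rmr2) is loaded.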
Please help? Thanks!