Accessing Large hadoop ready S3 files from Rstudio using RHadoop on AWS-EMR

36 views
Skip to first unread message

Miguel Laino

unread,
Apr 20, 2015, 12:21:25 PM4/20/15
to rha...@googlegroups.com
Hi guys

I am trying to run a large analytical process on AWS-EMR.

The data is EMR ready on chunks of 6GB for the larger files on S3. I have trouble using RStudio on EMR to access these files. 

Any idea on how can I use Rhadoop on RStudio running on EMR to read these files? Any packages/links/ideas are welcome.
Reply all
Reply to author
Forward
0 new messages