Prepare data using Pig to processing with R

37 views
Skip to first unread message

Vinicius Brito Rocha

unread,
Sep 28, 2014, 11:47:48 PM9/28/14
to rha...@googlegroups.com

Hi, 

There is a way where the prepare data fase be made using pig and just use R for start the mapreduce function? I try some tutorials in RevolutionR site, but in all of then the data is loaded in R and then transfered to Hadoop with to.dfs() function.

Thanks

Vinicius

Antonio Piccolboni

unread,
Sep 29, 2014, 12:19:07 PM9/29/14
to rha...@googlegroups.com
I don't have a lot of experience with pig, but there is a pig.hive format that should work with either. In Hive, you'd need to work on an external table so that you know a path to it. Create and populate external table in Hive, then process in rmr2. I am not sure what the equivalent in pig would be though.

A
Reply all
Reply to author
Forward
0 new messages