You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to spark...@googlegroups.com
Hi!
Is there a way to do cross validation on a training set loaded in an RDD? I know I can take a sample, but what I want is n-fold cross validation.
Thanks for any tips!
Rand
Matei Zaharia
unread,
Nov 27, 2012, 5:16:31 PM11/27/12
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to spark...@googlegroups.com
Hi Rand,
Unfortunately there isn't anything specifically for this right now. You'd have to either take samples by hand, or partition the data multiple ways by hand before going through there. It would be interesting to add an operation similar to sample() that produces multiple samples for cross validation but there isn't one yet.
Matei
Rand Hindi
unread,
Nov 27, 2012, 10:31:20 PM11/27/12
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to spark...@googlegroups.com
Great, thanks!
-- ---------------------------------------------------- Rand Hindi