Benjamin Bluhm
unread,Mar 9, 2018, 8:20:31 AM3/9/18Sign in to reply to author
Sign in to forward
You do not have permission to delete messages in this group
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to spar...@googlegroups.com
Hi,
I have been working on a project with spark-ts, but more recently I have been working on a time series project where I implemented my own spark distribution logic which is the following: I first create a partitioned RDD with time series IDs. I then map these partitions to the worker nodes where I import the time series data and perform model training in python. The advantage of this approach is that I can use the full range of Python machine learning libraries.
What is the disadvantage of this approach relative to using spark-ts?
Many thanks & Kind regards,
Benjamin
Von meinem iPhone gesendet