![]() | Grega Kešpret celtra.com | @celtramobileAnalytics engineer Celtra — Rich Media Mobile Advertising |
Typically I would collocate driver programs on the master node (actually I submit jobs to a job server similar to this one: https://github.com/apache/incubator-spark/pull/222).
Jobs can be tested locally, and for production either submitted via the job server interface or ssh into the master node and run the jobs from there. I store jars in s3, download them and add to spark class path as required.
This is similar to how I used to do things with Hadoop.