How to edit mrjob.conf to submit to an existing emr cluster?
163 views
Skip to first unread message
newb1er
unread,
Dec 7, 2015, 12:36:30 PM12/7/15
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to mrjob
Kind of an EMR noob. I know the value of EMR is you can start them up only as needed, but if I've set something up through the GUI and want to send my job to that cluster, how would I do that?
that instead of starting a new cluster in my emr control panel at:
my_analysis.username.20151207.022413.797727
...With ID: j-YYYYYYYYYYYYYY
Would send this job to the existing one?
David Marin
unread,
Dec 7, 2015, 2:32:34 PM12/7/15
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to mr...@googlegroups.com
You wouldn't put the cluster in mrjob.conf, but you can specify a cluster by running your job with --emr-job-flow-id <cluster ID>.
But... if you started a cluster with the AWS CLI, it won't have the mrjob library installed. Instead, I recommend starting a cluster with `mrjob create-job-flow`, which does the relevant bootstrapping so that you can run mrjobs on the cluster.
You might also want to look into job flow pooling, which will either start a new cluster or reuse an existing one as appropriate:
> --
> You received this message because you are subscribed to the Google Groups "mrjob" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to mrjob+un...@googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.