Autoscaling Dataproc Jobs


Vadim Solovey

Mar 9, 2018, 12:18:58 PM
to Google Cloud Dataproc Discussions
Cloud Dataproc is a fast-to-provision, easy-to-use, fully managed cloud service for running Apache Spark and Apache Hadoop clusters simply and cost-efficiently. You can resize a Cloud Dataproc cluster at any time, but the service has no built-in cluster autoscaling. We have multiple teams running jobs on the same cluster, so we needed autoscaling. We recently open sourced Shamash, an autoscaler for Dataproc clusters: https://github.com/doitintl/shamash. Enjoy!

Pranjul Ahuja

Jan 2, 2019, 9:42:17 AM
to Google Cloud Dataproc Discussions
Hi Vadim,
Can this be used to autoscale Presto deployed on a Dataproc cluster?