Hello everyone,
Late last week on January 27th, we released a new set of updates to Google Cloud Dataproc.
gcloud
command for adding tags and metadata to virtual machines used in Cloud Dataproc clusters. These tags and metadata will apply to both regular and preemptible instances.--tags
option will add tags to the Google Compute Engine instances in a cluster. For example, using the argument --tags foo,bar,baz
will add three tags to the virtual machine instances in the cluster.--metadata
option will add metadata to the compute engine instances. For example, using --metadata 'meta1=value1,key1=value2'
will add two key-value pairs of metadata.Line wrapping
option to make it easier to view job output containing very long linesmaxExecutors
setting is now set to 10000
to avoid the AppMaster failing on jobs with many tasksBest,
Google Cloud Dataproc / Google Cloud Spark & Hadoop Team