gcloud dataproc clusters create haz00 \
--initialization-actions gs://zookeeper.sh,gs://accumulo.sh \
--region=us-east4 \
--zone=us-east4-a \
--single-node
ERROR: (gcloud.dataproc.clusters.create) INVALID_ARGUMENT: Google Cloud Storage object does not exist 'gs://accumulo/accumulo.sh'.
gcloud dataproc clusters create haz00 \
--initialization-actions gs://zookeeper.sh \
--region=us-east4 \
--zone=us-east4-a \
--single-node
Thanks again, Karthik. I had no idea I needed to upload a script to a bucket. Still, on day three of my Google Cloud experience, some issues persist.
I create a bucket.
$ gsutil mb -p bold-rain-193317 -c regional -l us-east4-a gs://haz-bucket
It creates successfully.
I try to upload.
$ cd ~/dataproc-initialization-actions
$ gsutil cp gs://dataproc-initialization-actions/zookeeper/zookeeper.sh gs://haz-bucket/zookeeper.sh
$ gsutil cp gs://dataproc-initialization-actions/accumulo/accumulo.sh gs://haz-bucket/accumulo.sh
zookeeper.sh succeeds, but accumulo.sh does not. The paths are correct.
Question: Why does zookeeper.sh succeed when I run the command from within the dataproc-initialization-actions directory?
Also, I get the following error on accumulo.sh:
CommandException: No URLs matched: gs://dataproc-initialization-actions/accumulo/accumulo.sh
If I use a relative path like this (NOTE: I removed the super directory):
$ gsutil cp gs://zookeeper/zookeeper.sh gs://haz-bucket/zookeeper.sh
$ gs://accumulo/accumulo.sh
For zookeeper.sh I get:
AccessDeniedException: 403 geoffry.roberts@hedronanalytics.com does not have storage.objects.list access to zookeeper.
But for accumulo.sh I get:
BucketNotFoundException: 404 gs://accumulo bucket does not exist.
Why? Why the two different errors?
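My working guess, which may be wrong, is that a gs:// URL has no notion of a relative path, so the first component after gs:// is always read as a bucket name. The sketch below only does string splitting to show how the URLs above would break down under that assumption. It does not call gsutil, and the function name split_gs_url is made up for illustration.

```shell
# Hypothetical helper: split a gs:// URL into bucket and object name.
# Pure string handling; it does not call gsutil or touch the network.
split_gs_url() {
  url=$1
  rest=${url#gs://}              # drop the scheme
  bucket=${rest%%/*}             # everything up to the first slash
  case $rest in
    */*) object=${rest#*/} ;;    # everything after the first slash
    *)   object= ;;              # no slash: the URL names only a bucket
  esac
  printf 'bucket=%s object=%s\n' "$bucket" "$object"
}

split_gs_url gs://zookeeper/zookeeper.sh
split_gs_url gs://accumulo/accumulo.sh
split_gs_url gs://dataproc-initialization-actions/zookeeper/zookeeper.sh
```

If that reading is right, the two short URLs point at buckets named zookeeper and accumulo rather than at directories inside dataproc-initialization-actions. That still leaves open why one lookup returns 403 and the other 404.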
Finally,
Is there a way to execute these scripts from the command line on my compute instance? How do I access my bucket from there?
There is no apt-get for Accumulo. Hence I must download the tarball. Where does the create command download to?
No download appears in my bucket nor in my compute instance.
This is my create command:
$ gcloud dataproc clusters create haz00 \
--initialization-actions gs://haz-bucket/zookeeper.sh,gs://haz-bucket/accumulo.sh \
--bucket=haz-bucket \
--region=us-east4 \
--zone=us-east4-a \
--single-node
You have been a lot of help already. Thanks.