hi all,
I just discovered this public dataset today, but get access denied when trying to list the content underneath
s3cmd ls s3://aws-publicdatasets/common-crawl/parse-output/
ERROR: Access to bucket 'aws-publicdatasets' was denied
s3cmd ls s3://aws-publicdatasets/common-crawl/crawl-001/
ERROR: Access to bucket 'aws-publicdatasets' was denied
s3cmd ls s3://aws-publicdatasets/common-crawl/crawl-002/
ERROR: Access to bucket 'aws-publicdatasets' was denied
Error message displays 403 when I tried it through hadoop fs:
hadoop fs -ls s3://aws-publicdatasets/common-crawl/parse-output/segment/
ls: org.jets3t.service.S3ServiceException: S3 HEAD request failed for '/common-crawl%2Fparse-output%2Fsegment' - ResponseCode=403, ResponseMessage=Forbidden
I could list other public s3 repo correctly: s3://datasets.elasticmapreduce/ngrams/books/
did I miss setup to access the content?
Thank you.
Yuhan