Hi Everyone,
I've been running some AWS Athena queries against the common crawl data. For all but very limited queries I get errors like this:
Error
opening Hive split
s3://commoncrawl/cc-index/table/cc-main/warc/crawl=CC-MAIN-2020-50/subset=robotstxt/part-00048-8431e8c9-c763-4c27-b286-ba1dd7e2631e.c000.gz.parquet
(offset=0, length=19593449):
com.amazonaws.services.s3.model.AmazonS3Exception: Please reduce your
request rate. (Service: Amazon S3; Status Code: 503; Error Code:
SlowDown; Request ID: ZP18Q8M7ENFMWDNF; S3 Extended Request ID:
0mmk1CH52QxF0BpAN6Bjebg90MZSF5u6XfzX5nKFnBTiWkZdVAtpcZFanNDlf2RetRJKYfnKd6Q=;
Proxy: null), S3 Extended Request ID:
0mmk1CH52QxF0BpAN6Bjebg90MZSF5u6XfzX5nKFnBTiWkZdVAtpcZFanNDlf2RetRJKYfnKd6Q=
This query ran against the "ccindex" database, unless qualified by the query. Please post the error message on our forum or contact customer support with Query Id: ec9b86c7-d0ca-4faa-b3f6-f0bdd7fa76df
Some of the sample queries like this one give the same error:
So I don't think it is specific to something I'm doing. The two other posts from today are also having 503 errors. To me that says that something changed on AWS. Can anyone help?
Thanks,
Craig Schmidt