I am attempting to download specific files following the instructions provided in the links below:
I was able to download the folder, but it is empty and contains no data.
Additionally, I attempted to download the files using the following commands:
aws s3 cp s3://commoncrawl/crawl-data/CC-MAIN-2018-17/segments/1524125937193.1/warc/CC-MAIN-20180420081400-20180420101400-00000.warc.gz <local_path> --no-sign-request
Unfortunately, both commands were unsuccessful. Could you please confirm if there are any restrictions on accessing these files? I am trying to access them from the UK.
I would appreciate any guidance you can provide.
Best regards,
Please note that access to the data on the Amazon cloud through the S3 API is only allowed for authenticated users, so unsigned requests sent with --no-sign-request are rejected. Please see our blog announcement for more information.
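For reference, here is a rough sketch of two ways to fetch the same file. It assumes the bucket contents are also served over HTTPS under https://data.commoncrawl.org/ (this host is not mentioned above, so please check it against the blog announcement), and the helper name s3_to_https is made up for illustration. The download commands are left commented out so the snippet only prints the derived URL.

```shell
#!/bin/sh
# Sketch only: map an s3://commoncrawl/... path to an HTTPS URL.
# Assumption: the same keys are available under https://data.commoncrawl.org/
s3_to_https() {
    # Strip the "s3://commoncrawl/" prefix and prepend the HTTPS host.
    printf 'https://data.commoncrawl.org/%s\n' "${1#s3://commoncrawl/}"
}

s3_path='s3://commoncrawl/crawl-data/CC-MAIN-2018-17/segments/1524125937193.1/warc/CC-MAIN-20180420081400-20180420101400-00000.warc.gz'
url=$(s3_to_https "$s3_path")
echo "$url"

# Option 1: plain HTTPS download, no AWS account needed (uncomment to run).
# -f fails on HTTP errors, -L follows redirects, -O keeps the remote file name.
# curl -f -L -O "$url"

# Option 2: S3 API with credentials set up beforehand via `aws configure`,
# i.e. the original command without --no-sign-request.
# aws s3 cp "$s3_path" <local_path>
```

Option 1 should work from any location, including the UK. Option 2 needs an AWS account, but the request is then signed and should no longer be refused as unauthenticated.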
--
You received this message because you are subscribed to the Google Groups "Common Crawl" group.
To unsubscribe from this group and stop receiving emails from it, send an email to common-crawl...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/common-crawl/CAHsytomMB50q7y6O%2BmDo%3DYeBNdD2ws%3DfBFPvzbx1dimTPy7tNA%40mail.gmail.com.