S3 Access Denied, April WAT files.

48 views
Skip to the first unread message

Gregory Ray

unread,
29 May 2015, 13:06:4829/05/2015
to common...@googlegroups.com
Hi,

I'm getting an Access Denied error when I try to access the April .wat files specified in wat.paths. I checked earlier wat files and didn't have this problem, also the warc files from April were accessible as well.

https://aws-publicdatasets.s3.amazonaws.com/common-crawl/crawl-data/CC-MAIN-2015-18/segments/1429246633512.41/wat/CC-MAIN-20150417045713-00000-ip-10-235-10-82.ec2.internal.warc.wat.gz
<Error>
<Code>AccessDenied</Code>
<Message>Access Denied</Message>
<RequestId>F38910F75B2FF0C2</RequestId>
<HostId>
fZQH0VGLJ482jx48jiowmIQfiP8zPDIkdXB2bXzBPThhmGxTMwSdVH883qQ8WB0UwY/jn+M91/w=
</HostId>
</Error>

Am I doing something wrong?

Thanks,
Greg

Stephen Merity

unread,
29 May 2015, 15:29:4629/05/2015
to common...@googlegroups.com
Hi Greg,

This was an oversight on my part - I forgot to set a flag during file creation that sets the permissions to publicly accessible.

The permissions have been fixed now, thanks for informing me!

--
You received this message because you are subscribed to the Google Groups "Common Crawl" group.
To unsubscribe from this group and stop receiving emails from it, send an email to common-crawl...@googlegroups.com.
To post to this group, send email to common...@googlegroups.com.
Visit this group at http://groups.google.com/group/common-crawl.
For more options, visit https://groups.google.com/d/optout.



--
Regards,
Stephen Merity
Data Scientist @ Common Crawl
Reply all
Reply to author
Forward
0 new messages