Hi Team,
I’d like to confirm something regarding the use of Common Crawl data. Since this dataset is part of AWS Open Data and accessible via authenticated S3, will I incur any charges if I download and process several months of data on a bare-metal server hosted in a different region or with another cloud provider?
My understanding is that because the dataset is hosted under the Open Data program (and not in our own S3 buckets), there should be no additional charges. Could you please confirm if that’s correct? I am thinking to use S3 method as it can pull data faster to my machines.
Thanks,
Vansh Devgan
--
You received this message because you are subscribed to the Google Groups "Common Crawl" group.
To unsubscribe from this group and stop receiving emails from it, send an email to common-crawl...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/common-crawl/CAE9vqEGnMB2CovRWW7hv86%3DF85DNeh6NE%2BSrex%3DLEx777jRELA%40mail.gmail.com.
--
You received this message because you are subscribed to the Google Groups "Common Crawl" group.
To unsubscribe from this group and stop receiving emails from it, send an email to common-crawl...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/common-crawl/ff270045-4e71-4695-b321-b12d21f2507an%40googlegroups.com.