Hi Jason,
> I presume it's the same issue as discussed above, or should it be
> exempt from the throttling?
The 503s affect all users, regardless of location or the
service used. Over the last few days the situation has improved:
I was able to query the columnar index via Athena. However,
we're still working on a final solution.
Thanks for your patience.
Best,
Sebastian
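
For anyone still hitting 503s while downloading, one common client-side
workaround is to retry with exponential backoff instead of re-requesting
immediately. The following is only a minimal sketch under stated
assumptions, not an official Common Crawl client: the names
fetch_with_retry and backoff_delays, the retry count, and the delay values
are all hypothetical and chosen for illustration.

```python
import random
import time
import urllib.error
import urllib.request


def backoff_delays(max_retries=5, base_delay=1.0, max_delay=60.0):
    """Yield capped exponential delays: base, 2*base, 4*base, ..."""
    for attempt in range(max_retries):
        yield min(max_delay, base_delay * 2 ** attempt)


def fetch_with_retry(url, opener=urllib.request.urlopen, sleep=time.sleep,
                     max_retries=5, base_delay=1.0, max_delay=60.0):
    """Fetch url, retrying only on HTTP 503 with backoff plus jitter.

    The defaults are illustrative assumptions, not recommended values.
    """
    delays = backoff_delays(max_retries, base_delay, max_delay)
    while True:
        try:
            with opener(url) as response:
                return response.read()
        except urllib.error.HTTPError as err:
            if err.code != 503:
                raise  # other HTTP errors are not retried
            delay = next(delays, None)
            if delay is None:
                raise  # retries exhausted, surface the last 503
            # Random jitter spreads out clients that failed at the same time.
            sleep(delay + random.uniform(0, delay / 2))
```

Call fetch_with_retry(url) with the URL of the file you were trying to
download. The opener and sleep parameters exist only so the retry logic can
be exercised without network access.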
On 2/15/22 19:59, Jason Duke wrote:
> Hi all.
>
> I'm seeing the 503 problem too, but while using Commoncrawl data via
> AWS's Athena service.
>
> I presume it's the same issue as discussed above, or should it be exempt
> from the throttling?
> --
> Jason Duke
>
> * Book a Meeting with me <https://booking.strangelogic.ltd/> *
>
> https://StrangeLogic.com/ - Wisdom & Experience is Strangely Logical
>
> Email: ja...@strangelogic.com
> Email: ja...@the.domain.name
> <mailto:maxk...@gmail.com> wrote:
>
> same here
>
> On Friday, February 11, 2022 at 6:39:16 PM UTC-5
> kasper...@gmail.com wrote:
>
> I still seem to be getting 503 errors. I tried a download on
> the news dataset. Anyone else have the same problem?
>
> On Friday, February 11, 2022 at 2:30:03 PM UTC+1
> alan....@gmail.com wrote:
>
> The CC is so incredibly large (well into the petabyte
> range) that AFAIK it's only available via S3. There are
> very few other systems that could hold it, let alone
> deliver it for free.
>
> On Wednesday, February 9, 2022 at 2:28:53 PM UTC+1
> maxk...@gmail.com wrote:
>
> Ditto, Feb 9th, 503 for every request I've tried.
> Are there any viable mirrors?
>
> On Wednesday, February 9, 2022 at 3:45:39 AM UTC-5
> sgran...@gmail.com wrote:
>
> Same issues here!
>
> On Tuesday, February 8, 2022 at 5:49:41 AM UTC-8
> Ozgur Turel wrote:
>
>> <http://index.commoncrawl.org>) and
>> <https://groups.google.com/d/msgid/common-crawl/f31fa62a-fb50-671f-c542-fa1f2e698f1b%40commoncrawl.org>.
>>
>> --
>> You received this message because you are
>> subscribed to the Google Groups "Common
>> Crawl" group.
>> To unsubscribe from this group and stop
>> receiving emails from it, send an email to
>> common-crawl...@googlegroups.com.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/common-crawl/30677914-db6a-46ce-a28c-f64f13c3df57n%40googlegroups.com.
>