Hi John,
could you share more details and context about the access method and the
location you're accessing the data from?
- which file formats (WARC, WAT, WET files, etc.)?
- from which IP (range), alternatively the location?
- running how many concurrent requests, parallel processes or threads?
- which access method or the requested URL leading to the error?
If possible, please share some log snippets showing the error.
In case you cannot publicly share the details in this discussion group,
you may contact us directly via
in...@commoncrawl.org - Thanks!
Best,
Sebastian