May 2021 crawl archive now available

26 views
Skip to first unread message

Sebastian Nagel

unread,
May 23, 2021, 6:07:41 AMMay 23
to Common Crawl
Hi all,

the archives of the May 2021 crawl are now available! The crawl was
run May 5 - 19. The archives cover 2.6 billion web pages or 280 TiB
of uncompressed content. As always, more details about the crawl and
information how to access and use the data can be found on the Common
Crawl blog [1].

Best,
Sebastian

[1] https://commoncrawl.org/2021/05/may-2021-crawl-archive-now-available/
Reply all
Reply to author
Forward
0 new messages