January 2018 Crawl Archive Now Available

19 views
Skip to first unread message

Sebastian Nagel

unread,
Jan 29, 2018, 12:17:11 PM1/29/18
to common...@googlegroups.com
Hi all,

the January 2018 crawl archive is now available. The crawl was run from Jan 16 and Jan 24, 2018
and covers 3.4 billion web pages or 270 TiB of uncompressed content. More details about the crawl
and information how to access and use the data can be found on our blog [1].

You'll find statistics and metrics about the current and previous crawls on [2].

The URL index of the December crawl is available at [3].

Best,
Sebastian


[1] http://commoncrawl.org/2018/01/january-2018-crawl-archive-now-available/
[2] https://commoncrawl.github.io/cc-crawl-statistics/
[3] http://index.commoncrawl.org/CC-MAIN-2018-05/
Reply all
Reply to author
Forward
0 new messages