Greetings, everyone!
We are pleased to announce the release of the December 2025 Crawl Archive and corresponding Web Graph release.
The December 2025 Crawl Archive (
CC-MAIN-2025-51) fetched between December 4th and December 17th consists of 2.16 billion web pages (or 364 TiB of uncompressed content). Page captures are from 46 million hosts or 36 million registered domains and include 783 million new URLs, not visited in any of our prior crawls.
The December 2025 Web Graph release (
cc-main-2025-oct-nov-dec) consists of 250.8 million nodes and 10.9 billion edges at the host level, and 121.3 million nodes and 6.2 billion edges at the domain level.
Further info can be found at the following links:
🔗
December 2025 Crawl Announcement🔗
December 2025 Web Graph Announcement🔗
Crawl Statistics🔗
Web Graph Statistics
We hope this release is useful in your research and projects. As ever, we welcome your questions, comments, and feedback.
Warm wishes for the season from your friends at Common Crawl.
TV