June 2025 Crawl and Web Graphs

37 views
Skip to first unread message

Thom Vaughan

unread,
Jul 2, 2025, 7:11:47 AMJul 2
to Common Crawl
Hi all,

Our June 2025 crawl archive and its corresponding Web Graph release are now available.

The June 2025 crawl (CC-MAIN-2025-26) crawled between June 12th and June 25th contains 2.38 billion web pages (or 389 TiB of uncompressed content). Page captures are from 47.4 million hosts or 38.5 million registered domains and include 713 million new URLs.

The Web Graph release (cc-main-2025-apr-may-jun) contains 371.6 million nodes and 3.1 billion edges at the host level, and 161.8 million nodes and 2.2 billion edges at the domain level.

See these links for further info:

🔗 June 2025 Crawl Announcement
TV
Reply all
Reply to author
Forward
0 new messages