Hi everyone,
Our June 2026 Crawl Archive and corresponding Web Graph are now available.
The June 2026 crawl consists of 2.10 billion web pages (or 354 TiB of uncompressed content). Captures are from 40.8 million hosts or 33.6 million registered domains.
The corresponding Web Graph release consists of 247.3 million nodes and 6.3 billion edges at the host level, and 121.1 million nodes and 3.9 billion edges at the domain level.
🔗 June 2026 Crawl Announcement
🔗 June 2026 Web Graph Announcement
🔗 Crawl Statistics
🔗 Web Graph Statistics

Live long and prosper! 🖖
Luca