Hi,
I love Common Crawl and have made fair use of it over the years. I'd love to give back in some way.
I operate https://www.merklemap.com/, a certificate transparency search engine. Would there be interest in using our stream of hostnames as a "seed" for discovering websites that have not been indexed yet? (Docs: https://www.merklemap.com/documentation/live-tail) If so, we'd happily provide our service to Common Crawl for free.
Best,
Pierre