x509 subject alternative names as a seed?

51 views
Skip to first unread message

Pierre Barre

unread,
May 27, 2025, 5:19:19 PMMay 27
to Common Crawl

Hi,

I love Common Crawl and have made fair use of it over the years. I'd love to give back in some way.

I operate https://www.merklemap.com/, which is a certificate transparency search engine, and I was wondering if there would be interest in using our stream of hostnames as a "seed" to index undiscovered websites (docs: https://www.merklemap.com/documentation/live-tail). If so, we'd happily provide our service for free to Common Crawl.


Best, 

Pierre

Ed Summers

unread,
May 27, 2025, 6:19:23 PMMay 27
to common...@googlegroups.com
I wonder what your customers might think about having their “undiscovered" websites suddenly archived automatically?
> --
> You received this message because you are subscribed to the Google Groups "Common Crawl" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to common-crawl...@googlegroups.com.
> To view this discussion visit https://groups.google.com/d/msgid/common-crawl/b89f45e6-3377-44e1-a867-b3c1f1b5eb31n%40googlegroups.com.

Pierre Barre

unread,
May 27, 2025, 6:24:44 PMMay 27
to common...@googlegroups.com
If you rely on the hope that no one is going to find your “secret” hosts as a security measure, I have very bad news for you…
> https://groups.google.com/d/msgid/common-crawl/6555AFC5-C418-407A-8204-360922870934%40pobox.com.

Greg Lindahl

unread,
May 27, 2025, 6:36:55 PMMay 27
to common...@googlegroups.com
Please stop this conversation here. Thanks.

Reply all
Reply to author
Forward
0 new messages