Em Dil
unread,May 3, 2024, 10:12:49 AMMay 3Sign in to reply to author
Sign in to forward
You do not have permission to delete messages in this group
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Common Crawl
Good afternoon,
I was wondering if all the vertices in the domain level webgraph have at least one or more record in the .wet files of type 'conversion'?
Can vertices appear in the webgraph because hyperlinks to them were found, but pages corresponding to a vertex are not necessarily crawled, or if they all are, some will have a .warc record where they may have blocked the crawler, so there is not .wet conversion record?
Thanks for your help.