Can I query the entire corpus online, or must I download the whole thing?


Pat McBennett

Apr 30, 2015, 4:29:53 AM
to web-data...@googlegroups.com
Hi,

So the subject line says it all. Here's an example SPARQL query I'd like to run:

select (count(?s) as ?total) where {
}

That is, how many organization URLs has the crawl captured?

Cheers,

Pat.

Robert Meusel

Apr 30, 2015, 10:28:08 AM
to web-data...@googlegroups.com
Hi Pat,

Sorry, there is no SPARQL endpoint. But you can have a look at the sub-datasets: http://webdatacommons.org/structureddata/2014-12/stats/schema_org_subsets.html

Some numbers are already listed there. You can also download the more detailed statistics spreadsheet (.xlsx) for Microdata: http://webdatacommons.org/structureddata/2014-12/stats/html-microdata.xlsx

Cheers,
Robert
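
For anyone who does download one of the sub-datasets, a count like the one Pat asked about could be approximated offline. The following is only a rough sketch, not an official tool: it assumes the file is gzip-compressed N-Quads (subject, predicate, object, page URL), that organizations are typed as http://schema.org/Organization, and the names count_typed and count_typed_in_file are made up for illustration.

```python
import gzip
import re

RDF_TYPE = "<http://www.w3.org/1999/02/22-rdf-syntax-ns#type>"
# Assumption: organizations are typed with the schema.org class IRI.
ORGANIZATION = "<http://schema.org/Organization>"

# In an rdf:type statement every term is an IRI or blank node, so four
# whitespace-separated tokens are enough. Lines with literal objects
# either fail to match or are dropped by the predicate/object filter.
QUAD = re.compile(r"^(\S+)\s+(\S+)\s+(\S+)\s+(\S+)\s*\.\s*$")


def count_typed(lines, type_iri=ORGANIZATION):
    """Return (typed entities, distinct pages) for the given type IRI."""
    entities = set()  # (subject, page) pairs, since subjects are often blank nodes
    pages = set()     # page URLs (the graph term) with at least one such entity
    for line in lines:
        match = QUAD.match(line)
        if not match:
            continue
        subject, predicate, obj, graph = match.groups()
        if predicate == RDF_TYPE and obj == type_iri:
            entities.add((subject, graph))
            pages.add(graph)
    return len(entities), len(pages)


def count_typed_in_file(path, type_iri=ORGANIZATION):
    """Stream a gzip-compressed N-Quads file without loading it into memory."""
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as handle:
        return count_typed(handle, type_iri)
```

Calling count_typed_in_file on a downloaded file (the path is whatever you saved it as) returns two numbers: how many organization entities were found and on how many distinct pages. The second number is probably closer to "organization URLs" in the original question.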