Hi all,
On Tue, 2012-11-06 at 16:08 +0100, Gunnar Aastrand Grimnes wrote:
> Enjoy your new rdflib with rdfa! :)
Here's some sideband data that reinforces the utility of an
RDFa/microformats parser package in RDFLib by illustrating an explosive
growth of RDFa and microdata use according to the Web Data Commons.
FTR: "The Web Data Commons project extracts all Microformat, Microdata
and RDFa data from the Common Crawl web corpus, the largest and most
up-to-data web corpus that is currently available to the public"
Triples retrieved from microformat hcard (for a comparison) vs rdfa vs
microdata for 2010 [1] and 2012 [2]
total triples:
2010: 5,193,276,058
2012: 7,350,953,995 (+41.55%)
triples from hcard:
2010: 3,226,066,019
2012: 3,547,824,107 (+9.97%)
triples from rdfa:
2010: 293,542,991
2012: 1,079,175,202 (+267.64%)
triples from microdata:
2010: 1,197,115
2012: 1,488,063,426 (+124204.13%)
[1]
http://webdatacommons.org/2010-09/stats/stats.html
[2]
http://webdatacommons.org/2012-08/stats/stats.html
Cheers,
--
Graham Higgins
http://bel-epa.com/gjh/