extruct: all-in-one extractor for RDFa, Microdata, JSON-LD, Microformats

Wes Turner
Apr 18, 2020, 12:13:17 AM
to rdfli...@googlegroups.com
Found this a while back and thought I'd share:

"extruct is a library for extracting embedded metadata from HTML markup"

- W3C's HTML Microdata
- embedded JSON-LD
- Microformat via mf2py
- Facebook's Open Graph
- (experimental) RDFa via rdflib
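
To make "embedded JSON-LD" concrete, here is a rough standard-library sketch of what pulling it out of a page involves. This is not extruct's implementation; the names `JsonLdCollector`, `extract_json_ld`, and `SAMPLE_HTML` are made up for illustration, and the sample page is invented.

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Hypothetical helper: gathers <script type="application/ld+json"> payloads."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.items = []

    def handle_starttag(self, tag, attrs):
        # JSON-LD is embedded in script tags with this specific MIME type.
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        # Script content may arrive in several chunks, so buffer it.
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            self.items.append(json.loads("".join(self._buffer)))


def extract_json_ld(html):
    """Return a list of parsed JSON-LD objects found in the HTML string."""
    collector = JsonLdCollector()
    collector.feed(html)
    collector.close()
    return collector.items


# Invented sample page, for illustration only.
SAMPLE_HTML = """
<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "Person", "name": "Jane Doe"}
</script>
</head><body><p>Hello</p></body></html>
"""

if __name__ == "__main__":
    print(extract_json_ld(SAMPLE_HTML))
```

If I remember its README correctly, extruct covers this and the other listed syntaxes through a single call along the lines of `extruct.extract(html, syntaxes=['json-ld'])`; check the project docs for the exact options.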