updated Wikidata HDT file?

15 views
Skip to first unread message

Egon Willighagen

unread,
Dec 14, 2015, 2:47:44 PM12/14/15
to bio...@googlegroups.com
Hi all,

while I don't have a lot of time this week, I converted the
WikiPathways RDF into a HDT file, which works well, but the amount of
data is not so much. But it's cool to be able to run queries on the
data locally (*).

However, Wikidata comes at a considerable larger size. In fact, the
full RDF does not fit on my SSD and while the smaller set can (16GB
unzipped), 2.5GB of heap space is not enough for Java to create the
index :/

There is a Wikidata dump download, but that is very old... can someone
please make an up to date HDT file? What I did was create a single .nt
file for the "Simplified and derived dumps", concatenate them, and
then create a HDT file.

Egon

--
E.L. Willighagen
Department of Bioinformatics - BiGCaT
Maastricht University (http://www.bigcat.unimaas.nl/)
Homepage: http://egonw.github.com/
LinkedIn: http://se.linkedin.com/in/egonw
Blog: http://chem-bla-ics.blogspot.com/
PubList: http://www.citeulike.org/user/egonw/tag/papers
ORCID: 0000-0001-7542-0286
ImpactStory: https://impactstory.org/EgonWillighagen
Reply all
Reply to author
Forward
0 new messages