On Fri, Jun 26, 2015 at 02:32:42PM -0700, Xuchen Yao wrote:
> The Freebase Topic API returns you a JSON file on a particular topic. Then
> it's relatively easy to:
>
> 1. do string matching with WebQuestions answer node and JSON field
> 2. extract the full dictionary path from top node to the matched JSON
> field, this will require a recursive function.
Ah, so you are using the JSON API. I'd better hurry if I want to use it
too, I guess. I generally prefer to deal with RDF, but you mentioned that
the result ordering helps.
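
For illustration, the two quoted steps (string matching against JSON
fields, then recovering the full dictionary path recursively) could be
sketched roughly as below. This is only a sketch, not the Jacana code: the
function name find_paths and the toy topic dictionary are made up, and
real Topic API responses carry many more fields.

```python
def find_paths(node, answer, path=()):
    """Yield every key/index path from the top node to a string leaf
    that matches `answer` (case-insensitive, whitespace-trimmed)."""
    if isinstance(node, dict):
        for key, value in node.items():
            yield from find_paths(value, answer, path + (key,))
    elif isinstance(node, list):
        for index, value in enumerate(node):
            yield from find_paths(value, answer, path + (index,))
    elif isinstance(node, str):
        if node.strip().lower() == answer.strip().lower():
            yield path


# Toy stand-in for a topic JSON document (assumed layout, heavily trimmed).
topic = {
    "property": {
        "/people/person/place_of_birth": {
            "values": [{"text": "Honolulu"}],
        },
        "/people/person/profession": {
            "values": [{"text": "Politician"}, {"text": "Lawyer"}],
        },
    }
}

paths = list(find_paths(topic, "honolulu"))
```

Exact matching is the simplest choice here; anything fuzzier would go in
the final comparison.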
> If you need a dump of all JSON files with entities in WebQuestions, Jacana
> released it. If you need more entities beyond, then have to call the
> Freebase Topic API.
In Jacana I found the list of question entities, maybe even answer
entities (I wasn't sure) but not graph paths connecting the two...
> On Wikidata:
>
> I'm hoping to build an internal version of the Freebase graph database
> using its RDF dumps. It's a huge engineering challenge and needs powerful
> machines. If someone is seriously interested in this and has resources, I'm
> happy to take it offline and have some more in-depth discussion.
Do you mean just getting a SPARQL endpoint for Freebase RDF?
YodaQA runs a public one and has some docs about how to set one up:
https://github.com/brmson/yodaqa/tree/master/data/freebase
(Feel free to use it *lightly* for purposes other than YodaQA.
Unfortunately, it really is a lot of data and more complex queries can
easily take minutes.)
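
As a rough idea of what a "light" query looks like, here is a sketch that
only builds a small, LIMIT-ed SPARQL request and does not send it. The
endpoint URL is a placeholder (not the actual YodaQA address; see the docs
linked above), the mid is made up, and build_query / build_url are
hypothetical helper names.

```python
from urllib.parse import urlencode

# Placeholder endpoint, NOT the real YodaQA one.
ENDPOINT = "http://localhost:3030/freebase/query"


def build_query(mid, limit=20):
    """Return a SPARQL query listing a few triples about one topic.

    `mid` is the Freebase RDF local name, e.g. "m.0abc" (made-up id).
    The LIMIT keeps the load on a shared endpoint small.
    """
    return (
        "PREFIX ns: <http://rdf.freebase.com/ns/>\n"
        "SELECT ?p ?o WHERE { ns:%s ?p ?o } LIMIT %d" % (mid, int(limit))
    )


def build_url(mid, limit=20, endpoint=ENDPOINT):
    """Return a GET URL with the query in the standard `query` parameter."""
    return endpoint + "?" + urlencode({"query": build_query(mid, limit)})


url = build_url("m.0abc", limit=5)
```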
(Many people seem to use Virtuoso; maybe it's faster or otherwise better.
But when I was using Virtuoso, just for DBpedia, it kept randomly
corrupting its own database during imports. Maybe I was using some buggy
version, but I gave up on it.)