RIP Freebase & data dump question

127 views
Skip to first unread message

Tom Morris

unread,
Sep 25, 2015, 11:18:07 AM9/25/15
to freebase...@googlegroups.com
A fitting last triple by Gordon:


As he says "requiescat in pace."  It was a fun ride, despite the sad ending.

Speaking of ending, it looks like some of the internal pipelines were still running until July, so folks might want to grab the latest data dump if you stopped looking when the dumps ended back in April, 2015.


Even though writes apparently stopped in mid-July, there are several dumps in July and August with slightly different sizes.

Anyone know what accounts for the differences among the July & August dumps (they seem to be different sizes)?

Also, anyone have accurate final stats?  Total facts written, live facts, topics?  The Freebase home page is claiming 3,041,722,635 facts (and 49,947,845 topics), but the max GUID is 9202a8c04000641f8000000046d5d956, which I make to be 1.8 billion triples/quads over all time.  Does each one get counted more than once?

Tom

Spencer Kelly

unread,
Sep 25, 2015, 12:04:37 PM9/25/15
to freebase...@googlegroups.com
yeah, thanks Tom.

I'm optimistic about the afterlife of freebase data, in its many forms.

I'm working on an epic one-liner that reduces the dump (in a totally opinionated way) from 250gb to 3.4gb (98%). I'll put it somewhere as a mongo dump or something. There's value in having something laptop-friendly, and `s/RDF//`, for people to nerd-out with.

If someone wants to make a Freebase 2, they could really just do everything freebase did, again. None of the design decisions, or social-policy fell apart, and that speaks to how well-conceived it was. It was killed because it was working, not because it wasn't.
DBpedia & ConceptNet are doing well at publishing quarterly or whatever, maybe that could be what freebase will become.
 -spence

--
You received this message because you are subscribed to the Google Groups "Freebase Discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to freebase-discu...@googlegroups.com.
To post to this group, send email to freebase...@googlegroups.com.
Visit this group at http://groups.google.com/group/freebase-discuss.
For more options, visit https://groups.google.com/d/optout.

Leeza Rodriguez

unread,
Sep 27, 2015, 7:23:26 PM9/27/15
to Freebase Discuss
My~ last triple was 'Kian', https://www.freebase.com/m/0130f9_n, the first neural network of Wikidata, created in March 2015 (a close call with the deadline!). I got a kick out of telling Freebase who might end up being the AI source at Wikidata, it's final resting place ;-). But I am easily amused!  

Thanks to everybody who helped me learn! 

Leeza

Johannes Athmer

unread,
Sep 29, 2015, 10:58:01 AM9/29/15
to Freebase Discuss
Am Freitag, 25. September 2015 17:18:07 UTC+2 schrieb Tom Morris:

Also, anyone have accurate final stats?  Total facts written, live facts, topics?  The Freebase home page is claiming 3,041,722,635 facts (and 49,947,845 topics), but the max GUID is 9202a8c04000641f8000000046d5d956, which I make to be 1.8 billion triples/quads over all time.  Does each one get counted more than once?

I'd also like some final stats like from the former Expert Hub - total writes/user, votes (old & new pipleline)/user, topic/user, etc. Do you think the queries from the old Expert Hub are still good to create that data?

-Johannes
Reply all
Reply to author
Forward
0 new messages