Thanks for sharing this Richard (and Karen), it's amazing work. And I really appreciate that you've shared your analysis as well as the raw responses. It's *hugely* helpful in getting a broader understanding of the community (at least the library subset) innovating in this space.
I think we've got a lot of work to do yet on the Open part of Linked Open Data, which to me is a critical part of growing a sustainable ecosystem of open knowledge. There's been a lot of progress, and I'd say the pendulum is swinging toward Open, with the five largest datasets clearly utilizing open licenses. But it gets pretty muddy after that. This is why I think we have to keep pushing for Linked Open Data, and continue to support ongoing discussions, development of legal/policy tools, and reinforcing rewards for utilizing open licenses.
I've taken the liberty of rewriting Karen's table in part 4 with the largest linked datasets, just adding in a couple of columns related to licensing.
Jon
The largest linked data datasets reporting in descending order, with licensing:
Project/Service |
Source |
Size |
License |
Open? |
WorldCat.org |
OCLC |
15 billion triples |
e) CCO 1.0 Universal |
Yes |
WorldCat.org Works |
OCLC |
5 billion triples |
e) CCO 1.0 Universal |
Yes |
Europeana |
Europeana Foundation |
3,798,446,742 triples for the
pilot LOD service |
e) CCO 1.0 Universal |
Yes |
The European Library |
The European Library |
2,131,947,229 triples |
e) CCO 1.0 Universal |
Yes |
Linked Open Data |
Research Libraries UK |
936,054,853 triples |
a) We don’t announce any explicit license. |
No |
Opendata |
Charles University in Prague |
750 million triples |
Other (please specify) The license is based on French State Open License;
the linked dataset has not yet been released |
No |
Semantic Web Collection |
British Museum |
100-500 million triples |
Other (please specify) Open Data
licence |
Yes? |
Drug Encyclopedia |
Charles University in Prague |
100-500 million triples |
e) CCO 1.0 Universal |
Yes |
Cedar project |
DANS |
100-500 million triples |
Other (please specify) TBD |
Not Yet |
id.loc.gov |
Library of Congress |
100-500 million triples |
a) We don’t announce any explicit license. |
No? |
British National Bibliography |
British Library |
50-100 million triples |
Other (please specify) Open as possible, but also dealing with research
project data and enrichment. |
No? |
British Art collection |
Yale Center for British Art |
57 million triples |
Other (please specify) see
http://britishart.yale.edu/open-data-and-data-services-terms-use |
Yes? |
Licenses: 16 projects/services do not announce any explicit license. Of the ones that do, CC0 1.0 Universal is the most common (15), followed by Public Domain Dedication and License or PPDL (7). A few apply Open Data Commons Attribution (ODC-BY), Open Data Commons Open Database License (ODC-ODbl) or ODC-ShareAlike Community Norms. Other licenses mentioned: ODC-BY-SA (Attribution-ShareAlike), but considering ODC-BY; Creative Commons Attribution-NonCommercial-NoDerivatives (BY-NC-ND); French State Open License-based; and a link to a specific set of data services terms.
Jon Voss
Historypin Strategic Partnerships Director
ph. 415-935-4701
-------------------------------------------
We Are What We Do
London | San Francisco