--
Unfortunately, I haven't yet found a single country (or city, state, province, library, genealogy society) that has put any significant vital records or census information online with an open data license (without an NC restriction), anywhere in the world. Not one.
Here's the thing: I don't think asking US cities/counties/states to put their data under Creative Commons licenses is the way to go -- because, as Ben pointed out, in the US those vital records are already in the public domain by virtue of having been created by the US government with taxpayer money.
A better and probably more fruitful initial focus would be identifying the important *non*-governmental parts of the archival world -- like genealogy societies, historical societies, libraries, for-profit websites (like Ancestry.com or Find-A-Grave), not-for-profit websites (like FamilySearch.org, JewishGen.org), etc. -- that have already done the laborious work of locating paper-copy (or headstone) records, scanning or photographing the records, indexing the records, and making databases out of the indices -- and asking *them* to please release their completed datasets, and release them under a CC license at that.
I'd certainly prefer a free license alternative. WeRelate data is
cc-by-sa. But yes, as a practical matter, I believe that not having NC
would be a show-stopper for many genealogy organizations and it's better
to have data under NC than not at all.
It's interesting to note that FamilySearch Wiki started off as NC and
then changed their license a couple of years later to remove it. So
maybe we could think of NC as a "gateway" license to eventual adoption
of a fully free license.
Most states do not consider vital records to be public records in the public domain (otherwise they wouldn't be able to restrict access as they are increasingly doing).
I'm certainly interested in working with others (like you, Brooke) on methodically searching out and trying to improve the access terms for existing data from public, commerical, and non-profit sources (and helping process, improve and host such data). Maybe a project on something like Github/Bitbucket (or an open project manager) would be a way to co-operate, by using issue tracking for who's approaching who, example emails, wiki for legal issues, example data, data handling techniques, etc. It's all very relevant to the Open Genealogy software I'm working on.
It is going to be extremely hard to get them to agree to a license that could hypothetically (if extremely, extremely unlikely) lead to someone unscrupulous "taking" copies of their indices and sticking them behind a paywall and selling monthly access. So getting organizations to at least agree to a sharealike but non-commercial license is a good step in the right direction, considering that we are essentially starting from scratch here.
I'd really like to
know more about similar efforts that are under way, both in the US and
elsewhere. ...
So who's working on public access to public records of interest to genealogists?
Not to make the best the enemy of the good, but I'd really like to
know more about similar efforts that are under way, both in the US and
elsewhere. For one thing, it seems likely that legal and policy work
on public access to public records is already being done by the EFF,
Creative Commons, the Open Access movement, those behind the SSDI
petitions, journalists, Wikimedia, the Internet Archive and others.
If there's an organization already funding lawyers to draft licenses,
file FOIA requests, or litigate test cases, I'm happy to contribute to
them rather than reinventing the wheel.
I'd certainly prefer a free license alternative. WeRelate data iscc-by-sa. But yes, as a practical matter, I believe that not having NCwould be a show-stopper for many genealogy organizations and it's betterto have data under NC than not at all.
I would help with an effort like this.
--
It would be great if this discussion could be summarized on the
rootsdev website as best practices for creating and sharing
genealogical datasets, both from scratch and combining data from
others.
I now think this is a rather bad idea, and hope in the long run to
change FreeBMD's (and FreeREG's and FreeCEN's) licence - and I agree
that unless commercial use is permitted the data is mostly useless.
It does, however,
raise the interesting question of how "free" is sustained as, sadly,
bandwidth, servers, disks and scanners are not free - nor are
sysadmins and coders reliably free.
> I'm surprised the deal with Ancestry.com for
> them to host a copy behind their paywall was considered non-commercial
> though.
It was not behind their paywall.