Thanks, everyone! 😊 This is pretty exciting.
If you'd like to see some of the actual scans, which I randomly picked from various years and each of the five boroughs, they are attached to our latest Facebook post here:
(That page and its scans should be visible even if you don't use Facebook at all.)
The process of uploading the files was supposed to start today, but it has been pushed back to later this week, since I'm still sick with a nasty case of gastroenteritis and was in the hospital Saturday night hooked up to an IV, which is not quite how I would have liked to have celebrated the arrival of the scans. They are beautifully clear, and I've been told they are far easier to read than the old microfilm copies that are onsite at the NYC Municipal Archives. "My" microfilm copies that were won in the legal case were made directly from the Archives' vault masters, and the digital scans were then very generously done by FamilySearch on their professional-grade equipment, which is what likely accounts for the clarity.
Tom is correct that this data, which is an index, is somewhat duplicative of the data in ItalianGen's index, even if the two underlying document sets are different (City Clerk's licenses/affidavits/applications versus Health Department certificates), because the time periods do overlap. But according to previous statements of the head of the NYC Municipal Archives, this new City Clerk's data set has 10% more entries in it. Some of that variance may be due to people who applied for a license/affidavit/application and then never followed through with the marriage, maybe because one or both partners were found to be ineligible (bigamy? STD's? underage?), or died before the wedding, or got cold feet and never followed through with the wedding, or some other cause. Or perhaps the City Clerk's index was just a better index all around, since it was apparently compiled contemporaneously, while the ItalianGen index to the Health Department certificates (really two separate indices, the Brides Index and the Grooms Index) was created years later as a WPA project, and maybe some errors and omissions crept in.
Anyway, I am totally agnostic on how the images get turned into a transcribed index, but I will cheer you all on from the sidelines. Once the images are on the Internet Archives, you can download them in bulk, or even as a torrent if you want, so that should help disseminate them easily. I do plan on giving a copy of the data on a hard drive to ItalianGen/GermanGen, but I know they still do things the tedious way: they burn individual CD's with a small number of files and send them in the mail to people who then work one image at a time, transcribing each image into an Excel row. (Ugh!) If a better way is set up to manage the project, perhaps their indexers might want to join that way instead. Heck, maybe you can set up an arms race amongst indexers. 😀
FYI, the next data set to come out *might* be the New York State (excluding NYC) death index for 1880-1956, and to my surprise it might already be in CSV format and not need an indexing project at all! How cool would that be? Right now I'm in a holding pattern with it because the New York State Department of Health has pushed back their official reply date on that FOIL request yet again, now aiming for April 29th. You can follow the real-time updates on the MuckRock page for that request here:
https://www.muckrock.com/foi/new-york-16/index-to-all-new-york-state-death-records-1880-1956-23256/
If that dataset does get released, and in a nice ready-to-go CSV format at that, that would be *such* a fun dataset to play around with, not just for research but also with visualization tools or statistical tools. You could look at migration patterns of family surnames, most common surnames per county, counties with the most nonagenarians at time of death, and so on. Unfortunately the statewide data supposedly wouldn't include Albany, Buffalo, or Yonkers prior to 1914, but they're next on my list once I get the state data. And I keep getting mixed answers on whether inmates of state prisons or state mental hospitals who died on the grounds were included in the data or not. Guess we'll find out.
- Brooke