MOBILIZING HISTORIC GEODATA: HACK NYC’S PAST WITH NYPL LABS
Project Tracks, Project Ideas, and Sweet, Sweet Data Starter Kits
For the past few years, with a lot of help from the NEH, we’ve been working on a historical gazetteer of New York, called the New York City Chronology of Place, built by the brilliant geo-hackers at Topomancy and derived in part from NYPL’s historical map collection. Topomancy’s been working on a similar tool with the Library of Congress, but while their system is geared around institutional metadata creation and remediation, ours has a public focus. In that spirit, we invite you to dig into New York City’s past with help from our APIs.
We have a few ideas of areas we could focus on:
Applications: Building mind-blowing user-facing proofs of concepts and prototypes of what can be done with the historical geodata and services we’ve got today, while showing kind of things we’ll need to build and data sources we’ll have to provide in the future (to view the past like we view today... whoaaaaa).
Making geo-data out of other data: Like data? Good. Love data? We love you! We’ve got various historic geodata sets and things that could potentially be geodata. Building tools to look for named entities and geoparse texts, working with semi structured and unstructured data.  Metadata warriors, arise!
Geo-integration and visualization: Wanna run batch queries to find places that no longer exist in thousands of metadata records or plot amazing data you’ve got? Wanna build an urban-scale GutenKarte? You’re in the right place!
These are just ours - you should discuss, criticize, and most importantly suggest your own. We also want your help figuring out which of these ideas we should pursue and which we should jettison.
Think Historical Foursquare. Check in anywhere in Manhattan, get back the history of the place.
This idea came from a conversation* investigating historical checkin and how could it be done with the georectified maps from NYPL’s collection... The very simple explanation that HTTR is a basic mobile app that allows a user to check in at CONTEMPORARY locations and have detailed OLD maps of those places sent to a predetermined email... thus not overtaxing a tileserver in the event it’s popular.
We also have grand delusions that we could do more. We’d love to not only have it pull in maps, but also data from other sources so that it not only collates a series of historical maps of the locations you visit and check in to, but, based on those checkin location, this application queries various APIs (e.g. NYPL Repository, National Digital Newspaper Project, NYTimes or you name it), and sends you a series of digital objects we think might be related to your checkin/journey.
We’ve spent lots of time scraping data from old maps and integrating it into the New York City Chronology of Place, so we want to put it to good use. One way to do that is to use the NYCcop API to facilitate search of the newspaper APIs we love. We want to make all the old newspapers searchable by the click of a map, so we can see all the articles, for example on the construction of the Croton Reservoir or the building and conflagration that took down the Crystal Palace where Bryant Park now sits.
One of the holy grails of working with old maps is scraping meaningful (i.e. not garbage) data from maps analogous to how OCR does it for texts. It doesn’t have to be structured data, though if you can do that, please, by all means, go ahead. The basic idea is we’ve got a georectified map, we scrape points, lines or polygons and spit them out for use elsewhere.
Mike Resig, brother of John of the House of jQuery, has provided us with a set of instructions and a proof of concept to show us that this is possible and our own Mauricio Giraldo’s been doing some work toward this.
We’ve got a great set of 65k fully addressed (with lots more attributes) building footprints that we hand transcribed (!) from William Perris’ Maps of the City of New York (Manhattan) from 1852-4. We’ve also scanned a city directory published by John Doggett in 1854. We’ve already OCR’d it and done a fair deal of parsing, so it’s ready to zip up with the Perris data.
A predicate for conducting geoparsing is a gazetteer. Run a body of unstructured texts against a list of places with associated Lat/Long and have returned locations within texts. Historical geoparsing adds the temporal element. Since we’ve built the NYCcop, we’d like to test drive it against whatever data we can find about NYC history. The dream of the Topomancy team has been to build it to parse The Power Broker.
You may know the nearly hundred year old Stokes Iconography of Manhattan Island 1609-1898, an incredibly dense, exhaustively researched tome of Manhattan people, history and geography, a six volume hypertext-before-it-existed epic homage to the city. It is so deeply rooted in place and time that every page littered with discarded and forgotten points, lines and polygons from the past. And copies exist in digital format, some with OCR. This thing begs to be creatively re-instantiated and given back to the world, as a mappable, or timelined history, or whatever you might imagine.
We’ve got this thing full of metadata records (like a million) about things in the library (sadly not books). It’s got an API. The team that built that system and API will be hacking along side us. Let’s geoparse it!
A few weeks ago, I met this amazing group of folks who were going through their favorite NYC books and geocoding any quote related to a place. Schuyler’s GutenKarte did this for public domain texts a few years ago, but never at urban scale. Let’s tackle urban scale book geocoding.
STARTER DATA KITS
NEWSPAPERS
New York Times
Chronicling America (NDNP) - almost all public domain newspapers from the 1830’s to 1923
Daily Eagle (BPL)
http://eagle.brooklynpubliclibrary.org/Default/Skins/BEagle/Client.asp?Skin=BEagle
API: perhaps we can build one
CITY DIRECTORIES (Image & OCR)
NYC - Doggett - 1854
NYC - Longworth - 1856/6
NYC - Longworth - 1839/40
CITY DIRECTORIES (Parsed)
NYC - Longworth 1839/40 https://www.google.com/fusiontables/DataSource?docid=1HRVtVycCMvGPjAJquY-lmE-Pibm_K5nQfjHiEXs
NYC Doggett - 1854 https://docs.google.com/file/d/0B-Ng8P9yTR2UTkZ0OEc5QVdoQ1k/edit?usp=sharing
PHONEBOOKS
Phone Books - 1939-1940
PHOTOGRAPHS
See: http://old-nyc.appspot.com (image ID’s geocoded: https://github.com/danvk/sfhistory/blob/master/viewer/nyc-lat-lons.js)
http://digitalgallery.nypl.org/nypldigital/explore/dgexplore.cfm?col_id=176
MAPS
Historical NYC Vector Maps
Brooklyn Robinson (1886) http://maps.nypl.org/warper/layers/867
Manhattan Perris (1852-4) https://www.google.com/fusiontables/DataSource?docid=15U7eXFpetG9MLC8MEo4oIYDqmvBgQF5rrasLkV4
Queens Bromley (1909) http://maps.nypl.org/warper/layers/870#Export_tab
Staten Island (1917) http://maps.nypl.org/warper/layers/1035#Export_tab
Georectified Bronx Maps (Raster)
Topographic Survey (1892) http://maps.nypl.org/warper/layers/908
Hyde (1901) http://maps.nypl.org/warper/layers/886
Bromley (1904) http://maps.nypl.org/warper/layers/884
Topographic Survey (1905) http://maps.nypl.org/warper/layers/886
Bromley (1921) http://maps.nypl.org/warper/layers/883
Georectified Brooklyn Maps (Raster)
Bromley (1904-1907) http://maps.nypl.org/warper/layers/1057
Kings County (1890) http://maps.nypl.org/warper/layers/875
Robinson (1886) http://maps.nypl.org/warper/layers/867
Hopkins (1880) http://maps.nypl.org/warper/layers/962
Bromley (1880) http://maps.nypl.org/warper/layers/868
Beers (1874) http://maps.nypl.org/warper/layers/1052
Perris (1855) http://maps.nypl.org/warper/layers/860
Miscellaneous (pre-1850) http://maps.nypl.org/warper/layers/1018
Georectified Manhattan Maps (Raster)
NYC Aerials (1924) http://maps.nypl.org/warper/layers/909
Bromley (1911) http://maps.nypl.org/warper/layers/871
Bromley - to 110th St (1898) http://maps.nypl.org/warper/layers/906
Bromley (1897) http://maps.nypl.org/warper/layers/863
Robinson (1885) http://maps.nypl.org/warper/layers/864
Dripps (1867) http://maps.nypl.org/warper/layers/877
Perris - to 72nd St (1857-62) http://maps.nypl.org/warper/layers/859
Perris - to 42nd St (1852-4) http://maps.nypl.org/warper/layers/861
Burr (1841) http://maps.nypl.org/warper/maps/12834
Burr (1839) http://maps.nypl.org/warper/maps/12798
Burr (1829) http://maps.nypl.org/warper/maps/12809
British HQ (1782) http://maps.nypl.org/warper/maps/15066
Ratzer (1767/1776) http://maps.nypl.org/warper/maps/15264
Montresor (1766) http://maps.nypl.org/warper/maps/12949
Castello (1660) http://maps.nypl.org/warper/maps/13913
Georectified Queens Maps (Raster)
Atlas of Long Island (1873) http://maps.nypl.org/warper/layers/887
Wolverton (1891) http://maps.nypl.org/warper/layers/878
Hyde (1904-1907) http://maps.nypl.org/warper/layers/1047
Bromley (1909) http://maps.nypl.org/warper/layers/870
Hyde (1912-1917) http://maps.nypl.org/warper/layers/1046
Georectified Staten Island Maps (Raster)
Beers (1874) http://maps.nypl.org/warper/layers/869
Robinson (1907) http://maps.nypl.org/warper/layers/1036
Survey of Richmond (1913) http://maps.nypl.org/warper/layers/915
Bromley (1917) http://maps.nypl.org/warper/layers/1035
Miscellaneous (pre-1850) http://maps.nypl.org/warper/layers/1021
APIs
NYPL Digital Collections - http://api.repo.nypl.org/
Gazetteer Query API - https://gist.github.com/riordan/19a5ebe68536825648f6