My group wants to prototype a web database application that would do
the inverse of what the usual trip planners do. Rather than computing
how to get from place A to place B at some particular time, the
application would compute and rank a selection of places where several
needs could be satisfied for multiple parties using public transit.
(So back ending from some web-available trip planner won't work, since
those assume you already know where you want to go.)
Our preferable area of interest would be the SF Bay Area, and GTFS
data for the several large transit agencies would be just the thing.
We've already discovered that the enlightened folks at BART and
CalTrain make GTFS feeds freely available, but what about the other
agencies (Muni, AC Transit, etc.)?
I could easily understand that the data are simply unavailable, but it
is clear that outfits like the Google transit and the 511.org trip
planners must have this data available. It's also clear that their
data wasn't obtained by html scraping, since these planners are able
to refer to local bus stops that aren't even on the abridged schedules
such as published online by AC Transit. I guess humans are easily
capable of interpolating arrival times, but Google knows the exact
location of each local stop and can map same.
Can anyone answer definitively whether GTFS data or any logical
equivalent is readily available for the missing agencies? Or is this
a case where Google and 511.org have made deals with the owners of
this proprietary data?
That sounds like a very cool application. I hope you keep at it,
because uses like these help demonstrate the value of maintaining
public schedule data to agencies.
As others have pointed out, Google gets GTFS feeds from individual
agencies in the bay area, though not all of them have decided to
publish their feeds to a wider audience yet. They also share data
with MTC, which republishes it in an ad-hoc CSV format with somewhat
restrictive terms of service--see this thread for some details:
http://groups.google.com/group/transit-developers/browse_frm/thread/bae6dc26f895f93a
I would encourage you to join the transit developers mailing list,
since there's a lot of discussion of data and its uses there:
http://groups.google.com/group/transit-developers
Also, the upcoming TransitCampBayArea in September would be a great
place to demo your project and talk to agency folks about getting
data:
http://barcamp.org/TransitCampBayArea2
Joe