Sample GTFS/GTFS Realtime datasets for software testing

Skip to first unread message

Sean Barbeau

Mar 3, 2022, 5:51:12 PMMar 3
to Transit Developers
Hi all,
Have anyone created a set of sample GTFS or GTFS Realtime datasets used for software unit tests that are publicly shared?

An example of what I'm referring to:

The general idea is to have a sample GTFS dataset that doesn't change, and that models certain conditions (e.g., common errors), so you know that when you make code changes (e.g., in a validator) your expected output is consistent.


Sean Barbeau
Center for Urban Transportation Research
University of South Florida

Tom Brown

Mar 21, 2022, 8:27:03 PMMar 21
Hi Sean, for theSorry for ther/tests/data hasn't been updated much in many years but seems close to what you want for static GTFS. Sorry for my slow reply. I'm not working with transit data any longer but lurking. Thank you for your ongoing effort!

You received this message because you are subscribed to the Google Groups "Transit Developers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
To view this discussion on the web visit

Sean Barbeau

Mar 22, 2022, 10:01:09 AMMar 22
to Transit Developers
Thanks Tom!

For anyone interested, I'm tracking these here on the awesome-transit list:

Sean Barbeau

Mar 22, 2022, 3:33:28 PMMar 22
to Transit Developers
From Drew @ Interline (for some reason his reply was never posted):

Hi Sean,


Yes, we use this type of approach for testing the transitland-lib library that handles GTFS and GTFS Realtime parsing and validation for Transitland:


There are test case feeds for all three levels of GTFS static validation:


1. "bad entities" at the single row level:


2. validation errors that that involve entities in one or more files:


3. best practices:


You'll notice that it's just the relevant files in each directory for each test case. At the time of running the test, the tooling injects the rest of the feed files that are needed to make the feed complete. In theory, the tooling could be expanded to also generate full feed archives for external use.


For GTFS Realtime, we create test cases in JSON and turn them into PB contents in the test rig. Much less coverage for that so far — it's mainly for testing matching logic:


Reply all
Reply to author
0 new messages