Thanks
Sent from my iPhone
If you look at calendar_dates you will see that these schedules are turned on for various dates that they are active and turn off the base schedules, those with single digits.
This follows the gtfs spec and it is not broken.
While I don't agree with Mr. Buddy's approach, I too feel that the files are broken. According to the current data set, no trains are running on Fridays, even if you account for the exceptions in the calendar_dates.txt file.
Ken
I will look into this and post my findings on the forums tomorrow.
In addition there will be new base schedules being published hopefully by the end of this week that extend past October's base.
Mike Dannenbring
Assistant Director IT
MTA Metro-North Railroad
I'm afraid your interpretation of the purpose of the GTFS spec is incorrect. The MTA's implementation is spot-on (recent errors aside).
In particular:
> service_ids in the 2xxx range were added w/o a documented explanation
GTFS does not require (or even suggest) the purpose of service_ids should be documented.
> the change dramatically increased the data set size
Irrelevant. What matters is if the data is correct if the rules are applied correctly. Your code did not follow the rules, so it got incorrect results.
> DO NOT code holidays as exceptions, let the developers handle this in their code
This runs directly counter to the GTFS spec. You're not supposed to code for holidays at all; just follow the GTFS rules and correctly coded feeds will always give you the right times, as MTA's does.
> ALL of the weekday trips were tagged for removal on 070411, then ALL of the weekend trips were added back in for 070411
This is exactly the pattern GTFS recommends feed creators use for holidays.
If the MTA should choose to create a different service_id and set of trips for every single day, that would be legal according to GTFS. There is no requirement, or even recommendation, to keep feed sizes small, however much I appreciate that as a mobile developer.
Adam
On Sep 27, 2011, at 10:33 PM, Wayne wrote:
Thank you for this information, it is very helpful in understanding the underlying logic.
As of last Saturday, Brett and I designed a preprocessor to proliferate the schedule out to an atomic form, then merge the trips back into a normalized state. The result is a set of .csv files that contain all trips and exceptions in a much smaller data set format. These files are then run through our importer and populated into our data store. Our biggest challenge had been checking the integrity and accuracy of the data store so that we haven't lost or misrepresented any trips.
Again, thank you for taking the time to provide this explanation. It makes a lot of sense once you see the rules for which the data was gen'd.
Wayne
John,
I truly appreciate your help and time.
Thanks!
Sent from my iPhone
I'm only pushing for them to be corrected so Metro North riders that use GTFS apps have correct data.
Again, I apologize for any misunderstanding and that I appreciate the MTA's help on this board as well as all participants.
Thanks!
Sent from my iPhone
Only change I'd recommend is doing entirely calendar_dates instead. LIRR does this.
Thanks.
Sent from my iPhone
There really is an issue and the GTFS should probably be recreated.
I appreciate everyone's support of this board and open data.
Thanks!
Sent from my iPhone
All,
I have corrected the issue with the calendar.txt. This was a clerical issue, not a systematic problem.
I have also included the new base schedules that go live on October 16th. I created one day exceptions for every day until October 16th.
Friday's Yankee schedule is also included. I will publish additional Yankee schedules as they become available.
Thak you for your patience,
Mike Dannenbring
Assistant Director IT
MTA Metro-North Railroad
We had, but the file would be much larger than it is. Also since we have 3 schedules for weekday service (Monday, Tuesday thru Thursday and Friday) . The reason why you have 5 or 6 smallish service id's is because we combined schedules from our 6 base schedules. It is also the reason for the single trip on Sunday and Monday as well as some select Friday only trains. You would wind up with 14 or so service id's with over 700 trips instead of just the 8 Yankee schedules.