Hi,
We are working with the GTFS bus schedules from February 2025 and are finding several Weekday trip ids representing identical or very similar trips.
Why do these trips repeat, and is there an easy way to clean these up based on trip id or other data? Currently, we drop duplicates based on first departure, last arrival, number of stops, direction and route.
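For reference, here is a minimal sketch of the kind of de-duplication described above. It assumes the standard GTFS columns from trips.txt (trip_id, route_id, direction_id) and stop_times.txt (trip_id, stop_sequence, arrival_time, departure_time), loaded as lists of dicts (e.g. via csv.DictReader). The helper names to_seconds and group_duplicate_trips are made up for illustration and are not part of any library.

```python
from collections import defaultdict


def to_seconds(hhmmss):
    """Convert a GTFS time such as '25:10:00' to seconds since the start of the service day."""
    h, m, s = (int(part) for part in hhmmss.split(":"))
    return h * 3600 + m * 60 + s


def group_duplicate_trips(trips, stop_times):
    """Return lists of trip_ids that share route, direction, first departure,
    last arrival and number of stops (the criteria listed above)."""
    # Collect the stop_times rows belonging to each trip.
    by_trip = defaultdict(list)
    for row in stop_times:
        by_trip[row["trip_id"]].append(row)

    groups = defaultdict(list)
    for trip in trips:
        rows = sorted(by_trip[trip["trip_id"]], key=lambda r: int(r["stop_sequence"]))
        if not rows:
            continue  # trip has no stop_times; nothing to compare
        key = (
            trip["route_id"],
            trip["direction_id"],
            to_seconds(rows[0]["departure_time"]),
            to_seconds(rows[-1]["arrival_time"]),
            len(rows),
        )
        groups[key].append(trip["trip_id"])

    # Keep only the keys that are shared by more than one trip.
    return [ids for ids in groups.values() if len(ids) > 1]
```

Note that this key ignores service_id and the intermediate stop times, so trips that differ only in the middle of the run would still be grouped together.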
Some examples of identical trips:
B74:
UP_A5-Weekday-SDon-086500_B74_605, UP_A5-Weekday-SDon-086500_B6_204, UP_A5-Weekday-SDon-086500_B6_281, UP_A5-Weekday-SDon-086500_B6_206
and
Q22:
41914478-FRPA5-FR_A5-Weekday-10-SDon, 41914490-FRPA5-FR_A5-Weekday-10-SDon, 41914500-FRPA5-FR_A5-Weekday-10-SDon, 41914714-FRPA5-FR_A5-Weekday-10-SDon
There are also trips where most stop times are identical, but one or more stop times differ by a few seconds. Example trip ids:
B8 - 50th stop time shifts by 11 seconds:
JG_A5-Weekday-147400_B8_150 and JG_A5-Weekday-SDon-147400_B8_150
Q54 - all stop times shift by 1:50 after the 7th stop:
FP_A5-Weekday-121700_Q54_725 and FP_A5-Weekday-SDon-121700_Q58_880
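Near-duplicates like these can be located by comparing two trips stop by stop. The following is only a sketch under the same assumptions as before: stop_times.txt rows are loaded as dicts with the standard GTFS columns, and the function names to_seconds and stop_time_offsets are hypothetical.

```python
def to_seconds(hhmmss):
    """Convert a GTFS time such as '25:10:00' to seconds since the start of the service day."""
    h, m, s = (int(part) for part in hhmmss.split(":"))
    return h * 3600 + m * 60 + s


def stop_time_offsets(stop_times, trip_a, trip_b):
    """Return {stop_sequence: offset in seconds (trip_b minus trip_a)} for every
    shared stop_sequence where the departure times of the two trips differ."""
    def departures(trip_id):
        return {
            int(row["stop_sequence"]): to_seconds(row["departure_time"])
            for row in stop_times
            if row["trip_id"] == trip_id
        }

    a = departures(trip_a)
    b = departures(trip_b)
    shared = sorted(a.keys() & b.keys())
    return {seq: b[seq] - a[seq] for seq in shared if b[seq] != a[seq]}
```

An empty result means the two trips have identical departure times at every shared stop_sequence; a single entry would correspond to a case like the B8 pair, and a run of equal offsets to a case like the Q54 pair.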
Thank you