Александр Цыплаков
Jan 22, 2026, 5:31:59 AM
to A gathering place for the Open Rail Data community
Good day all,
I wanted to ask whether I am matching Schedule and Train Movement data on train_uid correctly, or whether there is something I am missing (I am getting unexpected results).
Context:
I am trying to collect some historic information on train movements, and to do that I've been collecting Train Movement data from TRUST.
I wanted to enrich train movement data with some train parameters by joining it with information from the Schedule.
To do that I've used Train Activation messages to assign a train_uid to all Train Movement messages with the same train_id code. I then tried joining the result to the Schedule on train_uid (being mindful of the specific dates).
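For reference, the join described above can be sketched roughly as follows. This is only an illustration under assumptions: the dictionary keys (train_id, train_uid, start_date, end_date) and the sample values are hypothetical stand-ins for the real feed fields, and the date check only tests the validity range, not days-run or overlay/cancellation records.

```python
from datetime import date


def attach_train_uid(activations, movements):
    """Copy train_uid from activations onto movements sharing a train_id."""
    # Assumption: a train_id maps to a single train_uid within the collected window.
    uid_by_train_id = {a["train_id"]: a["train_uid"] for a in activations}
    return [
        {**m, "train_uid": uid_by_train_id.get(m["train_id"])}
        for m in movements
    ]


def find_schedule(schedules, train_uid, run_date):
    """Return the first schedule with this train_uid valid on run_date, else None."""
    for s in schedules:
        if s["train_uid"] == train_uid and s["start_date"] <= run_date <= s["end_date"]:
            return s
    return None


# Hypothetical sample records, not real feed data.
activations = [{"train_id": "TRAIN00001", "train_uid": "A12345"}]
movements = [{"train_id": "TRAIN00001"}, {"train_id": "TRAIN00002"}]
schedules = [
    {"train_uid": "A12345", "start_date": date(2026, 1, 1), "end_date": date(2026, 1, 7)}
]

enriched = attach_train_uid(activations, movements)
```

Movements whose train_id has no activation in the collected data end up with train_uid set to None, so they can be counted separately from movements whose train_uid is simply absent from the Schedule.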
The Schedule itself is the full_all_toc schedule for 1st of January 2026, which has been "progressed" by applying the schedule updates from 1st to 7th of January to it.
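By "progressed" I mean something along these lines. This is a simplified sketch, not the exact code: it assumes each record is identified by (train_uid, start_date, stp_indicator) and that each update carries a transaction_type of "Create" or "Delete". All field names and values here are hypothetical simplifications.

```python
def schedule_key(record):
    """Assumed identity of a schedule record: UID, start date and STP indicator."""
    return (record["train_uid"], record["start_date"], record["stp_indicator"])


def apply_updates(snapshot, updates):
    """Return a new snapshot with the updates applied in order."""
    progressed = dict(snapshot)
    for rec in updates:
        key = schedule_key(rec)
        if rec["transaction_type"] == "Delete":
            # Deleting a record that is not present is ignored.
            progressed.pop(key, None)
        else:
            # "Create" adds the record, or overwrites one with the same key.
            progressed[key] = rec
    return progressed


# Hypothetical sample records, not real feed data.
full = [
    {"train_uid": "A12345", "start_date": "2026-01-01", "stp_indicator": "P",
     "transaction_type": "Create"},
]
daily_updates = [
    {"train_uid": "B54321", "start_date": "2026-01-03", "stp_indicator": "N",
     "transaction_type": "Create"},
    {"train_uid": "A12345", "start_date": "2026-01-01", "stp_indicator": "P",
     "transaction_type": "Delete"},
]

snapshot = {schedule_key(r): r for r in full}
progressed = apply_updates(snapshot, daily_updates)
```

One consequence of doing it this way: a record deleted by an update during the week is no longer in the progressed Schedule, even if trains ran against it earlier in the week.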
What I got is that about 15% of messages failed to join: the Train Movement messages have train_uid codes that I can't find in the Schedule. I have done this for the first week of 2026, and there are about 8000 train_uid codes that I see in Train Activation messages but can't find in the Schedule.
About 1000 of these codes are numeric 5-digit codes that start with a "4" (41632, 41655, etc.). Since all the train_uids I see in the Schedule are a letter + 5 digits, that already looks like an edge case that I didn't account for properly.
But the remaining unmatched codes are regular 6-character codes, and I still can't seem to find them in the Schedule.
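The split between the two kinds of unmatched codes can be reproduced with something like the sketch below. The function names are hypothetical, and stripping whitespace before comparing is an assumption on my side, in case the two feeds pad train_uid differently.

```python
import re
from collections import Counter

LETTER_PLUS_5 = re.compile(r"[A-Z]\d{5}")
NUMERIC_5 = re.compile(r"\d{5}")


def uid_shape(train_uid):
    """Classify a train_uid by its format."""
    uid = train_uid.strip()
    if LETTER_PLUS_5.fullmatch(uid):
        return "letter+5 digits"
    if NUMERIC_5.fullmatch(uid):
        return "5 digits"
    return "other"


def unmatched_by_shape(activation_uids, schedule_uids):
    """Count distinct activation train_uids missing from the schedule, per format."""
    # Assumption: padding differences are not meaningful, so strip before comparing.
    known = {u.strip() for u in schedule_uids}
    missing = {u.strip() for u in activation_uids} - known
    return Counter(uid_shape(u) for u in missing)


# Hypothetical sample values, apart from the two numeric codes quoted above.
counts = unmatched_by_shape(
    ["41632", "41655", " 41632", "A12345", "B54321"],
    ["A12345"],
)
print(counts)
```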
Sorry for the long message. My question is: is there some additional step that I am missing when connecting train parameters from the Schedule to a specific train movement via train_uid? Or should all unique train_uids listed in TRUST Train Activation messages definitely be in the Schedule + Schedule updates, meaning the problem is somewhere in my code?
Regards,
Alex