Good Evening Folks,
I've been "tinkering" with the schedule feeds from TRUST (is that technically correct?), consuming the TOC snapshot and then daily updates, I've also been happily consuming VSTP and movements over STOMP. I've been able to get a resolved schedule, see the planned times, see
the actual times, capture cancellations/reinstatements, etc - there are a few edge cases I need to iron out, but in general it's good.
I've been ignoring (up until now) Darwin, however there's various pieces of information in Darwin I want, formations, loading, platform changes, estimates. I've got the PushPort feed from Kafka into my own message streaming solutions (NATS), so that side is all good.
My question is around the necessity of consuming both the Darwin TimeTable files and the PP feed, with the view I've already materialised the schedule from TRUST.
Can I "get away with" just consuming the contents of the kafka topic? Or will I be missing critical data if I don't consume the file based snapshots as well?
My naïve view of the snapshots suggests there's no details in there that I haven't already materialised from ITPS files and feed. However, the one thing that does seem to necessitate it, is the mapping from the WTT UID/Date/Signalling ID combination to a RID?
Could someone just give me a nod if I've got the right view of the data?
A huge thanks to everyone who's contributed to the google group in the past, it's been hugely useful to read through and gain more understanding.
Incidentally, does anyone know what partition key is being used on the PP Kafka topics? I noticed that the topic has two partitions, and as such message times shuffle around now and again, depending where you are on each partition (no global ordering accross partitions in Kafka topics). Are we safe to assume messages about a RID (for example) will always be in the same partition, and thus there's no ordering concern?
Peter.