Thanks for all the pointers, especially to tracing/bin!
It looks like the python API gives all the functionality we could ask for here. The histograms2csv doesn't quite cover all the information we need (IIRC, it was lacking the pageset repeat number, and I couldn't figure out how to get it to give non-summarized data. My memory is hazy though, since I did this 3 weeks ago, and meant to update this with my findings then...), but whipping up a quick script using the provided API was really straightforward.
Making a colab kernel once we have a better idea of what all we want sounds like a good idea. I'm not anywhere near a colab expert, though, so... :)