Simon,
There seem to be two big sets of TtT data - one where people have adopted the syntax, and one where they are naturally using something close to it. Both could be extremely valuable. I've been picking them up and storing them in different sets. For the strict #TtT syntax, I have code that parses all of them into records, removes duplicates, and throws out a spreadsheet file:
http://tinyurl.com/ttt-sebusca. I run the scripts to generate a new spreadsheet every few hours - and manually update/upload the spreadsheet file. This is a patch solution - ideally we would get this information to a live webpage - or get it integrated into other sources (like People Finder / Ushahidi).
To pick strict #TtT up, I've had to go to the presence of third tag (#sitio, #contacto, or #info a this point is all I'm checking for). That's provided a pretty noiseless sample. I also had to adjust my location parsing to accept the next word, even if it's tagged - because so many are (smartly) tagging the city in their tweets.
As for the non-TtT tweets, I tried to write some parsing stuff to pick up the names, locations and contact info. I had a hard time removing duplicate records, because there were so many variations in the ways that names were presented. And then I ran out of cycles. My to do list has on it to do something with those #sebusca_ location tweets. But I won't get to it. From what I can tell, a decent parser could turn about 80% of those messages into records of some kind. They won't be super clean, but they'll be effective.
It would be great if you could pump over both types of tweets in a record format that Ushahidi can/will accept - both strict #TtT tweets and less strict ones. I find splitting them up makes for one very pretty set and one very-mildly noisy set.
And we are still pushing out the syntax. The initial push for #chile was pretty significant - now it goes out in waves. The best chance for deployment is to find an official information source to carry the prescriptive tweets on their Twitter account. We're still brainstorming ways of getting that done - though we have seen smaller sources in Chile helping to distribute.
I hope you had an excellent conference last week.
Kate