Chile version of Tweetneed.org at chile.tweetneed.org

Simon Twigger

unread,

Mar 1, 2010, 5:31:19 PM3/1/10

to Tweak the Tweet, katest...@earthlink.net

I've put up another version of tweetneed.org for the chile tweets -
its at http://chile.tweetneed.org. Its still got Haiti text and stuff
on there but I'll clean that up tonight with any luck.

I'm using the primary tags and hash tag regex from Kate. I have a
simple cron job running every minute searching for any of the main
tags, then I'm filtering on the regex to decide if it should be stored
or not.

Lots of #sebusca tweets, I'd like to try and modify the code to pull
those sections out, perhaps identify names, etc. and aggregate
somewhere on the site. The top needs, etc. are not correct as I
haven't had a chance to add in the various spanish tags into the
ontology, I will look at the google docs page for that tonight and try
to clean that up too.

Any comments or suggestions welcome, if there are specific feeds or
other ways to slice and dice the data that would be useful to anyone,
please let me know and I'll see what I can do.

Simon.

M. Edward (Ed) Borasky

unread,

Mar 1, 2010, 6:16:22 PM3/1/10

to tweak-t...@googlegroups.com, Simon Twigger, Tweak the Tweet, katest...@earthlink.net

You can lift my Ruby Twitter Streaming API code from
http://github.com/znmeb/Chile-Quake-Tracking. There are still some
places where I have hard-coded the syntax, but it's dumped to a YAML
file at that point and read right back in again. Once you have the
syntax you want, just comment out the dump part and it's ready to go.

Right now it captures two logs. One is everything tagged with #chile,
#fuerzachile or #terremotochile, The second is everything that has one
of the above hashtags *and* one of the tweaks defined in the Google
Doc. The both are in JSON. The first one gets compressed with BZIP2
every hour, but the second one stays uncompressed.
--
M. Edward (Ed) Borasky
borasky-research.net/m-edward-ed-borasky/

"A mathematician is a device for turning coffee into theorems." ~ Paul Erd?s

Catharine E. Starbird

unread,

Mar 1, 2010, 8:17:44 PM3/1/10

to tweak-t...@googlegroups.com

Simon,

This is great. We're pointing to it from our Twitter feed. Let me know
if/when you make significant updates - and we'll redirect.

Tweak the Tweeters,

If anyone is interested in tracking current TtT use on Twitter, follow
@TtT_Pacific and @TtT_Chile.

This link contains a concise Excel file w/ strict TtT syntax. We're
updating it every few hours - and we may turn it into a Google Doc for
easier access:

http://www.cs.colorado.edu/~starbird/sebusca_chile.xls

Also see http://docs.google.com/Doc?docid=0AUMdSKIMnfTPZGNjajk0N2NfMmcybWIzcmRr&hl=en
for a crowd-sourced conversation about what the syntax should look
like. We've been receiving lots of help in the translation.

We're still looking for venues to distribute these resources. If you
have any ideas, let us know.

Kate

M. Edward (Ed) Borasky

unread,

Mar 1, 2010, 8:28:43 PM3/1/10

to tweak-t...@googlegroups.com, Catharine E. Starbird

I'll be doing a couple of blog posts late tonight. One of them will be
about my "filter.rb" code, which I will post to DZone. That gets
widely picked up - I've pulled in as many as 800 unique visitors from
a post there, versus 100 - 200 from a post to Twitter. I get nothing
from posting to Facebook. ;-)

If you want to post things there, use this link:

http://www.dzone.com/links/add.html

You'll need to have an account, and I think "new" accounts get
"moderated", for some definition of the two words in quotes. But
they've been taking everything I post, so at worst, you can send me
links and I'll post them.

--
M. Edward (Ed) Borasky
borasky-research.net/m-edward-ed-borasky/

"A mathematician is a device for turning coffee into theorems." ~ Paul Erd?s

Sophia Liu

unread,

Mar 1, 2010, 8:37:55 PM3/1/10

to tweak-t...@googlegroups.com

Simon,

Kate sent the wrong link out of her excel file. I attached it below and created a Google spreadsheet of it: http://spreadsheets.google.com/ccc?key=0AtGZBDg1MfnrdHdVU2FoTnFPX3ZpVkxlRWR3TWg0S2c&hl=en