I've been playing with this feature with some limited success. I have Phirehose setup on a dedicated server with monitoring so I have a rough idea of what has been happening on it.
I can follow 1 user with pretty good throughput from twitter (tested with a test account I have and would get most tweets in real time and @replies)
Now as I push through and build up the list past 5-10 accounts it starts to choke and burst valid tweets on occasion.
Keep in mind that I'm only getting a couple of tweets out (maybe 5-10 in an hour) and then radio silence for hours. I know this is not accurate since I have followed
these individual accounts (which have a HIGH traffic of @'s and RT's when followed by themselves.) So I know something is up with either twitter's feed or something
in the library.
TLDR:
I see phirehose ghetto collector using bandwidth talking to twitter, but I'm not getting near the amount of content expected for the bandwidth used.
Some hints as to where to look would be appreciated.
I've also started looking at the Twitter reference library (hbc @
https://github.com/twitter/hbc) to see what they are doing to parse incoming json stream as I think that this may be the source of trouble/breakage in tweets.
Thanks for the anticipated feed back!
--Kyle