What is the value of such a dataset to an "academic researcher"? I
consider myself an academic researcher, though I don't have a formal
position as one. What can you do with a "real" Twitter "social graph"
that you can't do with one generated by random techniques based on
statistical sampling of Twitter data?
A million-user "real" social graph, even assuming fewer than 5,000
friend_ids and follower_ids per user, costs two million API calls. At
350 calls per hour, that works out to 238 days by my calculation. And
during that 238 days, the social graph is changing many times a
second.
I don't see the value of a *static* sample of the Twitter social graph.
A randomly-generated graph of a much larger size could be
constructed in a day, *including* coding time, *and* you could
incorporate the changing nature of Twitter social graphs in a
simulation. What would be interesting to me would be the *model*, not
the data. To quote Dr. Neil Gunther
(http://www.perfdynamics.com/Manifesto/gcaprules.html):
"Data comes from the Devil, only models come from God."
(And smiling at a subtle irony in my standard email signature) ;-)
--
M. Edward (Ed) Borasky
borasky-research.net/m-edward-ed-borasky/
"A mathematician is a device for turning coffee into theorems." ~ Paul Erdos
What is the value of such a dataset to an "academic researcher"? I consider myself an academic researcher, though I don't have a formal position as one. What can you do with a "real" Twitter "social graph" that you can't do with one generated by random techniques based on statistical sampling of Twitter data?
A million-user "real" social graph, even assuming fewer than 5,000 friend_ids and follower_ids per user, costs two million API calls. At 350 calls per hour, that works out to 238 days by my calculation. And during that 238 days, the social graph is changing many times a second. A randomly-generated graph of a much larger size could be constructed in a day, *including* coding time, *and* you could incorporate the changing nature of Twitter social graphs in a simulation.
(Smiling at the subtle irony in my standard email signature) ;-)