Twitter to open source Hadoop-like tool: "Storm"

16 views
Skip to first unread message

Scott Tsai

unread,
Aug 6, 2011, 4:30:14 AM8/6/11
to 沈育德, 王海威, 周凱楓, itrs google group
Apparently Twitter recently acquired a team with extensive Hadoop
experience and is going release a Hadoop replacement:
1. Critique of Hadoop: http://tech.backtype.com/the-dark-side-of-hadoop
Worth a read even if, like me, you've never used Hadoop

2. http://engineering.twitter.com/2011/08/storm-is-coming-more-details-and-plans.html
They describe Storm as a "“COMPLEX EVENT PROCESSING” system rather
then a plugin Hadoop replacement, ex: Storm doesn't have a builtin
datastore.

One system programming problem mentioned in the second post, "tracking
and cleaning up sub processes" is a problem of the traditional Unix /
POSIX API.
i.e. you can monitor a direct child process but not "child-of-child processes"
I personal would solve it by using Linux specific APIs like "cgroups":
http://libcg.sourceforge.net/

What's your take?

Tien Ren Chen

unread,
Aug 7, 2011, 1:43:39 AM8/7/11
to it...@googlegroups.com, 沈育德, 王海威, 周凱楓
According to my friend who works with Twitter - they don't even start
to use it internally. don't expect too much...

2011/8/6 Scott Tsai <scot...@gmail.com>:

> --
> You are subscribed to the "itrs" group, see: http://groups.google.com/group/itrs
>

Chia Hao Lo

unread,
Aug 7, 2011, 2:15:31 AM8/7/11
to it...@googlegroups.com, 沈育德, 王海威, 周凱楓
Ummm, yet another good example shows that the internal source is the most valuable XD

chlo

2011/8/7 Tien Ren Chen <trche...@gmail.com>
Reply all
Reply to author
Forward
0 new messages