I'm trying to get a relatively simple setup going on 2 physical hosts:
baklava: node + master
ramen: node
Both machines running Ubuntu 10.04 (lucid) & CDH3.
Logical node config:
agent1 : text("/home/yun/flume-src") | autoE2EChain;
collector1 : collectorSource | text("/tmp/flume-test-output");
Master status:
http://www.mooh.org/public/flume/20100701-flumemaster.html
Why does the autoE2EChain fail with a "no collectors" message?
If I use an agentSink("baklava.mooh.org") with agent1 then I'm able to
transport events. However if I stop & restart the node process on
either machine it seems all previous events are replayed, even those
already sent previously.
Could someone explain the ack behaviour?
Also, in my initial goofing around with a console source where I typed a
message every few minutes, I notice that in the replay behaviour above
the events arrive in groups, and out of order. Is this intentional? I
would've expected that events from a single source should arrive in
source submission order.
Lastly... how do I get rid of the extraneous entries for "collector",
"baklava.mooh.org" and "ramen.mooh.org" in the Node status?
Thanks in advance :)
yun
hi all,
I'm trying to get a relatively simple setup going on 2 physical hosts:
baklava: node + master
ramen: node
Both machines running Ubuntu 10.04 (lucid) & CDH3.
Logical node config:
agent1 : text("/home/yun/flume-src") | autoE2EChain;
collector1 : collectorSource | text("/tmp/flume-test-output");
Master status:
http://www.mooh.org/public/flume/20100701-flumemaster.html
Why does the autoE2EChain fail with a "no collectors" message?
If I use an agentSink("baklava.mooh.org") with agent1 then I'm able to
transport events. However if I stop & restart the node process on
either machine it seems all previous events are replayed, even those
already sent previously.
Could someone explain the ack behaviour?
Also, in my initial goofing around with a console source where I typed a
message every few minutes, I notice that in the replay behaviour above
the events arrive in groups, and out of order. Is this intentional? I
would've expected that events from a single source should arrive in
source submission order.
Lastly... how do I get rid of the extraneous entries for "collector",
"baklava.mooh.org" and "ramen.mooh.org" in the Node status?
Thanks in advance :)
yun
Thanks for the explanations, everything now works as expected.
On 1/07/2010 11:41 PM, Jonathan Hsieh wrote:
> Right now the extraneous node statuses will live on in DECOMMISSIONED
> state. We want to make sure you know what has happened to these nodes
> and my first thought is that we don't want them to automatically
> disappear. Maybe we could give you the ability make them "go away"
> (which would also let the system know that you have seen the effect).
> Does that seem like a reasonable idea? (if so, go to
> http://issues.cloudera.com and add a file a bug/feature request!)
Ah ok, that makes sense, will file a feature request.
Is this also the source of these messages:
2010-07-02 00:13:22,187 INFO com.cloudera.flume.agent.LivenessManager:
Logical Node 'ramen.mooh.org' not configured on master
They're appearing every 5 seconds which is cluttering the log somewhat. :)
Thanks!
yun