Testing HA for Apex operators

1 view
Skip to first unread message

Pavan Kulkarni (pavkulk2)

unread,
Apr 7, 2017, 1:02:31 PM4/7/17
to dt-u...@googlegroups.com, Devavrath Subramanyam (desubram)

Hello all,

 

I was trying to test the HA for my cluster by bringing a node in my cluster down, to see how Apex behaves, I see it takes a lot of time for the host to go down, I have my dfs heartbeat property set to 3 seconds

 

Can anyone please let me know, what should be done so that I see all my apex running containers shift to a active node

 

Thanks

Pavan Kulkarni

-Software engineer

Cisco

Sanjay Pujare

unread,
Apr 7, 2017, 1:13:49 PM4/7/17
to Pavan Kulkarni (pavkulk2), dt-u...@googlegroups.com, Devavrath Subramanyam (desubram)
When you say "...it takes a lot of time for the host to go down..." how much time is that? Also I suppose you mean it takes a long time for the detection.

If you are using the DT Console do you see the state of the nodes and containers in the UI showing "failed" soon after the host has gone down? How long does that take?

--
You received this message because you are subscribed to the Google Groups "DataTorrent Users Group" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dt-users+unsubscribe@googlegroups.com.
To post to this group, send email to dt-u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dt-users/B8A6D295-1A81-4BFB-B187-7A00EF438AC8%40cisco.com.
For more options, visit https://groups.google.com/d/optout.

Reply all
Reply to author
Forward
0 new messages