One of the big things that people overlook is the visibility that
using an mbus provides. Moving pieces to a direct RPC mechanism may
make sense, sans the distributed queuing mechanism, but the loss of
visibility into the system is what one should be most concerned with..
`nats-sub >` is a powerful tool, since NATS treats every messaging
pattern underneath as pub/sub, meaning you can watch the entire system
with the fore-mentioned command.
=derek
On May 1, 5:22 pm, Matt Page <
mp...@rbcon.com> wrote:
> > If it weren't for queue groups, writing an HA version ofNATSserver would be very straight forward.
>
> If you don't care about the round-robin nature of queue groups (only that
> the distribution of messages is fair), a probabilistically correct
> implementation
> doesn't seem too challenging.
>
> > Is there anything in vcap that actually uses queue groups?
>
> Both the HealthManager (when sending requests to CloudControllers) and
> the CloudController (when sending requests to Stagers) use queue groups.
>
> This is something that we'd like to change in the future by eliminating all
> command-and-control traffic that flows overNATS. Ideally, we would only
> usedNATSfor discovery and rely on point-to-point RPC calls for all other
> requests.
>
> Cheers,
> Matt
>
>
>
>
>
>
>
> On Tue, May 1, 2012 at 2:36 PM, Mike Heath <
elc...@gmail.com> wrote:
> > If it weren't for queue groups, writing an HA version ofNATSserver would
> > be very straight forward. Is there anything in vcap that actually uses queue
> > groups?
>
> > -Mike
>
> > On Monday, April 23, 2012 8:21:40 PM UTC-6, yssk22 wrote:
>
> >> Hi
>
> >> I think mbus (NATS) would be a SPoF in the current vcap architecture.
> >> It should have 'scalability' and 'high-availability'. And I know the
> >> original repository ofNATS'sis working on cluster feature branch so
> >> we would get both of them in the future.
>
> >> But at this time, I'd like to discuss/share 'high-availablity'
> >> workaround or know-how with the core developer team.
>
> >> I found the facts:
>
> >> - All of components except for CloudController would exit ifNATS
> >> raise an error (like connection errors).
> >> - If a DEA exits with theNATSerror, applications on the DEA still
> >> work and can respond the requests.
> >> - After a while DEA exits, routers unregister applications on the dead
> >> DEA because heartbeats fail. (*A)
>
> >> I took an workaround scenario forNATSserver failure.
>
> >> 1. ifnatsserver is down, then the IP address ofNATSshould be moved
> >> to another server, whereNATSis restarted.
> >> # I use heartbeat&pacemaker to do this
> >> 2. DEAs can reconnect the newNATSserver because the IP is not