Perry Krug
Solutions Architect
direct: 831-824-4123
email: pe...@couchbase.com
I've seen this happen when nodes are under a heavy load. How is your CPU load on those nodes?--chad
Perry Krug
Solutions Architect
direct: 831-824-4123
email: pe...@couchbase.com
Is this running in Amazon or on your own hardware?Yes, the UI is a valid place for looking at this data, but it's only one perspective. If the nodes are unable to communicate with each other, that doesn't necessarily mean that your clients are also unable to (hence my question about your application). We've seen a few cases where some delay in the cluster management side of things can cause this, but does not impact the actual functioning of the software. While we certainly want to resolve that, the process of diagnosis changes from "why is this server malfunctioning" (which it's not) to "why is the communication being lost between these two".Alk, what info would be useful to look at to see why Erlang might be losing these connections?
fred <lave...@gmail.com> wrote:
Hi Aliaksey,
Is there a way for me to remsh into the membase erlang node? Wanted
to run etop to get more insight. I couldn't find a way.
How should I send you the iptables and ifconfig save output?
Thanks!
On Sep 20, 12:02 pm, Aliaksey Kandratsenka <alkondrate...@gmail.com>
wrote:
> On Tue, Sep 20, 2011 at 9:39 PM, Perry Krug <pe...@couchbase.com> wrote:
> > Is this running in Amazon or on your own hardware?
>
> > Yes, the UI is a valid place for looking at this data, but it's only one
> > perspective. If the nodes are unable to communicate with each other, that
> > doesn't necessarily mean that your clients are also unable to (hence my
> > question about your application). We've seen a few cases where some delay
> > in the cluster management side of things can cause this, but does not impact
> > the actual functioning of the software. While we certainly want to resolve
> > that, the process of diagnosis changes from "why is this server
> > malfunctioning" (which it's not) to "why is the communication being lost
> > between these two".
>
> > *Alk*, what info would be useful to look at to see why Erlang might be
> > losing these connections?
>
> Very weird stuff. Just in case lets grab iptables-save output. And
> information about traffic. Which can be done by sampling ifconfig output.
> I'd like to get info for queue sizes, but not aware of out of the box way.
>
>
>
>
>
>
>
How should I send you the iptables and ifconfig save output?