Hi,
I have experienced a process crash on a node, that caused the entire node to go down
From the log in the console, I have
Port server ns_server on node 'babysitte...@127.0.0.1' exited with status 134. Restarting. Messages: Apache CouchDB 1.2.0a-386be73-git (LogLevel=info) is starting.
Apache CouchDB has started. Time to relax.
working as port
/opt/couchbase/lib/erlang/lib/os_mon-2.2.7/priv/bin/memsup: Erlang has closed.
Erlang has closed
Crash dump was written to: erl_crash.dump.1384711092.3044
eheap_alloc: Cannot allocate 1459620480 bytes of memory (of type "heap"). ns_log000 ns...@production.couchbase.node.8 12:39:37 - Wed Nov 20, 2013
Node 'ns...@production.couchbase.node.8' synchronized otp cookie nhntjjqiovvnjcvf from cluster ns_cookie_manager002 ns...@production.couchbase.node.8 12:39:37 - Wed Nov 20, 2013
After the crash, the node was unresponsive, and I had to kill and restart server on node to make it join the cluster which it did immediately. The reason for the server to become unresponsive was probably that a couchbase process was using all available cpu
I'm running a cluster of 3 nodes on aws on m1.large instances with the following specs
General purpose m1.large 64-bit 2 vcpu 4 ecu 7.5 gigs 2 x 420 gb
I'm running a single replication to another cluster
I have allocated 17.6 gigs in total for the cluster and 10.1 gigs of these are used. All together the machines does have 22.5 gigs of memory.
Does anybody have a suggestion to a possible workaround or should I try to file an issue
I'm currently not allowed to post questions in the community forums for couchbase, I'm just getting a message saying "access denied". Anyone experienced this
Thanks
Niels
--
BinaryConstructors ApS
Vestergade 10a, 4th
1456 Kbh K
Denmark
phone:
+4529722259web:
http://www.binaryconstructors.dkmail:
n...@binaryconstructors.dk
skype: nielsboldt