Hi All.
I have a 4-node RabbitMQ 3.5.6 cluster with an uptime of 6 days, and I use Zabbix to monitor all nodes. Yesterday Zabbix started flapping, repeatedly alerting that one node was dead, although that node has the same uptime as the others and works well. Today I tried to resolve the issue and found a lot of connections in the CLOSE_WAIT state (124) from the local Zabbix agent to the local RabbitMQ, and that the node's other_proc memory is far too high: 4.7G (about 200M on the other nodes).
Of course, I can "solve" this problem by rebooting the node, but I want to understand what happened.
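For anyone who wants to reproduce the CLOSE_WAIT count, here is a minimal sketch in Python. It assumes the text comes from `ss -tan` (which prints the state as `CLOSE-WAIT`). The function name, the sample lines, and the choice of port 5672 (the amqp listener from the status output) are only illustrative assumptions; the port the Zabbix agent actually talks to may be different.

```python
def count_close_wait(ss_output, port):
    """Count CLOSE-WAIT sockets in `ss -tan` text that involve `port`."""
    count = 0
    for line in ss_output.splitlines():
        fields = line.split()
        # Expected columns: State Recv-Q Send-Q Local:Port Peer:Port
        if len(fields) < 5 or fields[0] != "CLOSE-WAIT":
            continue
        # Split on the last colon so IPv6 addresses are handled too.
        local_port = fields[3].rsplit(":", 1)[-1]
        peer_port = fields[4].rsplit(":", 1)[-1]
        if str(port) in (local_port, peer_port):
            count += 1
    return count


# Hypothetical sample, not taken from the affected node.
SAMPLE = """\
State      Recv-Q Send-Q Local Address:Port  Peer Address:Port
ESTAB      0      0      127.0.0.1:5672      127.0.0.1:40112
CLOSE-WAIT 1      0      127.0.0.1:5672      127.0.0.1:40120
CLOSE-WAIT 1      0      127.0.0.1:5672      127.0.0.1:40121
CLOSE-WAIT 0      0      127.0.0.1:22        10.0.0.5:51000
"""

if __name__ == "__main__":
    print(count_close_wait(SAMPLE, 5672))
```

On a real node the input would be the captured output of `ss -tan` rather than the sample string.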
...
# service rabbitmq-server status
Status of node rabbit@rabbitmq4 ...
[{pid,8797},
{running_applications,
[{rabbitmq_auth_backend_ldap,"RabbitMQ LDAP Authentication Backend",
"3.5.6"},
{eldap,"Ldap api","1.2"},
{lager,"Erlang logging framework","2.0.0-rmq3.5.x-git9719370"},
{rabbitmq_management,"RabbitMQ Management Console","3.5.6"},
{rabbitmq_management_agent,"RabbitMQ Management Agent","3.5.6"},
{rabbit,"RabbitMQ","3.5.6"},
{mnesia,"MNESIA CXC 138 12","4.13.1"},
{amqp_client,"RabbitMQ AMQP Client","3.5.6"},
{rabbitmq_web_dispatch,"RabbitMQ Web Dispatcher","3.5.6"},
{webmachine,"webmachine","1.10.3-rmq3.5.6-gite9359c7"},
{mochiweb,"MochiMedia Web Server","2.7.0-rmq3.5.6-git680dba8"},
{xmerl,"XML parser","1.3.8"},
{inets,"INETS CXC 138 49","6.0.1"},
{os_mon,"CPO CXC 138 46","2.4"},
{sasl,"SASL CXC 138 11","2.6"},
{stdlib,"ERTS CXC 138 10","2.6"},
{kernel,"ERTS CXC 138 10","4.1"}]},
{os,{unix,linux}},
{erlang_version,
"Erlang/OTP 18 [erts-7.1] [source] [64-bit] [smp:4:4] [async-threads:128] [kernel-poll:true]\n"},
{memory,
[{total,5080348064},
{connection_readers,418232},
{connection_writers,589992},
{connection_channels,1991272},
{connection_other,825408},
{queue_procs,2257080},
{queue_slave_procs,11272448},
{plugins,339576},
{other_proc,4867926208},
{mnesia,410944},
{mgmt_db,12216},
{msg_index,813200},
{other_ets,1306960},
{binary,23844248},
{code,20755419},
{atom,711569},
{other_system,146873292}]},
{alarms,[]},
{listeners,[{clustering,25672,"::"},{amqp,5672,"::"}]},
{vm_memory_high_watermark,0.85},
{disk_free_limit,1682818253},
{file_descriptors,
[{total_limit,10140},
{total_used,41},
{sockets_limit,9124},
{sockets_used,11}]},
{processes,[{limit,1048576},{used,631}]},
{run_queue,0},
{uptime,521353}]
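
To put the memory section above in proportion, here is a small Python sketch that ranks the categories by their share of the total. The byte values are copied from the status output; the dictionary and function names are just illustrative.

```python
# Byte counts copied from the {memory, [...]} section of the status output.
MEMORY = {
    "connection_readers": 418232,
    "connection_writers": 589992,
    "connection_channels": 1991272,
    "connection_other": 825408,
    "queue_procs": 2257080,
    "queue_slave_procs": 11272448,
    "plugins": 339576,
    "other_proc": 4867926208,
    "mnesia": 410944,
    "mgmt_db": 12216,
    "msg_index": 813200,
    "other_ets": 1306960,
    "binary": 23844248,
    "code": 20755419,
    "atom": 711569,
    "other_system": 146873292,
}
TOTAL = 5080348064


def memory_shares(memory, total):
    """Return (category, fraction of total) pairs, largest first."""
    shares = [(name, size / total) for name, size in memory.items()]
    return sorted(shares, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for name, fraction in memory_shares(MEMORY, TOTAL)[:3]:
        print(f"{name:20s} {fraction:6.1%}")
```

With these numbers other_proc accounts for more than 95% of the node's memory, so whatever is leaking lives in Erlang processes that RabbitMQ does not attribute to connections, queues, or plugins.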