Your profiling results suggests that most of the threads are in
ethr_event_wait. With that many cores and higher inter-node traffic,
you can try configuring the VM to use more I/O threads (`rabbitmqctl status`
suggests you use the default 30), probably as high as 60-90.
Have you noticed significantly higher than before I/O or network activity?
Do you use mirroring? 3.5.x introduces flow control between nodes. There
will be a way to disable it in 3.5.2 but this only really matters
if mirroring is in place. It reduces throughput somewhat, and increases
inter-node traffic, but makes sure that mirrors don't fall behind master
and begin consuming a lot of RAM.
--
MK
Staff Software Engineer, Pivotal/RabbitMQ