Correct, I am testing CNS pub sub as I assumed I would get more performance given the supposed transient nature of CNS. This has not been the case, neither has the transiency. I struggle to understand how CNS could be made to work by using CQS anyway, unless CQS has a non-persistent flavor which would bring into question the entire existence of CNS? Anyways. I think I am using cqs endpoints. UseHttp = false; by default and :
My Message size is small, 120 chars or so, with overall bandwidth in and out of my dev box sitting at just under 15Mb/s. The latency stat is the difference between the time serialized on publish and deserialized on subscribe.
I have split up the Queue subscribers how ever I want. 3 queues 1 pub, 1 sub per queue. or 4 queues 2 pub, 2 sub per queue etc... any combination of those make absolutely no difference. Well it does, the higher those numbers the less combined (aggregated over all queues and subs) msgs/s I usually get.
I get the occasional error in CMB's logs, none elsewhere.
Node 10.0.0.150 does seem to have some issues. I cannot for example install the OPS center agent on it. It just won't take. Node 10.0.0.142 displays strange resource usages characteristics on Linux. Lots of "system cpu" time. I will have to investigate that further. OPS center also reports that node 142 is under load sometimes. I set the replication factor on the keyspaces to 3. Combined with cmb.cassandra.readConsistencyLevel=QUORUM and cmb.cassandra.writeConsistencyLevel=QUORUM there might also be my issue. Not entirely sure what those must be.
The 127.0.1.1 node is an interesting case. I have a nginx load balancer on that box (10.0.0.139), which means that the cmb instance's ports were moved to 6060 and 6062. For some reason that jams the IP to 127.0.1.1 and the node generally fails to show up like the rest do. I load balance all cmb instances and all cmb's connections to the 3 cassandra nodes. The only affect of this had was lower message latency.
I have attached some stats. 50/50 width running and not running.
I will do some more digging, there are lots of things I can still try to tweak.
Thanks for the great feedback. You have given me a few ideas to work on.