Hi,
We have a manually configured DAG for a Hazelcast job, with one distributed node like this:
dag.edge(from(vertexA).to(vertexB).distributed()
.partitioned(m -> m.getId()));
dag.edge(from(vertexB).to(vertexC));
We just noticed in production that it took more than 50 seconds in one case between the message got from vertexA to vertexB - even though it didn't have to travel to another node, all happened just in a single node.
The node was under some load at that moment caused by another running job, but I don't see any direct reason why it should be slowing down the first job so much. My question - what can slow down "the journey" of an item via distributed edges? I undertand that e.g. busy cooperative threads - anything else?
Martin