Hello,
Our setup consists of 10 workers with 15 agents each. We run Ansible + Mitogen on the agents. Currently, we have a problem with the combination of the go-agent and Mitogen.
Mitogen itself is a tool to speed up Ansible runs by "tunneling" multiple tasks over one SSH connection.
If we use it on the worker directly on the CLI, without the agent, it runs very well:
Basic Ansible: ~5 min
Ansible + Mitogen: ~1.5 min
Ansible + Mitogen + Go-agent (expected): ~2 min
Ansible + Mitogen + Go-agent (currently): ~10 - 30 min
Now, if we start Ansible with Mitogen enabled inside the go-agent, the runtime is significantly longer than the basic run.
Some runs slow down to 10 - 30 min, which is highly unusual since they should only take 2 - 5 min. Run directly on the CLI, the same playbook is as fast as expected.
Strangely, this was not the case from the beginning. It only started after an incident that forced us to put all 150 agents under load at once.
We already reinstalled Ansible, Mitogen and the go-agent itself, but the degraded performance persists.
I hope somebody can help with how to debug this further, since the last resort would be to completely reinstall all the worker nodes.
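In case it helps to narrow things down, one idea is to run the same command through a small wrapper once from the CLI and once as a go-agent task, and then diff the two reports. The following is only a rough sketch: the function names and the selection of environment variables and resource limits are assumptions about what might differ, not something we have confirmed to be the cause.

```python
import json
import os
import resource
import subprocess
import sys
import time

# Prefixes of environment variables that seem worth comparing (an assumption).
ENV_PREFIXES = ("ANSIBLE_", "SSH_", "HOME", "PATH", "LANG", "LC_")


def collect_context():
    """Gather facts that can differ between a login shell and an agent process."""
    return {
        "uid": os.getuid(),
        "cwd": os.getcwd(),
        "stdin_is_tty": sys.stdin.isatty(),
        # Soft and hard limits for open files and processes.
        "nofile": list(resource.getrlimit(resource.RLIMIT_NOFILE)),
        "nproc": list(resource.getrlimit(resource.RLIMIT_NPROC)),
        "env": {
            key: value
            for key, value in sorted(os.environ.items())
            if key.startswith(ENV_PREFIXES)
        },
    }


def timed_run(command):
    """Run the command and return its exit code and wall-clock duration."""
    start = time.monotonic()
    result = subprocess.run(command, check=False)
    elapsed = time.monotonic() - start
    return {
        "command": command,
        "exit_code": result.returncode,
        "seconds": round(elapsed, 2),
    }


if __name__ == "__main__":
    report = collect_context()
    # Everything after the script name is treated as the command to time.
    command = sys.argv[1:]
    if command:
        report.update(timed_run(command))
    print(json.dumps(report, indent=2, sort_keys=True))
```

Saved under a hypothetical name such as `run_report.py`, it would be called as `python3 run_report.py ansible-playbook <playbook>` in both places. Differences in the limits, the TTY flag, or the `SSH_*` variables would at least give a starting point.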
Regards