On Wednesday, July 17, 2013 12:00:13 PM UTC-5, yannig rousseau wrote:
Hi,
3 days ago, we had a production issue where the puppetmaster became unreachable for 20 minute.
All of the puppet clients which tried to connect at this time are now failing : "Run of Puppet configuration client already in progress; skipping"
Further investigation show that the puppet agent process is still running on all of this clients 3 days later. I tried to kille the process on a machine, and puppet service came back to normal for this machine.
Is there a way to kill the process on all of the client ?
Puppet does not provide a built-in mechanism for this. There is a variety of tools that could do it, but you would have to had already set them up.
Is there a way to specify a timeout on the process ? This would permit to discard the process without human action
Again, Puppet does not provide this as a built-in feature, but it should be possible to add external instrumentation to make it happen. That would probably be facilitated by launching the agent periodically via a scheduler (such as cron) instead of running it in daemon mode.
John