We occasionally encountered cases where one ubic-watchdog process hung and all subsequent ubic-watchdog processes hung too, because they all waited on the same lock file.
After some time, such a host would run completely out of memory.
Because of this problem, I implemented a separate non-blocking lock (see https://github.com/berekuk/ubic/issues#issue/1).
Since then we have encountered this problem several more times, so this update includes new precautions against it.
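For readers unfamiliar with the idea, here is a minimal sketch of a non-blocking lock. It is written in Python for illustration only and is not Ubic's actual Perl code; the function name try_lock and the lock file name are made up for this example. The point is that a process which cannot get the lock gives up immediately instead of queueing behind a hung one.

```python
import fcntl
import os
import tempfile


def try_lock(path):
    """Try to take an exclusive lock on path without blocking.

    Returns the open file object on success (keep it open to hold the lock),
    or None if the lock is already held through another open file.
    try_lock is a hypothetical helper, not part of Ubic.
    """
    fh = open(path, "a")
    try:
        # LOCK_NB makes flock fail right away instead of waiting.
        fcntl.flock(fh, fcntl.LOCK_EX | fcntl.LOCK_NB)
    except BlockingIOError:
        fh.close()
        return None
    return fh


if __name__ == "__main__":
    # Hypothetical lock file name, chosen only for this example.
    lock_path = os.path.join(tempfile.gettempdir(), "watchdog-example.lock")
    lock = try_lock(lock_path)
    if lock is None:
        print("another watchdog is still running, exiting")
    else:
        print("lock acquired, running checks")
        lock.close()  # closing the file releases the lock
```

With a blocking lock, every new process would wait forever behind a hung one; with this approach it exits at once, so stuck processes do not pile up.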
1.23 changelog:
* ubic-ping: 404 message content fixed
* ubic-watchdog:
- set check_timeout alarm a bit earlier
- another alarm before compiling services
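
The alarm precaution in the ubic-watchdog entries above can be sketched roughly like this. Again, this is illustrative Python under my own assumptions, not the Perl code in ubic-watchdog; run_with_alarm and CheckTimeout are hypothetical names. An alarm is armed before a step that might hang, so the process is interrupted instead of waiting indefinitely.

```python
import signal
import time


class CheckTimeout(Exception):
    """Raised when the guarded step takes too long (hypothetical name)."""


def _on_alarm(signum, frame):
    raise CheckTimeout("step timed out")


def run_with_alarm(func, timeout):
    """Run func(), raising CheckTimeout if it takes longer than timeout seconds.

    Must be called from the main thread, because it installs a signal handler.
    """
    old_handler = signal.signal(signal.SIGALRM, _on_alarm)
    signal.alarm(timeout)  # arm the alarm before the risky step starts
    try:
        return func()
    finally:
        signal.alarm(0)  # disarm the alarm
        signal.signal(signal.SIGALRM, old_handler)


if __name__ == "__main__":
    try:
        run_with_alarm(lambda: time.sleep(3), 1)
    except CheckTimeout:
        print("step timed out, giving up")
```

Arming the alarm before the step begins is what matters here: anything that runs before the alarm is set can still hang without a limit.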
1.22 changelog (I forgot to announce it last time):
* Ubic::Multiservice: default implementation of 'has_simple_service' method
--
Vyacheslav Matyukhin
http://github.com/berekuk
http://friendfeed.com/mmcleric
http://berekuk.ru