New issue 218 by poberhol...@google.com: errors running ganeti jobs if
/proc/sys/fs/inotify/max_user_watches is too low
http://code.google.com/p/ganeti/issues/detail?id=218
What software version are you running? Please provide the output
of "gnt-cluster --version" and "gnt-cluster version".
Software version: 2.4.5
Internode protocol: 2040000
Configuration format: 2040000
OS api version: 20
Export interface: 0
What distribution are you using?
Debian Squeeze 6.0
What steps will reproduce the problem?
1. lower inotify max_user_watches
(echo 4000 > /proc/sys/fs/inotify/max_user_watches)
2. run gnt-cluster verify
(if no error appears, lower max_user_watches even more)
What is the expected output? What do you see instead?
Ganeti should warn about a too low max_user_watches
error reported instead:
Error checking job status: Job with id 6311 lost
Warning that inotifywatch /tmp reports if the number is too low:
Failed to watch /tmp/; upper limit on inotify watches reached!
Please increase the amount of inotify watches allowed per user via
`/proc/sys/fs/inotify/max_user_watches'.
Please provide any additional information below.
I have Crashplan installed on the same machine as Ganeti. Crashplan uses
inotify as well and already increased the default of 8192 to 16384 in
/etc/sysctl.conf.
Can Crashplan exclude a specific directory? Excluding /var/lib/ganeti/queue
would seem sensible. There's a lot of I/O with atomically replaced files
going on.
Crashplan can exclude that directory, yes. But it doesn't help the fact
that it will crash when max_user_watches is set too low.
Comment #3 on issue 218 by han...@google.com: errors running ganeti jobs if
/proc/sys/fs/inotify/max_user_watches is too low
http://code.google.com/p/ganeti/issues/detail?id=218
Yes, we will look into improving error reporting and handling.