| We have a fairly large farm of roughly 100 machines holding about 10 PB of data. About 80 of them are Sun Fire X4200s and the rest are Dells. Running puppet apply on the Suns crashes the kernel about half the time. The servers run Debian 8 with a 3.16.0-4-amd64 kernel. I've attached the dump. The crash seems to happen in get_empty_filp() during a do_sys_open() call. Searching Google turns up no hints except a possible security hole in that kernel. We'd rather not upgrade some of the boxes, as they run C code compiled against the kernel, but we can do it if we have to. |