queue_top.pl is the old one. Use gearman_top instead. Do you see any worker connected to your gearmand?
Are the queues filling up?
2 for check_results is 1 too much. Nagios usually only starts a single worker for the results queue.
Seems like there is a results worker which takes away all the results, which then leads to the
"results never came back" never came back messages.
Is there an orphaned nagios process running or something like that?
Could you try to stop Nagios and move the retention.dat away so you have a clean start.
check_gearman is able to monitor the check counter for the worker. Another idea is to
increase the loglevel from the worker. You should then see the jobs coming in.
Regarding the documentation: Help is always welcome :-)
Its not easy for me to guess what a new user expects or what pitfalls they can avoid.
Sven
This usually means a problem with encryption. There should be messages
like "discarding invalid job" or something like that. Make sure your encryption
settings from the worker match the ones from the server. Password, Encryption enabled etc...
Have you set the logfile option?
Ah i mixed that up. mod_gearman logs to the nagios.log. But don't do that (at least not in production), because Nagios is not thread safe and will segfault with a version <= 3.2.3. Hopefully the next nagios is better. They are working on that problem. The logfile option is only for the worker.
How do you start your worker? Its in the same config file as the encryption settings for your worker.
The gearman client provided by the gearman package itself? Thats probably just a sample client. You have to start the mod_gearman_worker provided by mod_gearman. This would explain everything...
I think a small step by step instruction would be really helpful sometimes...