This is common error when all php workers are busy. The most likely scenario
is all your workers are waiting for a database response. You can try out
php-fpm-0.5.9-rc1 with option request_slowlog_timeout=1 to find exact place
in php code where hangs took place. Alternatively, you can use xdebug or any
other profiling tool to find out why your scripts executes so long.
> 4534#0: *2948650 upstream prematurely closed connection while reading
> response header from upstream, client: ...., upstream: "fastcgi://
> 127.0.0.1:9000",
This is failure of php to serve fastcgi request. Please check your php-fpm.log
for possible explanations.
> When I lowered the worker_connections in nginx to 512 I got:
> [alert] 4533#0: *2460021 1024 worker_connections is not enough while
> connecting to upstream, client: .... request: "GET ... HTTP/1.1",
> upstream: "fastcgi://127.0.0.1:9000",
>
>
>
>
>
> Thanks for any kind of input. And if you need more stats or logs just
> ask :)
--
Andrei Nigmatulin
GPG PUB KEY 6449830D
Now I lay me down to sleep(3)
Pray the OS my core to keep
If I die before I wake
Pray the Disk my core to take
This is the most important information in your case: this backtrace in theory
might help you to track down slow sql request, which executes too long and
can cause connection timeout errors.
> Jun 05 00:10:01.129184 pid 18006 (pool default)
> script_filename = /...../index.php
> [0xbfa6ccb0] session_start() /..../index.php:4
>
> Jun 05 00:10:01.022224 [NOTICE] fpm_children_bury(), line 188: child
> 17837 stopped for tracing
> Jun 05 00:10:01.022278 [NOTICE] fpm_php_trace(), line 331: about to
> trace 17837
> Jun 05 00:10:01.022465 [ERROR] fpm_trace_get_strz(), line 225:
> ptrace(PEEKDATA) failed: Input/output error (5)
> Jun 05 00:10:01.022544 [NOTICE] fpm_php_trace(), line 359: finished
> trace of 17837
> Jun 05 00:10:01.022612 [NOTICE] fpm_got_signal(), line 48: received
> SIGCHLD
> Jun 05 00:10:01.022685 [NOTICE] fpm_got_signal(), line 48: received
> SIGCHLD
>
> Hopefully you can help me clear out why they break (or where at
> least), and why the dump fails?
There are several reasons for that, I will investigate what might be in your
case.
>
> fpm conf:
> max_children 300
> request_slowlog_timeout 10s
--
http://wiki.codemongers.com/NginxHttpStubStatusModule