Il 16/12/2009 14.57, Lewis G Rosenthal ha scritto:
> On 12/16/09 05:27 am, Massimo thus wrote :
>> another crash
>>
>> LIBC PANIC!!
>> fmutex deadlock: Owner died!
>> 0x0027013c: Owner=0x99850006 Self=0x99850001 fs=0x3 flags=0x0 hev=0x00010012
>> Desc="LIBC Heap"
>> pid=0x9985 ppid=0x98f5 tid=0x0001 slot=0x00ee pri=0x0200 mc=0x0000
>> D:\APACHE\BIN\HTTPD.EXE
>> Process dumping was disabled, use DUMPPROC / PROCDUMP to enable it.
>>
>>
> <snip>
>
> Assuming there is no data corruption or other issue reading data from
> the volume, *something* must have changed. If you can't figure out what,
> then restore from a backup.
nothing changed, just two virtual hosts addd
no HD crc or problems
since it's a logical unit RAID1 mirroring on a LSI Logic 150-6 sata raid
i activated debug-log and i've seen some erros on reverse-proxy feature
since, lucklily, at the moment i don't need it
i've disabled reverse-proxy feature
now it works better, but see just a little bit better
since after server reboot it's allways quite difficult to get it running again
could the debug-log be of any interest or helpful?
thanks really a lot
massimo
Hi Massimo,
Just a few comments that might help you track this down.
>>> another crash
Not really. The LIBC PANIC messages are a side effect of something else
failing.
>>> LIBC PANIC!!
>>> fmutex deadlock: Owner died!
>>> 0x0027013c: Owner=0x99850006 Self=0x99850001 fs=0x3 flags=0x0 hev=0x00010012
>>> Desc="LIBC Heap"
>>> pid=0x9985 ppid=0x98f5 tid=0x0001 slot=0x00ee pri=0x0200 mc=0x0000
What this says is that a thread (TID 1) wanted access to the semaphore
that controls one of the heaps and could not get access because the
semaphore owner (TID 6) died while it owned the semaphore.
I suspect there is a message in the logs indicating why TID 6 died. Can
you find it?
>since after server reboot it's allways quite difficult to get it running
>again
What do you mean by this?
>could the debug-log be of any interest or helpful?
What would be interesting is to see the "missing" mesasge about what
happened to TID 6. While it's possible, it is unusual for code in LIBC to
die silently.
Since the error is in LIBC, perhaps you can convince Paul to do a build of
LIBC603 with logging enabled in the heap management code.
Steven
--
----------------------------------------------------------------------
"Steven Levine" <ste...@earthlink.net> eCS/Warp/DIY etc.
www.scoug.com www.ecomstation.com
----------------------------------------------------------------------
Il 24/12/2009 21.14, Steven Levine ha scritto:
> In <4B3368F...@ecomstation.it>, on 12/24/09
> at 02:13 PM, Massimo <m...@ecomstation.it> said:
>
> Hi Massimo,
>
> Just a few comments that might help you track this down.
>
>>>> another crash
>
> Not really. The LIBC PANIC messages are a side effect of something else
> failing.
>
>>>> LIBC PANIC!!
>>>> fmutex deadlock: Owner died!
>>>> 0x0027013c: Owner=0x99850006 Self=0x99850001 fs=0x3 flags=0x0 hev=0x00010012
>>>> Desc="LIBC Heap"
>>>> pid=0x9985 ppid=0x98f5 tid=0x0001 slot=0x00ee pri=0x0200 mc=0x0000
>
> What this says is that a thread (TID 1) wanted access to the semaphore
> that controls one of the heaps and could not get access because the
> semaphore owner (TID 6) died while it owned the semaphore.
>
> I suspect there is a message in the logs indicating why TID 6 died. Can
> you find it?
nothing about tid 6 in the logs :(
>> since after server reboot it's allways quite difficult to get it running
>> again
>
> What do you mean by this?
when apache "die" in this way i find easy to reboot the server
after the reboot apache "die" again in the same way
a rexx fault-daemon detect that the process HTTPD isn't running
so it try to restart it automatically (it check every 5 minutes)
after a minimum of 3 o 4 restarts apache start to "live" again
don't ask me why this happens
it's a total mistery for me
>> could the debug-log be of any interest or helpful?
>
> What would be interesting is to see the "missing" mesasge about what
> happened to TID 6. While it's possible, it is unusual for code in LIBC to
> die silently.
>
> Since the error is in LIBC, perhaps you can convince Paul to do a build of
> LIBC603 with logging enabled in the heap management code.
>
> Steven
i switched down the reverse provy feature since i don't need it anymore
apache seems to work better, but it was just a "feeling"
the problems are still the same than before...
thanks
bye
massimo s.