module issues

Narayan Desai

May 31, 2011, 1:01:46 PM
to sicorte...@googlegroups.com
Hi folks. We've recently had a power outage (expected, but nonetheless
harmful) which has caused a few of our modules to go out to lunch; the
modules don't power on. What is the best way to debug this sort of
issue? We've got some spares, but they don't seem to be doing any
good. Can midplane slots go bad?
-nld

Aaron Brooks

May 31, 2011, 1:12:14 PM
to sicorte...@googlegroups.com
Narayan,

Are the modules not powering on at all, or do the lights go on but the module is never seen by the scboot script (or by the SSP via other commands)?

The reason I ask is that there was a yucky race condition when swapping modules: the MSP (Module Service Processor) DHCP leases for the old modules basically created a lockout condition if the module slots weren't left unoccupied long enough for the leases to time out. This might be the case if the modules light up but never become available to the SSP/scboot.

If the modules do light up, sending me the DHCP leases file might help me diagnose the problem. I think the file is /var/lib/dnsmasq.leases or something like that. If you don't find it I can figure out where things actually are.
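For a quick look at the leases before sending the file, here is a minimal Python sketch. It assumes the standard dnsmasq lease-file layout (one lease per line: expiry as epoch seconds, MAC address, IP address, hostname, client ID) and that an expiry of 0 marks a lease that never expires. The function names are made up for illustration, and the path has to be whatever the leases file actually turns out to be on the SSP.

```python
import time
from typing import List, NamedTuple, Optional


class Lease(NamedTuple):
    expiry: int      # epoch seconds; 0 is treated as "never expires"
    mac: str
    ip: str
    hostname: str


def parse_leases(text: str) -> List[Lease]:
    """Parse dnsmasq-style lease lines, skipping anything that does not fit."""
    leases = []
    for line in text.splitlines():
        fields = line.split()
        # A lease line starts with a numeric expiry and has at least 4 fields.
        if len(fields) < 4 or not fields[0].isdigit():
            continue
        leases.append(Lease(int(fields[0]), fields[1].lower(), fields[2], fields[3]))
    return leases


def unexpired(leases: List[Lease], now: Optional[float] = None) -> List[Lease]:
    """Return the leases that are still held at time `now` (default: current time)."""
    now = time.time() if now is None else now
    return [l for l in leases if l.expiry == 0 or l.expiry > now]
```

Something like `unexpired(parse_leases(open(path).read()))` would then list the leases still held. A lease for a removed module that is still unexpired would be a candidate for the lockout described above.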

-Aaron

P.S. Midplane slots did have some cases of connector/pin smashing, but they were fairly rare (always take care to align modules properly and use gentle force when inserting them). The midplane is passive, with the exception of the global clock module, which affects all modules, so an electrical event should not harm the midplane itself in any way.


--
You received this message because you are subscribed to the Google Groups "SiCortex Users" group.
To post to this group, send email to sicorte...@googlegroups.com.
To unsubscribe from this group, send email to sicortex-user...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/sicortex-users?hl=en.


Lawrence Stewart

May 31, 2011, 1:34:20 PM
to sicorte...@googlegroups.com, Narayan Desai, Lawrence Stewart

What do you mean by "don't power on"? No lights? MSP works but nodes don't? If the MSP comes up,
there's quite a lot you can do.

I have never seen a midplane slot go bad.

-L

Narayan Desai

May 31, 2011, 2:01:45 PM
to sicorte...@googlegroups.com
Hi Aaron.
The lights are off altogether on two modules.
-nld

Narayan Desai

May 31, 2011, 2:05:52 PM
to sicorte...@googlegroups.com, Lawrence Stewart
(since my earlier mail bounced)

I've currently got two modules on our 5832 that have no lights at all.

Prior to this (I've spent the morning fiddling with the machine), I
was seeing PLL errors and "module has nodes without power" messages.
-nld

Narayan Desai

May 31, 2011, 3:00:38 PM
to sicorte...@googlegroups.com, Lawrence Stewart
And actually, is there any way to check what kind of power you're
getting out of the power subsystem? I'm beginning to wonder if the
system is underpowered. (We also have the old setup for 500 MHz CPUs;
we never got upgraded due to the bankruptcy.)
-nld

Lawrence Stewart

May 31, 2011, 3:04:56 PM
to Narayan Desai, Lawrence Stewart, sicorte...@googlegroups.com
There's a web or command line interface to the power subsystem that you can get voltages
and currents from. I think it uses bootp, so once it comes up it has an address for all time.

It should be set up for 48-50 volts, and not droop much under load.

Look in the dnsmasq configuration to figure out which one it is, should be on the control net.
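As a rough aid for that lookup, the sketch below pulls the static `dhcp-host=` entries out of a dnsmasq configuration so the power supply's entry can be spotted by name or address. It assumes the entries use the common comma-separated form (MAC, name, IPv4 address, in any order). The function name is hypothetical, and the actual config file location and the name given to the power supply on this system are not known here.

```python
import ipaddress
from typing import List, Optional, Tuple


def static_hosts(conf_text: str) -> List[Tuple[List[str], Optional[str]]]:
    """Return (fields, ip) for each dhcp-host= line; ip is None if no address is given."""
    hosts = []
    for raw in conf_text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line.startswith("dhcp-host="):
            continue
        fields = [f.strip() for f in line[len("dhcp-host="):].split(",")]
        ip = None
        for field in fields:
            try:
                ip = str(ipaddress.ip_address(field))
                break  # first field that parses as an address wins
            except ValueError:
                pass   # MAC addresses and hostnames land here
        hosts.append((fields, ip))
    return hosts
```

Each returned tuple carries the raw fields and the fixed address, if any. Once the power supply's address is known, the voltage it reports can be checked against the 48-50 volt range mentioned above.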

I think you can telnet to it, or use a web browser that supports Flash or Java or something. Why a power supply needs Flash/Java I never understood.

-L
