I have a SiCortex 1458. We moved to a new facility, which meant
changing IPs from 172.16.x.x to 10.10.x.x. I changed the IP on the
head node, and the external compute nodes used DHCP. I then ran sinfo:
PARTITION AVAIL TIMELIMIT NODES STATE NODELIST
sci       up    infinite  243   down* sci-m0n[0-26],sci-m1n[0-26],sci-m2n[0-26],sci-m3n[0-26],sci-m4n[0-26],sci-m5n[0-26],sci-m6n[0-26],sci-m7n[0-26],sci-m8n[0-26]
sci-comp  up    infinite  239   down* sci-m0n[0,2-5,7-26],sci-m1n[0,2-26],sci-m2n[0-26],sci-m3n[0-26],sci-m4n[0-26],sci-m5n[0-26],sci-m6n[0-26],sci-m7n[0-26],sci-m8n[0-5,7-26]
sci-ok    up    infinite  186   down* sci-m0n[0,2-5,7-26],sci-m2n[0-26],sci-m3n[0-26],sci-m4n[0-26],sci-m6n[0-26],sci-m7n[0-26],sci-m8n[0-5,7-26]
When I try to run scboot, I get a lot of errors that all seem to
involve blade 8 (sci-msp8). Any thoughts? I looked through some
SiCortex documentation I found on a DVD image I pulled from
http://mirror.anl.gov/pub/sicortex/isos/V3.1/. The scboot output:
sicortex-ssp ~ # scboot
/var/state/route_info.sci checks out OK!
Booting partition: sci
Checking Module Service Processors
unrecognized num Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect()
poll failed (msp0 (sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))'], rev Diagcomm failure: ['MSP', 0, -1,
'diagcomm_connect() poll failed (msp0 (sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Warning: inconsistent board speeds!
sci-msp0: 2101-03, rev 06: B1-633MHz-capable (2)
sci-msp1: 2101-03, rev 06: B1-633MHz-capable (2)
sci-msp2: 2101-03, rev 06: B1-633MHz-capable (2)
sci-msp3: 2101-03, rev 06: B1-633MHz-capable (2)
sci-msp4: 2101-03, rev 06: B1-633MHz-capable (2)
sci-msp5: 2101-03, rev 06: B1-633MHz-capable (2)
sci-msp6: 2101-03, rev 06: B1-633MHz-capable (2)
sci-msp7: 2101-03, rev 06: B1-633MHz-capable (2)
sci-msp8: Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll
failed (msp0 (sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))'], rev Diagcomm failure: ['MSP', 0, -1,
'diagcomm_connect() poll failed (msp0 (sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']: unrecognized num Diagcomm failure: ['MSP', 0, -1,
'diagcomm_connect() poll failed (msp0 (sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))'], rev Diagcomm failure: ['MSP', 0, -1,
'diagcomm_connect() poll failed (msp0 (sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))']
Diagcomm failure: ['MSP', 0, -1, 'diagcomm_connect() poll failed (msp0
(sci-msp8:1235))'] (1)
Reverting to lowest common denominator (1)
Creating boot configuration
Halting all nodes
scand unresponsive, try 0/5
scand unresponsive, try 1/5
scand unresponsive, try 2/5
scand unresponsive, try 3/5
scand unresponsive, try 4/5
scand connection failed
Halt of nodes on sci-msp8 failed
Caught signal, cleaning up.
sicortex-ssp ~ #
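
Since every failure above names sci-msp8:1235, one way to narrow this down is to check from the SSP whether each MSP name still resolves and accepts a TCP connection on that port after the IP change. Below is a minimal Python sketch of such a check, not a SiCortex tool. The host names sci-msp0 through sci-msp8 and port 1235 are taken from the scboot output; the function names (msp_hosts, probe, report) and the 2-second timeout are my own assumptions for illustration.

```python
import socket


def msp_hosts(count=9, prefix="sci-msp"):
    """Hypothetical helper: build the MSP host names seen in the scboot output."""
    return [f"{prefix}{i}" for i in range(count)]


def probe(host, port=1235, timeout=2.0, connect=socket.create_connection):
    """Try a plain TCP connect and describe the outcome as a short string.

    `connect` is injectable so the logic can be exercised without a network.
    """
    try:
        conn = connect((host, port), timeout)
    except socket.gaierror as exc:
        # The name did not resolve (e.g. stale /etc/hosts or DNS entries).
        return f"name lookup failed ({exc})"
    except OSError as exc:
        # The name resolved, but the connection was refused, timed out, etc.
        return f"connect failed ({exc})"
    conn.close()
    return "ok"


def report(hosts=None, port=1235):
    """Return one status line per MSP host."""
    if hosts is None:
        hosts = msp_hosts()
    return [f"{host}:{port} {probe(host, port)}" for host in hosts]
```

Running print("\n".join(report())) on the SSP would list one line per MSP. If only sci-msp8 shows a lookup or connect failure, that would point at that one module's address or link rather than at the head node configuration as a whole, though this only tests basic TCP reachability and says nothing about the diagcomm protocol itself.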