Thanks for the help, unfortunately I still haven't gotten these nodes booting. Only in the office once in awhile and some of this debugging requires direct eyes/hands-on work.... ugh.
We've tried various serial boards but no luck on those (have "official" dell serial boards on order from china someplace, few weeks out)...
I did get the vga consoles booting and it goes through PXE, and then hangs at: "Emulab looking for control net among: ..."
here are the various log entries from the boot (tries to frisbee, but never loads MFS to frisbee...):
power.log:
Apr 22 17:39:18 boss power[35814]: [root] on: pc13
dhcpd.log:
Apr 22 17:39:37 boss dhcpd: DHCPDISCOVER from f8:bc:12:85:e8:95 via em0
Apr 22 17:39:37 boss dhcpd: DHCPOFFER on 192.168.10.22 to f8:bc:12:85:e8:95 via em0
Apr 22 17:39:37 boss dhcpd: DHCPREQUEST for 192.168.10.22 (192.168.10.3) from f8:bc:12:85:e8:95 via em0
Apr 22 17:39:37 boss dhcpd: DHCPACK on 192.168.10.22 to f8:bc:12:85:e8:95 via em0
stated.log:
Apr 22 17:39:40 [1515]: OBJTYPE='TBNODESTATE', OBJNAME='pc13', EVENTTYPE='PXEBOOTING'
Apr 22 17:39:40 [1513]: pc13: RELOAD/SHUTDOWN => RELOAD/PXEBOOTING
Apr 22 17:39:40 [1513]: pc13: Mode change RELOAD => PXEKERNEL forced
Apr 22 17:39:40 [1513]: pc13: RELOAD/PXEBOOTING => PXEKERNEL/PXEBOOTING
Apr 22 17:39:40 [1515]: OBJTYPE='TBNODESTATE', OBJNAME='pc13', EVENTTYPE='BOOTING'
Apr 22 17:39:40 [1513]: pc13: PXEKERNEL/PXEBOOTING => PXEKERNEL/BOOTING
Apr 22 17:39:40 [1513]: pc13: BootWhat says 10007 (mode RELOAD).
Apr 22 17:39:40 [1513]: pc13: Mode change PXEKERNEL => RELOAD forced
Apr 22 17:39:40 [1513]: pc13: PXEKERNEL/BOOTING => RELOAD/BOOTING
Apr 22 17:39:40 [1513]: HEAD: pc13 in 180, queue=1
bootinfo.log:
Apr 22 17:39:40 boss bootinfo[1486]: tbdb: query failed: MySQL server has gone away
Apr 22 17:39:40 boss bootinfo[1486]: Lost connection to DB; Attempting to reconnect ...
Apr 22 17:39:40 boss bootinfo[1486]:
192.168.10.22: REQUEST (vers 1)
Apr 22 17:39:40 boss bootinfo[1486]:
192.168.10.22: REPLY(1): boot from mfs /tftpboot/frisbee
tftpd.log:
Apr 22 17:39:40 boss tftpd[35820]:
192.168.10.22/2748: RRQ for /tftpboot/frisbee/boot/loader.rc.gz (remapped to /fri sbee/boot/loader.rc.gz)
Apr 22 17:39:40 boss tftpd[1889]: pid 35820 exits, numchildren=0
Apr 22 17:39:40 boss tftpd[35821]:
192.168.10.22/2749: RRQ for /tftpboot/frisbee/boot/loader.conf.gz (remapped to /f risbee/boot/loader.conf.gz)
Apr 22 17:39:40 boss tftpd[1889]: pid 35821 exits, numchildren=0
Apr 22 17:39:40 boss tftpd[35822]:
192.168.10.22/2750: RRQ for /tftpboot/frisbee/boot/kernel.gz (remapped to /frisbe e/boot/kernel.gz)
Apr 22 17:39:41 boss tftpd[1889]: pid 35822 exits, numchildren=0
Apr 22 17:39:41 boss tftpd[35823]:
192.168.10.22/2751: RRQ for /tftpboot/frisbee/boot/mfsroot.gz (remapped to /frisb ee/boot/mfsroot.gz)
Apr 22 17:39:42 boss tftpd[1889]: pid 35823 exits, numchildren=0
Thought maybe it was something in how I added the nodes and the database got messed up or ports not assigned correct and switchmac was messed up so I deleted a node and have it waiting in newnode, but switchmac fails, because switchmac is getting leakage from someplace and 2 other switches are reporting the macs from the new node (as well as listing 2 other mac addrs, one is ops on vlan 2, and one is a monitor pc we have on vlan 2, is the a way to add mac addrs in an ignore list? still haven't figured out the bleedover issue, rechecked all switch configs and found some inconsistencies, but nothing fixed switchmac yet.....)
Thoughts? ideas? - how can we get these to boot and play nice without serial consoles?