I also tried to boot using the DVD ISO but the node goes back to the
GRUB> prompt. Thanks
-M
-------------- next part --------------
An HTML attachment was scrubbed...
URL: https://lists.sdsc.edu/pipermail/npaci-rocks-discussion/attachments/20090128/d36551a0/attachment.html
in the past, we've seen this behavior on supermicro motherboards. if
you have one, then boot the node into BIOS and disable the floppy.
then, reinstall the node.
- gb
PXE.
- gb
I tried pxe and using the dvd iso.
-M
-----Original Message-----
From: npaci-rocks-dis...@sdsc.edu
[mailto:npaci-rocks-dis...@sdsc.edu] On Behalf Of Greg Bruno
Sent: Wednesday, January 28, 2009 3:49 PM
To: Discussion of Rocks Clusters
Subject: Re: [Rocks-Discuss] Compute node stuck at GRUB>
hard to say what is going on.
it only happens on a couple nodes, right?
if so, then you may want to try updating the BIOS on those nodes.
- gb
BTW,
I have an old frontend running 4.1. Could I use this old frontend and
just update it rather than installing a new frontend and run into these
headaches?
-----Original Message-----
From: npaci-rocks-dis...@sdsc.edu
[mailto:npaci-rocks-dis...@sdsc.edu] On Behalf Of Greg Bruno
Sent: Wednesday, January 28, 2009 4:07 PM
To: Discussion of Rocks Clusters
Subject: Re: [Rocks-Discuss] Compute node stuck at GRUB>
find /boot/grub/stage1
find /boot/grub/stage2
For example:
grub> find /boot/grub/stage1
(hd0,0)
(hd1,0)
If both of the find commands succeed, then it looks like grub was not
installed correctly. You can try the following in an attempt to
re-install grub on the MBR (make sure that the disk device matches your
setup -- the following example assumes a SATA drive on /dev/sda):
grub> device (hd0) /dev/sda
grub> root (hd0,0)
grub> setup (hd0)
Example output:
grub> device (hd0) /dev/sda
device (hd0) /dev/sda
grub> root (hd0,0)
root (hd0,0)
Filesystem type is ext2fs, partition type 0xfd
grub> setup (hd0)
setup (hd0)
Checking if "/boot/grub/stage1" exists... yes
Checking if "/boot/grub/stage2" exists... yes
Checking if "/boot/grub/e2fs_stage1_5" exists... yes
Running "embed /boot/grub/e2fs_stage1_5 (hd0)"... 16 sectors are
embedded. succeeded
Running "install /boot/grub/stage1 (hd0) (hd0)1+16 p
(hd0,0)/boot/grub/stage2 /boot/grub/grub.conf"... succeeded
Done.
If, however, the find commands fail, then either the grub software has
not been installed onto the HDD at all (unlikely), or grub is unable to
locate the /boot partition on the host. This might be because the BIOS
is not "presenting" the HDDs to grub or because the OS has been
installed on a drive that is not bootable -- that is, not visible to the
BIOS during POST. I've seen issues like this on systems with many HDDs
installed.
It's not much to go on, but it may point you in the right direction. You
definitely need to find a way to boot from a CD/DVD if you can, though.
That way you can boot a rescue environment and start poking around.
Regards,
Malcolm.
-------------- next part --------------
An HTML attachment was scrubbed...
Either do
$ rock set host pxeboot pxeboot="install" hostname
Or remove the host and reinsert it using
$ rocks remove host hostname
$ insert-ethers --update
$ insert-ethers --rack=racknum --rank=ranknum
Daniel.
* Hamilton, Scott L. <hamil...@mst.edu> [01/29/2009 09:57]:
Malcom,
I typed "find /boot/grub/stage1" and I got a "file not found". I tried to install grub by following your recommendation:
device (hd0) /dev/sda
root (hd0,0)
setup (hd0)
and the command "device" does not exist at the grub prompt.
any other ideas?
thanks
-M
find /boot/grub/stage1
find /boot/grub/stage2
Example output:
Regards,
Malcolm.
http://www.gnu.org/software/grub/manual/
http://www.gnu.org/software/grub/manual/html_node/Installing-GRUB-natively.html#Installing-GRUB-natively
You may have to create a custom device map or there may be another
problem. For example, the section about the "install" command highlights
some known BIOS problems:
http://www.gnu.org/software/grub/manual/html_node/install.html#install
The manual is full of references to BIOS setup, the ones I mention are
simply examples of issues that the GRUB developers know about. It may or
may not be related to the problems you are experiencing. Without booting
the affected system into a recovery environment and having a look around
(e.g. using fdisk, /proc file system, etc.) it is hard to know for sure
what's gone wrong. One might also simply try to install the OS from a
DVD and see what happens.
Regards,
Malcolm.
You need to make sure that these node are in fact PXE booting. My guess
is that they are trying to but off of the hard-drive and there is a
corrupt installation on them. If your head node has no knowledge of the
nodes, then you need to rerun insert-ethers and pxe boot the nodes.
Scott