About two weeks ago I installed the current Solaris Os on two v240 to
form a cluster. For the first install I used ZFS, but that was
incompatible with the cluster software. So I reinstalled the OS, chose
"UFS" as the filesystem and mirrored all partitions onto the second
drive in each machine with the Solaris Volume Manager.
Today I wanted to patch these machines and rebooted one of them into
single user mode. Instead of booting correctly, I got this message:
| WARNING: The following files in / differ from the boot archive:
|
| changed /kernel/drv/did.conf
|
| The recommended action is to reboot to the failsafe archive to correct
| the above inconsistency. To accomplish this, on a GRUB-based platform,
| reboot and select the "Solaris failsafe" option from the boot menu.
| On an OBP-based platform, reboot then type "boot -F failsafe". Then
| follow the prompts to update the boot archive. Alternately, to continue
| booting at your own risk, you may clear the service by running:
| "svcadm clear system/boot-archive"
So I booted the machine into failsafe mode and used the instructions
from http://docs.sun.com/app/docs/doc/817-1985/gglbw?a=view to update
the boot-archive. During this procedure I was confused that my first
disk was listed as c0t0d0s0 and not c1t0d0s0. Also the output of
metastat did not work and the system asked me if I wanted to mount a
zpool which it found during startup. But there should not be any more
zpool now, all partitions are UFS.
The reboot did not work with c0t0d0s0 in the vfstab, so I rebooted once
again into failsafe. From this point, the whole system is corrupted. I
am unable to boot anymore. The OpenBoot prompt tells me this:
| Rebooting with command: boot
| Boot device: disk File and args:
| ERROR: Last Trap: Fast Data Access MMU Miss
|
| {1} ok boot
| ERROR: /pci@1c,600000: Last Trap: Fast Data Access MMU Miss
|
| {1} ok
I am unsure how to proceed from this point. The second machine tells me
exactly the same error about "/kernel/drv/did.conf", so I do not think
that in both servers a hard drive failed simultaniously.
Regards
Stephan
The error message you get on attempting to boot strongly suggests a
hardware problem. Call Sun and get it fixed!
I hope you have a backup of your disk from before you started tinkering!
You may need that after you get your MMU problem straightened out.
Hi,
not sure what you have done on those servers exactly - but the error
you got (boot_archive and GRUB) is related to x86 architecture. but
you have got v240 (SPARC).
now, if I were in your shoes I would boot off OS CD or net into single-
user mode and:
* restore bootblk on the internal disks
* mount your root FS from the internal disks and look into the
following:
/etc/system
/etc/vfstab
if you get the same error (MMU Miss) when booting off CD or net - open
the call in Sun support.
Cheers,
Nikolai
ops, sorry - I was wrong - as I see it must be Solaris10 U6 - so, the
warnings to boot_archive can be met on this particular configuration
after patching - due to the boot_archive file is not updated after
patch has been applied.
anyway, before booting off CD or net - try to reset the server
from OBP
setenv auto-boot? false
reset-all
and then
try to boot off the local disk with -F failsafe option (I would try
from both disks), if it doesn't work - boot off cdrom or net (if you
have JS server)
if you still have MMU Miss - log the call to Sun
Cheers,
Nikolai