smile_330803 发表于 2012-06-28 16:24

Solaris RAID1损坏导致无法开机

各位大大,

小弟有一急事相求,我的solaris server上面配置的RAID出问题了,导致无法开机(一直reboot并show下面这些错误信息), 所以想请教一下各位大大,有没有什么办法,比如我想,能不能在安全模式下将Raid删掉/disable,然后重新配置?在不影响原来的数据的前提下?这事很紧急!!!各位大大,如果有会的,一定要不吝赐教啊.. .小弟先谢过了..

================================================================

rebooting...

SPARC Enterprise T3120, No Keyboard
Copyright 2009 Sun Microsystems, Inc.All rights reserved.
OpenBoot 4.30.4_nightly_03.26.2009, 16256 MB memory available, Serial #87305134.
Ethernet address 0:21:28:34:2b:ae, Host ID: 85342bae.

Boot device: /pci@0/pci@0/pci@1/scsi@0/disk@0,0:aFile and args:

SunOS Release 5.10 Version Generic_137137-09 64-bit
Copyright 1983-2008 Sun Microsystems, Inc.All rights reserved.
Use is subject to license terms.
WARNING: Postattach failed for hid2
WARNING: Postattach failed for hid0
WARNING: md d0: open error on (Unavailable)

Cannot open mirrored root device, error 6
Cannot remount root on /pseudo/md@0:0,0,blk fstype ufs

panic/thread=180e000: vfs_mountroot: cannot remount root000000000180b950 genunix:vfs_mountroot+380 (21, 2000, 18d1650, 18ad390, 18a7800, 18a7da0)
%l0-3: 0000000000000001 0000000000008025 0000060021c78080 0000000001273c00
%l4-7: 00000000018a7b58 0000060021c78080 00000000018d1650 0000000000002021
000000000180ba10 genunix:main+a0 (182b548, 180c000, 185b240, 18f6c00, 1831948, 182b400)
%l0-3: 000000000101a800 0000000000000001 0000000070002000 0000000000000000
%l4-7: 0000000001862800 0000000000000000 000000000180c000 0000000000000000

syncing file systems... done
skipping system dump - no dump device configured
rebooting...

dfck001 发表于 2012-06-28 16:30

解决方法 :
1. Booting from CD-RM
ok boot cdrom -s

2.Executing file system check to /(root) partition.
# fsck -y /dev/rdsk/c0t8d0s0

3.Mounting /(root) partition.
# mount /dev/dsk/c0t8d0s0 /mnt

4.Editing /etc/vfstab
# TERM=vt100;export TERM
# vi /mnt/etc/vfstab
(before editing)
--------------------------------------------------------------------------------
#devicedevicemountFS fsck mount mount
#to mount to fsckpointtype pass at boot options
#
fd - /dev/fd fd - no -
/proc - /proc proc - no -
/dev/md/dsk/d10 - - swap - no -
/dev/md/dsk/d100 /dev/md/rdsk/d100 / ufs 1 no -
/dev/md/dsk/d60 /dev/md/rdsk/d60 /export/home ufs 2 yes -
/dev/md/dsk/d50 /dev/md/rdsk/d50 /opt ufs 2 yes -
swap - /tmp tmpfs - yes -
#RMS#/dev/dsk/c4t1d0s6 /dev/rdsk/c4t1d0s6 /opt/goat ufs 2 yes -
---------------------------------------------------------------------------------
(after editing)
--------------------------------------------------------------------------------
#devicedevicemountFS fsck mount mount
#to mount to fsckpointtype pass at boot options
#
fd - /dev/fd fd - no -
/proc - /proc proc - no -
/dev/dsk/c0t0d0s1 - - swap - no -
/dev/dsk/c0t0d0s0 /dev/rdsk/c0t0d0s0 / ufs 1 no -
/dev/dsk/c0t0d0s6 /dev/rdsk/c0t0d0s6 /export/home ufs 2 yes -
/dev/dsk/c0t0d0s5 /dev/rdsk/c0t0d0s5 /opt ufs 2 yes -
swap - /tmp tmpfs - yes -
#RMS#/dev/dsk/c4t1d0s6 /dev/rdsk/c4t1d0s6 /opt/goat ufs 2 yes -
---------------------------------------------------------------------------------
5. Commenting out "rootdev" line of /etc/system
# vi /mnt/etc/system
(before editing)
--------------------------------------
* Begin MDD root info (do not edit)
forceload: misc/md_trans
forceload: misc/md_raid
forceload: misc/md_hotspares
forceload: misc/md_sp
forceload: misc/md_stripe
forceload: misc/md_mirror
forceload: drv/pcipsy
forceload: drv/fjulsa
forceload: drv/sd
rootdev:/pseudo/md@0:0,100,blk
* End MDD root info (do not edit)
--------------------------------------
(after editing)
--------------------------------------
* Begin MDD root info (do not edit)
forceload: misc/md_trans
forceload: misc/md_raid
forceload: misc/md_hotspares
forceload: misc/md_sp
forceload: misc/md_stripe
forceload: misc/md_mirror
forceload: drv/pcipsy
forceload: drv/fjulsa
forceload: drv/sd
* rootdev:/pseudo/md@0:0,100,blk
* End MDD root info (do not edit)
--------------------------------------
照着做把系统从另一块盘启动注意修改一下盘符,系统启动后把镜像重新做一下(如果有硬盘故障记得更换掉)

doging 发表于 2012-06-28 16:35

楼上的方法可行

smile_330803 发表于 2012-06-29 14:00

多谢dfck001一直以来对小弟的淳淳教诲,把/etc/system, /etc/vfstab修改后确实是可以重新进入系统了,但开机会报RAID的错误,看起来metab database已经损坏了,不知有没有办法重新配置或修复。。?

====================================================================================

Boot device: rootdiskFile and args:
SunOS Release 5.10 Version Generic_137137-09 64-bit
Copyright 1983-2008 Sun Microsystems, Inc.All rights reserved.
Use is subject to license terms.
Hostname: mtsd-factory-03
Insufficient metadevice database replicas located.

Use metadb to delete databases which are broken.Ignore any Read-only file system error messages.
Reboot the system when finished to reload the metadevice database.
After reboot, repair any broken database replicas which were deleted.
Jun 29 13:49:49 svc.startd: svc:/system/metainit:default: Method "/lib/svc/method/svc-metainit" failed with exit status 96.
Jun 29 13:49:49 svc.startd: system/metainit:default misconfigured: transitioned to maintenance (see 'svcs -xv' for details)/dev/rdsk/c0t2d0s7 is clean
Reading ZFS config: done.

mtsd-factory-03 console login:

dfck001 发表于 2012-06-29 15:28

不用客气,我也做SUN不久以后相互学习吧
#metadb查看哪块盘的状态数据库坏了
#metadb -d /dev/dsk/cxtxdxsx就可以
看来你第一块盘坏掉了,换掉吧

smile_330803 发表于 2012-07-01 16:30

嗯,这几天我在SUN网站上找这方面的资料看了下,问题已经解决了,就是先不经RAID(透过修改/etc/vfstab & /etc/system)进入系统,然后删除meta database(透过metadb -d cxtxdxsx),然后再重建RAID
页: [1]
查看完整版本: Solaris RAID1损坏导致无法开机