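I was testing programs until past midnight last night and was already fed up. Just when I thought I could wrap up and go home to sleep, something else went wrong. While I was starting the WAS cluster, one of the servers in the MC cluster suddenly went down, and I did not even have the key to the machine room. Fortunately, after ten minutes or so the machine came back up on its own. The packages in MC started and ran without problems. However, on one machine, after I activated a standalone VG that sits outside MC, mounting its file system failed with this error: vxfs mount: /dev/v3data/v3hddata is corrupted. needs checking
The abnormal reboot had probably damaged the file system, so I decided to check its integrity with fsck.
Here is what I did: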
mgirl#vgchange -a y /dev/v3data (activate volume group v3data)
HYC1#vgdisplay (v3data now shows as activated)
--- Volume groups ---
VG Name /dev/vg00
VG Write Access read/write
VG Status available
Max LV 255
Cur LV 9
Open LV 9
Max PV 16
Cur PV 1
Act PV 1
Max PE per PV 4328
VGDA 2
PE Size (Mbytes) 16
Total PE 4318
Alloc PE 4317
Free PE 1
Total PVG 0
Total Spare PVs 0
Total Spare PVs in use 0
VG Name /dev/vglock
VG Write Access read/write
VG Status available, exclusive
Max LV 255
Cur LV 0
Open LV 0
Max PV 16
Cur PV 1
Act PV 1
Max PE per PV 1016
VGDA 2
PE Size (Mbytes) 4
Total PE 24
Alloc PE 0
Free PE 24
Total PVG 0
Total Spare PVs 0
Total Spare PVs in use 0
VG Name /dev/v3data
VG Write Access read/write
VG Status available
Max LV 255
Cur LV 2
Open LV 2
Max PV 16
Cur PV 3
Act PV 3
Max PE per PV 30720
VGDA 6
PE Size (Mbytes) 4
Total PE 92145
Alloc PE 85000
Free PE 7145
Total PVG 0
Total Spare PVs 0
Total Spare PVs in use 0
VG Name /dev/qhv3hd
VG Write Access read/write
VG Status available, exclusive
Max LV 255
Cur LV 30
Open LV 30
Max PV 16
Cur PV 3
Act PV 3
Max PE per PV 30720
VGDA 6
PE Size (Mbytes) 4
Total PE 92145
Alloc PE 64000
Free PE 28145
Total PVG 0
Total Spare PVs 0
Total Spare PVs in use 0
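The vgdisplay output above gives extent counts rather than sizes; free space is Free PE multiplied by PE Size. A minimal sketch of that arithmetic follows, assuming the field labels shown above. `vg_free_mb` is a hypothetical helper name, not an HP-UX command.

```shell
# vg_free_mb: read `vgdisplay` output on stdin and print free space per
# volume group. Hypothetical helper; it only relies on the "VG Name",
# "PE Size" and "Free PE" labels that appear in the output above.
vg_free_mb() {
    awk '
        /VG Name/ { vg = $NF }    # remember the current volume group
        /PE Size/ { pe = $NF }    # extent size in MB
        /Free PE/ { printf "%s %d MB free\n", vg, $NF * pe }
    '
}

# Usage on the HP-UX host (not run here):
#   vgdisplay | vg_free_mb
```

For /dev/v3data this works out to 7145 extents of 4 MB each, i.e. 28580 MB free.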
mgirl#bdf (file systems /v3hddata and /v3tddata are not mounted)
Filesystem kbytes used avail %used Mounted on
/dev/vg00/lvol3 278528 205568 72448 74% /
/dev/vg00/lvol1 311296 108408 201352 35% /stand
/dev/vg00/lvol8 4718592 3483912 1228024 74% /var
/dev/vg00/lvol7 3506176 2656600 842992 76% /usr
/dev/vg00/lvol4 1540096 88360 1440832 6% /tmp
/dev/vg00/lvol6 36864000 25044568 11729984 68% /opt
/dev/vg00/lvol5 15728640 4068720 11586168 26% /home
QHYC2:/home/share 41943040 6645800 35033344 16% /home/share
/dev/qhv3hd/hdhome 10240000 5885005 4082886 59% /hdhome
/dev/qhv3hd/hdback 102400000 44210233 54552933 45% /hdback
mgirl#mount /dev/v3data/v3hddata /v3hddata (mount fails; the file system has to be checked with fsck first)
vxfs mount: /dev/v3data/v3hddata is corrupted. needs checking
mgirl#fsck -F vxfs /dev/v3data/v3tddata (check the file system)
log replay in progress
replay complete - marking super-block as CLEAN
mgirl#mount /dev/v3data/v3tddata /v3tddata (mount succeeds)
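The mount, fsck, mount sequence above could be wrapped in a small function so it can be repeated for each logical volume. This is only a sketch: `mount_or_repair` is a hypothetical name, and it uses the same device paths and the same default `fsck -F vxfs` call as the transcript. As the output above shows, the default VxFS fsck only replays the intent log; fsck_vxfs also has `-o full` for a full structural check, which this sketch does not attempt.

```shell
# mount_or_repair DEVICE MOUNTPOINT
# Hypothetical wrapper: try to mount; if that fails, replay the VxFS
# intent log with fsck and try the mount once more.
mount_or_repair() {
    dev=$1
    mnt=$2
    # First attempt: a clean file system mounts directly.
    if mount "$dev" "$mnt"; then
        return 0
    fi
    echo "mount of $dev failed, running fsck" >&2
    # Default fsck on VxFS performs a log replay only.
    fsck -F vxfs "$dev" || return 1
    # Second attempt; its exit status is the function's result.
    mount "$dev" "$mnt"
}

# Usage on the HP-UX host (not run here):
#   mount_or_repair /dev/v3data/v3hddata /v3hddata
```

If the second mount still fails, the function returns non-zero and leaves the decision about a deeper check to the operator.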
mgirl#bdf (the output confirms the mounts succeeded)
Filesystem kbytes used avail %used Mounted on
/dev/vg00/lvol3 278528 205568 72448 74% /
/dev/vg00/lvol1 311296 108408 201352 35% /stand
/dev/vg00/lvol8 4718592 3483920 1228000 74% /var
/dev/vg00/lvol7 3506176 2656600 842992 76% /usr
/dev/vg00/lvol4 1540096 88360 1440832 6% /tmp
/dev/vg00/lvol6 36864000 25044632 11729920 68% /opt
/dev/vg00/lvol5 15728640 4059544 11595272 26% /home
QHYC2:/home/share 41943040 6665424 35013872 16% /home/share
/dev/qhv3hd/hdhome 10240000 5885005 4082886 59% /hdhome
/dev/qhv3hd/hdback 102400000 44210233 54552933 45% /hdback
/dev/v3data/v3tddata
174080000 8622223 155128431 5% /v3tddata
/dev/v3data/v3hddata
174080000 342093 162880559 0% /v3hddata
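Note that bdf wraps an entry onto two lines when the device name is long, as with the two v3data volumes above, which gets in the way of grep or awk. A sketch of one way to join the wrapped lines follows; `bdf_join` is a hypothetical helper name, and it assumes a wrapped entry always consists of a line holding only the device name followed by a line with the remaining columns.

```shell
# bdf_join: read `bdf` output on stdin and merge wrapped entries so that
# every file system occupies exactly one line. Hypothetical helper.
bdf_join() {
    awk '
        NF == 1    { held = $1; next }              # device name alone: hold it
        held != "" { $0 = held " " $0; held = "" }  # prepend it to the next line
                   { $1 = $1; print }               # normalise spacing and print
    '
}

# Usage on the HP-UX host (not run here):
#   bdf | bdf_join | awk '$6 == "/v3hddata"'
```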
When I came in today, I looked at dmesg and found quite a few error messages. The details are as follows:
MCA[0]:REVISION:0002
MCA[0]:SEVERITY:0
MCA[0]:Processor Error Device Info decode begins.
MCA[0]:VALIDATION_BITS = 0x000000000100101f
MCA[0]:PSP = 0x28000000fff21130
MCA/CMC[0]: cache check. info:0x008000df00240511
MCA/CMC[0]: req:N/A res:N/A
MCA/CMC[0]: tgt:N/A ip:N/A
MCA/CMC[0]: bus check. info:0x0080000001000040
MCA/CMC[0]: req:N/A res:N/A
MCA/CMC[0]: tgt:N/A ip:N/A
MCA[0]:PSI_STATIC_STRUCT.VALID_FIELD_BITS=0x000000000000003f
MCA[0]:Processor Error Device Info decode ends.
Logical volume 64, 0x3 configured as ROOT
Logical volume 64, 0x2 configured as SWAP
Logical volume 64, 0x2 configured as DUMP
Swap device table: (start & size given in 512-byte blocks)
entry 0 - major is 64, minor is 0x2; start = 0, size = 8388608
Dump device table: (start & size given in 1-Kbyte blocks)
entry 0000000000000000 - major is 31, minor is 0x2000; start = 826228, size = 4194300
Starting the STREAMS daemons-phase 1
Create STCP device files
MCA[3]:REVISION:0002
MCA[3]:SEVERITY:0
MCA[3]:Processor Error Device Info decode begins.
MCA[3]:VALIDATION_BITS = 0x000000000100100f
MCA[3]:PSP = 0x20000000fff211a0
MCA/CMC[3]: bus check. info:0x0080000001000040
MCA/CMC[3]: req:N/A res:N/A
MCA/CMC[3]: tgt:N/A ip:N/A
MCA[3]:PSI_STATIC_STRUCT.VALID_FIELD_BITS=0x000000000000003f
MCA[3]:Processor Error Device Info decode ends.
MCA[2]:REVISION:0002
MCA[2]:SEVERITY:0
MCA[2]:Processor Error Device Info decode begins.
MCA[2]:VALIDATION_BITS = 0x000000000100100f
MCA[2]:PSP = 0x20000000fff211a0
MCA/CMC[2]: bus check. info:0x0080000001000040
MCA/CMC[2]: req:N/A res:N/A
MCA/CMC[2]: tgt:N/A ip:N/A
MCA[2]:PSI_STATIC_STRUCT.VALID_FIELD_BITS=0x000000000000003f
MCA[2]:Processor Error Device Info decode ends.
MCA[1]:REVISION:0002
MCA[1]:SEVERITY:0
MCA[1]:Processor Error Device Info decode begins.
MCA[1]:VALIDATION_BITS = 0x000000000100100f
MCA[1]:PSP = 0x20000000fff211a0
MCA/CMC[1]: bus check. info:0x0080000001000040
MCA/CMC[1]: req:N/A res:N/A
MCA/CMC[1]: tgt:N/A ip:N/A
MCA[1]:PSI_STATIC_STRUCT.VALID_FIELD_BITS=0x000000000000003f
MCA[1]:Processor Error Device Info decode ends.
MCA[1]:memory Error
MCA[1]:Address = 0x200b2eff8
0x0000000200b2eff8 = uBadPageAdr (JAGaf30158 in mca.c)
MCA/CMC[1]:handler = OS_MCA, sub_type = Generic
MCA/CMC[1]:00 00 00 00 00 00 20 01 e0 00 00 00 00 23 0d 70
MCA/CMC[1]:00 00 00 02 00 b2 ef f8 00 00 00 01 00 00 00 02
MCA/CMC[1]:00 00 00 00 00 00 00 00 00 00 00 00 00 00 23 e0
MCA[1]:Platform Specific Pluto Rope error
MCA[1]:Platform Specific Data = 0x0
MCA[1]:Error Status val = 0xa1800
MCA[1]:Error Status type = ERR_ERROR Detection of PATH_ERROR.
MCA/CMC[1]:handler = OS_MCA, sub_type = Generic
MCA/CMC[1]:00 00 00 00 00 00 20 05 e0 00 00 00 00 23 0e 40
MCA/CMC[1]:00 00 00 00 00 0a 18 00 00 00 00 00 00 00 00 00
MCA/CMC[1]:00 00 00 00 fe d0 00 00 00 00 00 00 00 00 00 00
MCA[1]:PCI Mercury Bus Error
MCA[1]:Bus ID = 0x0
MCA[1]:Error Type = 0x0
MCA[1]:Error Status val = 0x0
MCA[1]:Error Status type = NO DATA
MCA/CMC[1]:handler = OS_MCA, sub_type = Generic
MCA/CMC[1]:00 00 00 00 00 00 20 04 e0 00 00 00 00 23 0d 10
MCA/CMC[1]:00 00 00 00 00 0a 18 00 00 00 00 00 00 00 00 00
MCA/CMC[1]:00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
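My guess is that something is wrong with the memory or a PCI slot. Better to call HP's 800 support line and let an HP engineer take a look. Scary!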
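Before calling support it can help to condense the dmesg dump. The sketch below counts the "decode begins" records per bracketed index (presumably the processor number, which is an assumption on my part) and pulls out any reported memory error address. `mca_summary` is a hypothetical helper name and it only matches the line formats visible in the log above.

```shell
# mca_summary: read dmesg text on stdin, print each reported memory error
# address and the number of MCA records seen per bracketed index.
# Hypothetical helper; the index is assumed to be the processor number.
mca_summary() {
    awk '
        /MCA\[[0-9]+\]:Processor Error Device Info decode begins/ {
            idx = $0
            sub(/.*MCA\[/, "", idx)   # drop everything up to the opening bracket
            sub(/\].*/, "", idx)      # drop the closing bracket and the rest
            records[idx]++
        }
        /MCA\[[0-9]+\]:Address = / { print "memory error address: " $NF }
        END {
            # Iteration order of an awk array is unspecified.
            for (i in records) print "index " i ": " records[i] " record(s)"
        }
    '
}

# Usage on the HP-UX host (not run here):
#   dmesg | mca_summary
```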
This article comes from a ChinaUnix blog. To view the original, go to: http://blog.chinaunix.net/u/24018/showart_407982.html