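Software environment: Sun Cluster 3.1 U4 + VxVM 4.0 + Oracle HA
Hardware environment: V490 (2 x 1.35 GHz US-IV, 8 GB memory) + SE3510
I can't pin down where the problem is. I have run POST several times and every run reports: POST Passed all devices.
But the system panics as soon as it boots: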
{2} ok boot
Boot device: /pci@9,600000/SUNW,qlc@2/fp@0,0/disk@w21000014c37f35fe,0:a File and args:
SunOS Release 5.9 Version Generic_118558-24 64-bit
Copyright 1983-2003 Sun Microsystems, Inc. All rights reserved.
Use is subject to license terms.
Configuring ATM interfaces:
VxVM sysboot INFO V-5-2-3390 Starting restore daemon...
VxVM sysboot INFO V-5-2-3409 starting in boot mode...
NOTICE: VxVM vxdmp V-5-0-34 added disk array OTHER_DISKS, datype = OTHER_DISKS
NOTICE: VxVM vxdmp V-5-0-112 disabled path 118/0x10 belonging to the dmpnode 285/0x10
configuring IPv4 interfaces: ce0 ce1.
Hostname: scnode1
VxVM INFO V-5-2-3247 starting special volumes ( swapvol rootvol rootdg_16vol )...
The / file system (/dev/vx/rdsk/bootdg/rootvol) is being checked.
NOTICE: VxVM vxdmp V-5-0-148 enabled path 118/0x10 belonging to the dmpnode 285/0x10
/dev/vx/rdsk/bootdg/rootvol: UNREF FILE I=6766 OWNER=root MODE=100644
/dev/vx/rdsk/bootdg/rootvol: SIZE=58948 MTIME=Aug 17 19:12 2008 (RECONNECTED)
/dev/vx/rdsk/bootdg/rootvol: LINK COUNT FILE I=6766 OWNER=root MODE=100644
/dev/vx/rdsk/bootdg/rootvol: SIZE=58948 MTIME=Aug 17 19:12 2008 COUNT 0 SHOULD BE 1
/dev/vx/rdsk/bootdg/rootvol: LINK COUNT INCREASING
/dev/vx/rdsk/bootdg/rootvol: UNEXPECTED INCONSISTENCY; RUN fsck MANUALLY.
WARNING - Unable to repair the / filesystem. Run fsck
manually (fsck -F ufs /dev/vx/rdsk/bootdg/rootvol). Exit the shell when
done to continue the boot process.
Type control-d to proceed with normal startup,
(or give root password for system maintenance):
WARNING: [AFT1] TSCE Event detected by CPU0 in Privileged mode at TL>0, errID 0x00000308.ba36d664
AFSR 0x03301000<THCE,TSCE,ME,PRIV,TO>.00000000 AFAR 0x000000a1.ff57fd00
Fault_PC 0x117fb08
WARNING: [AFT1] THCE Event detected by CPU0 at TL>0, errID 0x00000308.ba36d664
AFSR 0x03301000<THCE,TSCE,ME,PRIV,TO>.00000000 AFAR 0x000000a1.ff57fd00 INVALID
Fault_PC 0x117fb08
WARNING: [AFT1] Timeout (TO) Event detected by CPU0 in Privileged mode at TL>0, errID 0x00000308.ba36d664
AFSR 0x03301000<THCE,TSCE,ME,PRIV,TO>.00000000 AFAR 0x000000a1.ff57fd00 INVALID
Fault_PC 0x117fb08
panic[cpu0]/thread=2a100013d40: [AFT1] errID 0x00000308.ba36d664 TSCE THCE TO Error(s)
See previous message(s) for details
syncing file systems... done
skipping system dump - no dump device configured
rebooting...
Resetting ...
This repeats in an endless loop...
RSC Alert: Host System has Reset
Software Reset
Enabling system bus....... Done
Initializing CPUs......... Done
Initializing boot memory.. Done
Initializing OpenBoot
Probing system devices
Probing I/O buses
Probing system devices
Probing I/O buses
Sun Fire V490, No Keyboard
Copyright 2005 Sun Microsystems, Inc. All rights reserved.
OpenBoot 4.18.8, 8192 MB memory installed, Serial #67171490.
Ethernet address 0:14:4f:0:f4:a2, Host ID: xxxxxxxx.
Rebooting with command: boot
Boot device: /pci@9,600000/SUNW,qlc@2/fp@0,0/disk@w21000014c37f35fe,0:a File and args:
SunOS Release 5.9 Version Generic_118558-24 64-bit
Copyright 1983-2003 Sun Microsystems, Inc. All rights reserved.
Use is subject to license terms.
Configuring ATM interfaces:
VxVM sysboot INFO V-5-2-3390 Starting restore daemon...
VxVM sysboot INFO V-5-2-3409 starting in boot mode...
NOTICE: VxVM vxdmp V-5-0-34 added disk array OTHER_DISKS, datype = OTHER_DISKS
NOTICE: VxVM vxdmp V-5-0-112 disabled path 118/0x18 belonging to the dmpnode 285/0x10
configuring IPv4 interfaces: ce0 ce1.
Hostname: scnode1
VxVM INFO V-5-2-3247 starting special volumes ( swapvol rootvol rootdg_16vol )...
The / file system (/dev/vx/rdsk/bootdg/rootvol) is being checked.
NOTICE: VxVM vxdmp V-5-0-148 enabled path 118/0x18 belonging to the dmpnode 285/0x10
BBC Devices: 0000.0000.0000.0005
BBC Arb: 0000.0000.0000.000f
BBC Quiesce: 0000.0000.0000.0009
BBC WDogAct: 0000.0000.0000.0000
BBC POR Gen: 0000.0000.0000.0000
BBC XIR Gen: 0000.0000.0000.0000
BBC POR Src: 0000.0000.0000.0000
BBC XIR Src: 0000.0000.0000.000f
BBC EBus TC: 014f.99fd.a7e6.3f29
Mem Time Ctl1: 20ac.0460.8124.950a
Mem Time Ctl2: 42a8.f833.89f1.0020
Mem Time Ctl3: 1060.03c7.1c82.0480
Mem Time Ctl4: 1d28.7ec0.38e7.0020
Mem Time Ctl5: 0000.0000.0058.0000
Mem Addr Dec1: 8000.fe02.8002.0000
Mem Addr Dec2: 8000.fe02.8002.0200
Mem Addr Dec3: 8000.fe02.8002.0400
Mem Addr Dec4: 8000.fe02.8002.0600
Mem Addr Ctl: 0104.1128.4422.1108
CPU16 Config/Control/Status registers:
CPUVersion: 003e.0018.3100.0507
SafConfig: 0caa.01bc.2020.8002 9:1 ID:16 HBM TOL:15
SafBaseAdr: 0000.0400.0000.0000
DispatchCtl: 0000.0000.0000.0009 MS SI
DCacheCtl: 0000.0200.0000.0010 WE
ECacheCtl: 0000.0000.01c5.5000 5:1 8MB mode=5-5-5(2) R/W-turn:2 Late-Sel ECC:off
ErrorEnable: 0000.0000.0000.000b CEEN NCEEN UCEEN
AFAR: 0000.0000.0000.0000
AFSR: 0000.0000.0000.0000 (no errors set)
AFAR 2: 0000.0000.0000.0000
AFSR 2: 0000.0000.0000.0000 (no errors set)
Mem Time Ctl1: 20ac.0460.8124.950a
Mem Time Ctl2: 42a8.f833.89f1.0020
Mem Time Ctl3: 1060.03c7.1c82.0480
Mem Time Ctl4: 1d28.7ec0.38e7.0020
Mem Time Ctl5: 0000.0000.0058.0000
Mem Addr Dec1: 8000.fe02.8002.0000
Mem Addr Dec2: 8000.fe02.8002.0200
Mem Addr Dec3: 8000.fe02.8002.0400
Mem Addr Dec4: 8000.fe02.8002.0600
Mem Addr Ctl: 0104.1128.4422.1108
Mem Time Ctl1: 20ac.0460.8124.950a
Mem Time Ctl2: 42a8.f833.89f1.0020
Mem Time Ctl3: 1060.03c7.1c82.0480
Mem Time Ctl4: 1d28.7ec0.38e7.0020
Mem Time Ctl5: 0000.0000.0058.0000
Mem Addr Dec1: 8000.fe02.8002.0100
Mem Addr Dec2: 8000.fe02.8002.0300
Mem Addr Dec3: 8000.fe02.8002.0500
Mem Addr Dec4: 8000.fe02.8002.0700
Mem Addr Ctl: 0104.1128.4422.1108
CPU18 Config/Control/Status registers:
CPUVersion: 003e.0018.3100.0507
SafConfig: 1534.01bc.2024.8002 9:1 ID:18 HBM TOL:15
SafBaseAdr: 0000.0400.0100.0000
DispatchCtl: 0000.0000.0000.0009 MS SI
DCacheCtl: 0000.0200.0000.0010 WE
ECacheCtl: 0000.0000.01c5.5000 5:1 8MB mode=5-5-5(2) R/W-turn:2 Late-Sel ECC:off
ErrorEnable: 0000.0000.0000.000b CEEN NCEEN UCEEN
AFAR: 0000.0000.0000.0000
AFSR: 0000.0000.0000.0000 (no errors set)
AFAR 2: 0000.0000.0000.0000
AFSR 2: 0000.0000.0000.0000 (no errors set)
Mem Time Ctl1: 20ac.0460.8124.950a
Mem Time Ctl2: 42a8.f833.89f1.0020
Mem Time Ctl3: 1060.03c7.1c82.0480
Mem Time Ctl4: 1d28.7ec0.38e7.0020
Mem Time Ctl5: 0000.0000.0058.0000
Mem Addr Dec1: 8000.fe02.8002.0100
Mem Addr Dec2: 8000.fe02.8002.0300
Mem Addr Dec3: 8000.fe02.8002.0500
Mem Addr Dec4: 8000.fe02.8002.0700
Mem Addr Ctl: 0104.1128.4422.1108
IO-Bridge 8 at 0000.0400.0400.0000
Device ID fc00.0000.0011.ad57
Ctl/Stat 0255.5554.0080.7e02
Error Ctl fc00.0000.0000.03e0
Int Ctl 8000.0000.0000.0017
Error Log 0000.0000.0000.0000
ECC Ctl e000.0000.0000.0000
EStar Ctl 0000.0000.0000.0001
Queue Ctl 0000.0000.0000.0000
Address Match Address Mask
PCIA Mem 8000.07fd.0000.0000 0000.07ff.0000.0000
PCIA C/IO 8000.07ff.ec00.0000 0000.07ff.fe00.0000
PCIB Mem 8000.07fe.0000.0000 0000.07ff.0000.0000
PCIB C/IO 8000.07ff.ee00.0000 0000.07ff.fe00.0000
AFAR AFSR
UE 0000.0ffb.fabb.bb50 0000.01f5.df83.805b
CE 0000.0f7a.767d.dbf0 0000.03fc.d54f.e17f
PCI A 0000.0000.0000.0000 0000.0000.0000.0000
PCI B 0000.0000.0000.0000 0000.0000.0000.0000
Control/Status Idle Check Diag Diagnostic
PCI A 0001.0002.010f.003f 0000.0000.0000.c00a 0000.0000.0000.0000
PCI B 0006.0000.010f.003f 0000.0000.0000.c002 0000.0000.0000.0000
IO-Bridge 9 at 0000.0400.0480.0000
Device ID fc00.0000.0013.ad57
Ctl/Stat 0255.59a8.0090.7e02
Error Ctl fc00.0000.0000.03e0
Int Ctl 8000.0000.0000.0017
Error Log 0000.0000.0000.0000
ECC Ctl e000.0000.0000.0000
EStar Ctl 0000.0000.0000.0001
Queue Ctl 0000.0000.0000.0000
Address Match Address Mask
PCIA Mem 8000.07fb.0000.0000 0000.07ff.0000.0000
PCIA C/IO 8000.07ff.e800.0000 0000.07ff.fe00.0000
PCIB Mem 8000.07fc.0000.0000 0000.07ff.0000.0000
PCIB C/IO 8000.07ff.ea00.0000 0000.07ff.fe00.0000
AFAR AFSR
UE 0000.0fe6.fefe.dbb0 0000.03ef.dcc3.8180
CE 0000.0a78.740f.1a30 0000.021d.1fc2.a11f
PCI A 0000.0000.0000.0000 0000.0000.0000.0000
PCI B 0000.0000.0000.0000 0000.0000.0000.0000
Control/Status Idle Check Diag Diagnostic
PCI A 0006.0002.010e.003f 0000.0000.0000.c000 0000.0000.0000.0000
PCI B 0000.0000.010e.003f 0000.0000.0000.c008 0000.0000.0000.0000
Resetting...
RSC Alert: Host System has Reset
ERROR: CPU RED-State Exception Reset Recovery
Enabling system bus....... Done
Initializing CPUs......... Done
Initializing boot memory.. Done
Initializing OpenBoot
Probing system devices
Probing I/O buses
Probing system devices
Probing I/O buses
Sun Fire V490, No Keyboard
Copyright 2005 Sun Microsystems, Inc. All rights reserved.
OpenBoot 4.18.8, 8192 MB memory installed, Serial #67171490.
Ethernet address 0:14:4f:0:f4:a2, Host ID: xxxxxxxx.
DPE D$ parity event
DDSPE D$ data parity event
DTSPE D$ physical tag parity event
IPE I$ parity event
IDSPE I$ data parity event
ITSPE I$ physical tag parity event
TSCE software correctable single-bit E$ tag ECC event
THCE hardware corrected single-bit E$ tag ECC event
UCC software correctable E$ ECC event
UCU uncorrectable E$ ECC event
EDC hardware corrected E$ ECC event
EDU:ST uncorrectable E$ ECC event for store merge
EDU:BLD uncorrectable E$ ECC event for block load
WDC hardware corrected E$ ECC event for writeback (victimization)
WDU uncorrectable E$ ECC event for writeback (victimization)
CPC hardware corrected E$ ECC event for copyout (snoop request)
CPU uncorrectable E$ ECC event for copyout (snoop request)
ETP uncorrectable E$ tag parity error
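For reading the flags in the panic output against the list above, here is a rough sketch of a lookup table in Python. The dictionary is copied from that list only; the names `AFSR_EVENTS` and `describe` are made up for this example, and flags that the list does not cover (such as ME, PRIV and TO in the AFSR line) are deliberately left unexplained rather than guessed.

```python
# Mnemonic-to-description table, copied from the list above.
AFSR_EVENTS = {
    "DPE": "D$ parity event",
    "DDSPE": "D$ data parity event",
    "DTSPE": "D$ physical tag parity event",
    "IPE": "I$ parity event",
    "IDSPE": "I$ data parity event",
    "ITSPE": "I$ physical tag parity event",
    "TSCE": "software correctable single-bit E$ tag ECC event",
    "THCE": "hardware corrected single-bit E$ tag ECC event",
    "UCC": "software correctable E$ ECC event",
    "UCU": "uncorrectable E$ ECC event",
    "EDC": "hardware corrected E$ ECC event",
    "EDU:ST": "uncorrectable E$ ECC event for store merge",
    "EDU:BLD": "uncorrectable E$ ECC event for block load",
    "WDC": "hardware corrected E$ ECC event for writeback (victimization)",
    "WDU": "uncorrectable E$ ECC event for writeback (victimization)",
    "CPC": "hardware corrected E$ ECC event for copyout (snoop request)",
    "CPU": "uncorrectable E$ ECC event for copyout (snoop request)",
    "ETP": "uncorrectable E$ tag parity error",
}


def describe(flags):
    """Return (flag, description) pairs; flags missing from the table map to None."""
    return [(flag, AFSR_EVENTS.get(flag)) for flag in flags]


# Flags taken from the AFSR line in the panic output: <THCE,TSCE,ME,PRIV,TO>
for flag, text in describe(["THCE", "TSCE", "ME", "PRIV", "TO"]):
    print(f"{flag:5} {text or '(not in the list above)'}")
```

Only THCE and TSCE resolve here, and both are E$ tag ECC events, which matches the two entries explained in more detail below.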
TSCE software correctable single-bit E$ tag ECC error
A non-block-load I$ fetch or D$ fill (including atomic instructions) request to the E$ results in a TSCE if a single-bit ECC error is detected in the E$ tag.
Under most circumstances, the error is logged and the system continues running. If, however, a correctable E$ error occurs while the kernel is in critical kernel code, it may not be recoverable. These events should be very rare.
THCE hardware corrected single-bit E$ tag ECC error
The THCE error covers cases of single-bit ECC tag error detected during all tag accesses other than the ones due to atomic instructions and I$ or D$ load miss. These include writeback, copyout, store merge, block load/store, prefetch queue operations, snoop read, displacement flush, E$ data fill, local writeback and store queue RTO operations.
This error is logged, and the system continues running.
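To compare the [AFT1] warnings across reboots, it may help to pull the fields out of the console text. Below is a minimal sketch in Python that parses the warning lines exactly as they appear in the log above. The regular expressions and the `parse_aft1` name are my own assumptions based on this one log, not an official format description, and the script only extracts text; it does not decode AFSR bits.

```python
import re

# Two of the warnings from the console output above, pasted verbatim.
LOG = """\
WARNING: [AFT1] TSCE Event detected by CPU0 in Privileged mode at TL>0, errID 0x00000308.ba36d664
AFSR 0x03301000<THCE,TSCE,ME,PRIV,TO>.00000000 AFAR 0x000000a1.ff57fd00
Fault_PC 0x117fb08
WARNING: [AFT1] THCE Event detected by CPU0 at TL>0, errID 0x00000308.ba36d664
AFSR 0x03301000<THCE,TSCE,ME,PRIV,TO>.00000000 AFAR 0x000000a1.ff57fd00 INVALID
Fault_PC 0x117fb08
"""

# Patterns inferred from this log only (an assumption, not a documented grammar).
WARN_RE = re.compile(
    r"\[AFT1\] (?P<event>.+?) Event detected by CPU(?P<cpu>\d+)"
    r".*errID (?P<errid>0x[0-9a-f]+\.[0-9a-f]+)"
)
AFSR_RE = re.compile(
    r"AFSR (?P<afsr>0x[0-9a-f]+)<(?P<flags>[^>]*)>\.[0-9a-f]+"
    r" AFAR (?P<afar>0x[0-9a-f]+\.[0-9a-f]+)(?P<invalid> INVALID)?"
)
PC_RE = re.compile(r"Fault_PC (?P<pc>0x[0-9a-f]+)")


def parse_aft1(text):
    """Group each WARNING line with the AFSR/AFAR and Fault_PC lines that follow it."""
    records = []
    for line in text.splitlines():
        m = WARN_RE.search(line)
        if m:
            records.append(
                {"event": m["event"], "cpu": int(m["cpu"]), "errid": m["errid"]}
            )
            continue
        if not records:
            continue  # ignore detail lines that appear before any WARNING
        m = AFSR_RE.search(line)
        if m:
            records[-1].update(
                afsr=m["afsr"],
                flags=m["flags"].split(","),
                afar=m["afar"],
                afar_valid=m["invalid"] is None,  # log marks some AFARs INVALID
            )
            continue
        m = PC_RE.search(line)
        if m:
            records[-1]["fault_pc"] = m["pc"]
    return records


for rec in parse_aft1(LOG):
    print(rec["event"], rec["errid"], rec["flags"], rec["afar"], rec["afar_valid"])
```

In the log above all the warnings carry the same errID, AFSR and Fault_PC, so they look like several views of one event rather than separate faults. Running the same parse over the output of each reboot would show whether the CPU and address stay the same every time.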