- 论坛徽章:
- 0
|
p570x2,AIX5.3(ML03)+HA5.3(cluster.es.server.utils.5.3.0.3)+EMC cx500做RAC。
1、datavg为concurrent模式。
2、node1起HA,可以正常看到datavg中的raw。
3、node2起HA,会将datavg从node1中抢过来,可以看到有“rg_remove”动作。此时node1上看不到datavg,但是HA进程正常。而在node2上,又提示需要chvg -u datavg。解锁后在node2上观察lsvg -l datavg,发现有一部分raw变成了jfs。
4、停下HA,做exportvg和importvg,情况照旧,如此反复几次,都有这种异常情况。
在errpt中看到一些关于vg的报错:
---------------------------------------------------------------------------
LABEL: LVM_SA_QUORCLOSE
IDENTIFIER: CAD234BE
Date/Time: Mon Feb 20 17:01:34 BEIST 2006
Sequence Number: 2202
Machine Id: 00C2E62A4C00
Node Id: ODB01
Class: H
Type: UNKN
Resource Name: LVDD
Resource Class: NONE
Resource Type: NONE
Location:
Description
QUORUM LOST, VOLUME GROUP CLOSING
Probable Causes
PHYSICAL VOLUME UNAVAILABLE
Detail Data
MAJOR/MINOR DEVICE NUMBER
8000 003D 0000 0000
QUORUM COUNT
2
ACTIVE COUNT
0
SENSE DATA
0000 0000 0000 0901 00C2 E69A 0000 4C00 0000 0109 52EF 162F 00C2 E62A B00F 3756
---------------------------------------------------------------------------
---------------------------------------------------------------------------
LABEL: SC_DISK_ERR2
IDENTIFIER: B6267342
Date/Time: Mon Feb 20 19:30:54 BEIST 2006
Sequence Number: 2289
Machine Id: 00C2E62A4C00
Node Id: ODB01
Class: H
Type: PERM
Resource Name: hdisk4
Resource Class: disk
Resource Type: CLAR_FC_raid5
Location: U7879.001.DQDGFHW-P1-C5-T1-W500601683021BD0B-L0
VPD:
Manufacturer................DGC
Machine Type and Model......RAID 5
ROS Level and ID............0219
Serial Number...............CK200053101058
Device Specific.(SI)........CX500
Device Specific.(PQ)........00
Device Specific.(VS)........0200003581CL
Device Specific.(UI)........6006016069F9150038C5D24EA799DA11
Device Specific.(FL)........0002
Device Specific.(Z0)........10
Device Specific.(Z1)........10
Description
DISK OPERATION ERROR
Probable Causes
DASD DEVICE
Failure Causes
DISK DRIVE
DISK DRIVE ELECTRONICS
Recommended Actions
PERFORM PROBLEM DETERMINATION PROCEDURES
Detail Data
PATH ID
0
SENSE DATA
0600 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0118 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0001 4000 0000 0011 0000 0000 0000 0000 0000 0000 0000
0000 003D 001A
Duplicates
Number of duplicates
2
Time of first duplicate
Mon Feb 20 19:30:54 BEIST 2006
Time of last duplicate
Mon Feb 20 19:30:54 BEIST 2006
---------------------------------------------------------------------------
用diag检查光纤通道卡,没有发现硬件错误。有没有可能是存储的问题呢? |
|