- 论坛徽章:
- 0
|
我在组下用SUNW.HAStoragePlus:6资源类型建共享Zpool资源,
现在问题是通过clrg switch命令,可在双机间自由切换,但是如果我reboot当前Zpool挂起的节点(reboot非活动节点没有问题),就会发生数据冲突。
重启时在另一个节点日志里有以下记录:
Feb 8 09:56:16 cluster1 qlc: [ID 439991 kern.info] NOTICE: Qlogic qlc(0,0): Loop OFFLINE
Feb 8 09:56:16 cluster1 scsi: [ID 107833 kern.warning] WARNING: /pci@0,0/pci8086,65f9@6/pci1077,15d@0/fp@0,0/disk@w210000d02308005c,1 (sd2):
Feb 8 09:56:16 cluster1 SCSI transport failed: reason 'tran_err': retrying command
Feb 8 09:56:16 cluster1 qlc: [ID 439991 kern.info] NOTICE: Qlogic qlc(0,0): Loop ONLINE
Feb 8 09:56:16 cluster1 scsi: [ID 243001 kern.info] /pci@0,0/pci8086,65f9@6/pci1077,15d@0/fp@0,0 (fcp0):
Feb 8 09:56:16 cluster1 ndi_devi_online: failed for scsiclass,0d: target=25 lun=3 ffffffff
Feb 8 09:56:16 cluster1 scsi: [ID 107833 kern.warning] WARNING: /pci@0,0/pci8086,65f9@6/pci1077,15d@0/fp@0,0/disk@w210000d02308005c,1 (sd2):
Feb 8 09:56:16 cluster1 Error for Command: read(10) Error Level: Retryable
Feb 8 09:56:16 cluster1 scsi: [ID 107833 kern.notice] Requested Block: 0 Error Block: 0
Feb 8 09:56:16 cluster1 scsi: [ID 107833 kern.notice] Vendor: IFT Serial Number: 161743AB-020
Feb 8 09:56:16 cluster1 scsi: [ID 107833 kern.notice] Sense Key: Unit_Attention
Feb 8 09:56:16 cluster1 scsi: [ID 107833 kern.notice] ASC: 0x29 (power on, reset, or bus reset occurred), ASCQ: 0x0, FRU: 0x0
然后查看pool状态,可能就是以下样子:
root@cluster2:~# zpool status -v
pool: mynas
state: ONLINE
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
entire pool from backup.
see: http://www.sun.com/msg/ZFS-8000-8A
scrub: scrub completed after 0h0m with 2 errors on Mon Feb 8 10:05:29 2010
config:
NAME STATE READ WRITE CKSUM
mynas ONLINE 0 0 0
c11t112d1 ONLINE 0 0 0 22K repaired
errors: Permanent errors have been detected in the following files:
<metadata>:<0x1>
<metadata>:<0x15> |
|