Last edited by lanfeng356 on 2011-06-16 11:35
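1. Platform:
Host: IBM P6 550
OS: AIX 6100-06
Cluster: PowerHA 5.5
2. Symptoms:
Resource group 1 (which contains a single-instance database) can be moved from host A to host B.
Resource group 1 (which contains a single-instance database) cannot be moved back from host B to host A.
(The cluster configuration has been synchronized; the start/stop scripts are identical on both nodes, with identical execute permissions.)
At this point, cluster services on host B cannot be stopped.
Host B: root:/hacmp>lssrc -ls clstrmgrES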
Current state: ST_RP_FAILED
sccsid = "@(#)36 1.135.5.2 src/43haes/usr/sbin/cluster/hacmprd/main.C, hacmp.pe, 53haes_r550, 0934B_hacmp550 8/8/09 14:48:23"
i_local_nodeid 1, i_local_siteid -1, my_handle 2
ml_idx[1]=0 ml_idx[2]=1
tp is 20714628
Events on event queue:
te_type 36, te_nodeid 2, te_network 1
There are 0 events on the Ibcast queue
There are 0 events on the RM Ibcast queue
CLversion: 10
local node vrmf is 5506
cluster fix level is "6"
The following timer(s) are currently active:
Event error node list: node_B
Current DNP values
DNP Values for NodeId - 1 NodeName - node_A
PgSpFree = 2222330 PvPctBusy = 0 PctTotalTimeIdle = 369.378540
DNP Values for NodeId - 2 NodeName - node_B
PgSpFree = 2224876 PvPctBusy = 0 PctTotalTimeIdle = 365.773210
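For anyone who wants to check this state from a script rather than by eye, here is a minimal sketch. It only parses the `Current state:` line of the output shown above; the function names `cluster_state` and `cluster_is_stable` are made up for illustration, and it assumes the line format stays exactly as in this post.

```shell
# Print the cluster manager state found in `lssrc -ls clstrmgrES` output.
# Reads the command output on stdin so it can be tried on saved text too.
cluster_state() {
    awk -F': *' '/^Current state:/ { print $2; exit }'
}

# Succeed only when the reported state is ST_STABLE.
cluster_is_stable() {
    [ "$(cluster_state)" = "ST_STABLE" ]
}

# Usage on a cluster node (not executed here):
#   lssrc -ls clstrmgrES | cluster_state
#   lssrc -ls clstrmgrES | cluster_is_stable || echo "cluster manager is not stable"
```

With the output above, `cluster_state` would print `ST_RP_FAILED`, which is the state the cluster manager reports after an event script has failed.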
Moving the resource group fails with this error:
Command: failed stdout: yes stderr: no
Before command completion, additional instructions may appear below.
Attempting to move resource group RG1 to node A.
Waiting for the cluster to process the resource group movement request....
Waiting for the cluster to stabilize...........
ERROR: Event processing has failed for the requested resource
group movement. The cluster is unstable and requires manual intervention
to continue processing.
Checking the cluster status:
Resource Group Name: RG1
Startup Policy: Online On Home Node Only
Fallover Policy: Fallover To Next Priority Node In The List
Fallback Policy: Fallback To Higher Priority Node In The List
Site Policy: ignore
Primary instance(s):
The following node temporarily has the highest priority for this instance:
A, user-requested rg_move performed on Mon Jun 13 18:03:22 2011
Node Group State
---------------------------- ---------------
A OFFLINE
B ERROR
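The node/state table above can also be filtered in a script. This is a rough sketch that assumes the listing layout shown in this post (a dashed rule followed by one `node state` row per node, with single-word states); `nodes_in_state` is a hypothetical helper name, not a PowerHA command.

```shell
# List the nodes whose group state equals $1 (for example ERROR).
# Reads the status listing on stdin.
nodes_in_state() {
    awk -v want="$1" '
        /^-+/ { in_table = 1; next }                     # dashed rule: node rows start after it
        in_table && NF >= 2 && $NF == want { print $1 }  # first column is the node name
    '
}

# Usage (not executed here): save the listing to a file, then
#   nodes_in_state ERROR < rg_status.txt
```

For the listing above, this would report node B as the one in the ERROR state.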
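Only after host B is rebooted with shutdown -Fr does host A automatically take over resource group RG1 again.
Resource group 2 (just a single floating IP) can be moved from host B to host A.
Resource group 2 (just a single floating IP) can be moved back from host A to host B.
3. Error logs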
hacmp.rar
(39.63 KB, downloads: 74)
4. A separate problem: clstat does not work
Running clstat fails with:
emacdb2:root:/usr/es/sbin/cluster>./clstat
Failed retrieving cluster information.
There are a number of possible causes:
clinfoES or snmpd subsystems are not active.
snmp is unresponsive.
snmp is not configured correctly.
Cluster services are not active on any nodes.
Refer to the HACMP Administration Guide for more information.
Additional information for verifying the SNMP configuration on AIX 6
can be found in /usr/es/sbin/cluster/README5.5.0.UPDATE
Following the hint in /usr/es/sbin/cluster/README5.5.0.UPDATE, I added the line below to /etc/snmpdv3.conf:
VACM_VIEW defaultView 1.3.6.1.4.1.2.3.1.2.1.5 - included -
and then restarted the snmp service:
1) stopsrc -s snmpd
2) startsrc -s snmpd
The same error still appears.
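To rule out a typo or a commented-out entry, the configuration line can be checked mechanically. The sketch below only confirms that an active `VACM_VIEW` line with the OID quoted in this post is present; `has_hacmp_mib_view` and `HACMP_OID` are illustrative names, and the check says nothing about whether the rest of the SNMP setup is correct.

```shell
# OID taken from the VACM_VIEW line quoted in the post.
HACMP_OID='1.3.6.1.4.1.2.3.1.2.1.5'

# Succeed if the given config file (default /etc/snmpdv3.conf) contains an
# uncommented "VACM_VIEW <view> <OID> - included -" line for that OID.
has_hacmp_mib_view() {
    conf=${1:-/etc/snmpdv3.conf}
    awk -v oid="$HACMP_OID" '
        $1 == "VACM_VIEW" && $3 == oid && $5 == "included" { found = 1 }
        END { exit (found ? 0 : 1) }
    ' "$conf"
}

# Usage on a cluster node (not executed here):
#   has_hacmp_mib_view || echo "VACM_VIEW entry missing or commented out"
```

Since the clstat message also lists inactive clinfoES or snmpd subsystems as possible causes, `lssrc -s clinfoES` and `lssrc -s snmpd` are worth running alongside this check.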
Any pointers from the experts would be much appreciated. Thanks!