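Very strange: the two nodes still keep rebooting each other. My system is RHEL 5.4 64-bit on two X3850 X5 machines, with IBM IPMI as the fence device. At the moment, eth0 and the fence interface of both machines are connected to the same switch, and eth1 is connected to a different switch for the external IP.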
/etc/hosts
[root@kms1 ~]# cat /etc/hosts
# Do not remove the following line, or various programs
# that require network functionality will fail.
127.0.0.1 localhost.localdomain localhost
::1 localhost6.localdomain6 localhost6
192.168.170.20 kms1
192.168.170.21 kms2
192.168.170.30 kms1_fence
192.168.170.31 kms2_fence
cluster.conf
[root@kms1 ~]# cat /etc/cluster/cluster.conf
<?xml version="1.0"?>
<cluster alias="kms_rhcs" config_version="8" name="kms_rhcs">
    <fence_daemon post_fail_delay="0" post_join_delay="60"/>
    <clusternodes>
        <clusternode name="kms1" nodeid="1" votes="1">
            <fence>
                <method name="1">
                    <device name="kms1_fence"/>
                </method>
            </fence>
        </clusternode>
        <clusternode name="kms2" nodeid="2" votes="1">
            <fence>
                <method name="1">
                    <device name="kms2_fence"/>
                </method>
            </fence>
        </clusternode>
    </clusternodes>
    <cman expected_votes="1" two_node="1"/>
    <fencedevices>
        <fencedevice agent="fence_ipmilan" auth="" ipaddr="192.168.170.30" login="USERID" name="kms1_fence" passwd="PASSW0RD"/>
        <fencedevice agent="fence_ipmilan" auth="" ipaddr="192.168.170.31" login="USERID" name="kms2_fence" passwd="PASSW0RD"/>
    </fencedevices>
    <rm>
        <failoverdomains>
            <failoverdomain name="kms_domain" ordered="0" restricted="1">
                <failoverdomainnode name="kms1" priority="1"/>
                <failoverdomainnode name="kms2" priority="1"/>
            </failoverdomain>
        </failoverdomains>
        <resources>
            <ip address="133.0.104.47" monitor_link="1"/>
        </resources>
        <service autostart="1" domain="kms_domain" name="kms_serv">
            <ip ref="133.0.104.47"/>
        </service>
    </rm>
</cluster>
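To rule out a simple mismatch in the fence section, the node-to-fence-device mapping in this cluster.conf can be checked mechanically. The following is only a minimal sketch, assuming Python with the standard library is available. `fence_map` and `is_two_node` are hypothetical helper names, not part of RHCS, and the embedded XML is a trimmed copy of the file above with the credentials left out.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the cluster.conf above (login/passwd omitted on purpose).
CLUSTER_CONF = """<?xml version="1.0"?>
<cluster alias="kms_rhcs" config_version="8" name="kms_rhcs">
  <clusternodes>
    <clusternode name="kms1" nodeid="1" votes="1">
      <fence><method name="1"><device name="kms1_fence"/></method></fence>
    </clusternode>
    <clusternode name="kms2" nodeid="2" votes="1">
      <fence><method name="1"><device name="kms2_fence"/></method></fence>
    </clusternode>
  </clusternodes>
  <cman expected_votes="1" two_node="1"/>
  <fencedevices>
    <fencedevice agent="fence_ipmilan" ipaddr="192.168.170.30" name="kms1_fence"/>
    <fencedevice agent="fence_ipmilan" ipaddr="192.168.170.31" name="kms2_fence"/>
  </fencedevices>
</cluster>"""


def fence_map(conf_xml):
    """Return {node name: [(fence device name, ipaddr or None), ...]}.

    An ipaddr of None means the node references a device name that has
    no matching <fencedevice> definition.
    """
    root = ET.fromstring(conf_xml)
    devices = {d.get("name"): d.get("ipaddr") for d in root.iter("fencedevice")}
    result = {}
    for node in root.iter("clusternode"):
        refs = [dev.get("name") for dev in node.iter("device")]
        result[node.get("name")] = [(name, devices.get(name)) for name in refs]
    return result


def is_two_node(conf_xml):
    """True if <cman> carries the two-node settings used in this file."""
    cman = ET.fromstring(conf_xml).find("cman")
    return (
        cman is not None
        and cman.get("two_node") == "1"
        and cman.get("expected_votes") == "1"
    )


if __name__ == "__main__":
    for node, devs in fence_map(CLUSTER_CONF).items():
        print(node, devs)
    print("two_node mode:", is_two_node(CLUSTER_CONF))
```

For this file each node resolves to its own IPMI address (kms1 to 192.168.170.30, kms2 to 192.168.170.31), so the fence definitions themselves look consistent with /etc/hosts.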
tail -f /var/log/messages
Dec 6 13:16:23 kms1 openais[8161]: [CMAN ] CMAN 2.0.115 (built Aug 5 2009 08:24:57) started
Dec 6 13:16:23 kms1 openais[8161]: [MAIN ] Service initialized 'openais CMAN membership service 2.01'
Dec 6 13:16:23 kms1 openais[8161]: [SERV ] Service initialized 'openais extended virtual synchrony service'
Dec 6 13:16:23 kms1 openais[8161]: [SERV ] Service initialized 'openais cluster membership service B.01.01'
Dec 6 13:16:23 kms1 openais[8161]: [SERV ] Service initialized 'openais availability management framework B.01.01'
Dec 6 13:16:23 kms1 openais[8161]: [SERV ] Service initialized 'openais checkpoint service B.01.01'
Dec 6 13:16:23 kms1 openais[8161]: [SERV ] Service initialized 'openais event service B.01.01'
Dec 6 13:16:23 kms1 openais[8161]: [SERV ] Service initialized 'openais distributed locking service B.01.01'
Dec 6 13:16:23 kms1 openais[8161]: [SERV ] Service initialized 'openais message service B.01.01'
Dec 6 13:16:23 kms1 openais[8161]: [SERV ] Service initialized 'openais configuration service'
Dec 6 13:16:23 kms1 openais[8161]: [SERV ] Service initialized 'openais cluster closed process group service v1.01'
Dec 6 13:16:23 kms1 openais[8161]: [SERV ] Service initialized 'openais cluster config database access v1.01'
Dec 6 13:16:23 kms1 ccsd[8152]: Initial status:: Quorate
Dec 6 13:16:23 kms1 openais[8161]: [SYNC ] Not using a virtual synchrony filter.
Dec 6 13:16:23 kms1 openais[8161]: [TOTEM] Creating commit token because I am the rep.
Dec 6 13:16:23 kms1 openais[8161]: [TOTEM] Saving state aru 0 high seq received 0
Dec 6 13:16:23 kms1 openais[8161]: [TOTEM] Storing new sequence id for ring 10
Dec 6 13:16:23 kms1 openais[8161]: [TOTEM] entering COMMIT state.
Dec 6 13:16:23 kms1 openais[8161]: [TOTEM] entering RECOVERY state.
Dec 6 13:16:23 kms1 openais[8161]: [TOTEM] position [0] member 192.168.170.20:
Dec 6 13:16:23 kms1 openais[8161]: [TOTEM] previous ring seq 12 rep 192.168.170.20
Dec 6 13:16:23 kms1 openais[8161]: [TOTEM] aru 0 high delivered 0 received flag 1
Dec 6 13:16:23 kms1 openais[8161]: [TOTEM] Did not need to originate any messages in recovery.
Dec 6 13:16:23 kms1 openais[8161]: [TOTEM] Sending initial ORF token
Dec 6 13:16:23 kms1 openais[8161]: [CLM ] CLM CONFIGURATION CHANGE
Dec 6 13:16:23 kms1 openais[8161]: [CLM ] New Configuration:
Dec 6 13:16:23 kms1 openais[8161]: [CLM ] Members Left:
Dec 6 13:16:23 kms1 openais[8161]: [CLM ] Members Joined:
Dec 6 13:16:23 kms1 openais[8161]: [CLM ] CLM CONFIGURATION CHANGE
Dec 6 13:16:23 kms1 openais[8161]: [CLM ] New Configuration:
Dec 6 13:16:23 kms1 openais[8161]: [CLM ] r(0) ip(192.168.170.20)
Dec 6 13:16:23 kms1 openais[8161]: [CLM ] Members Left:
Dec 6 13:16:23 kms1 openais[8161]: [CLM ] Members Joined:
Dec 6 13:16:23 kms1 openais[8161]: [CLM ] r(0) ip(192.168.170.20)
Dec 6 13:16:23 kms1 openais[8161]: [SYNC ] This node is within the primary component and will provide service.
Dec 6 13:16:24 kms1 openais[8161]: [TOTEM] entering OPERATIONAL state.
Dec 6 13:16:24 kms1 openais[8161]: [CMAN ] quorum regained, resuming activity
Dec 6 13:16:24 kms1 openais[8161]: [CLM ] got nodejoin message 192.168.170.20
Dec 6 13:18:10 kms1 fenced[8180]: kms2 not a cluster member after 60 sec post_join_delay
Dec 6 13:18:10 kms1 fenced[8180]: fencing node "kms2"
Dec 6 13:18:24 kms1 fenced[8180]: fence "kms2" success
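The node cannot find the heartbeat, so it rebooted the other machine. Right now I cannot reconfigure the switch or touch it at all. Could everyone please help me look into the cause?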
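The sequence in the log (only one member in the ring, then a fence after post_join_delay) can be pulled out of /var/log/messages with a short script. This is a rough sketch, assuming Python is available on the node. `summarize` is a hypothetical helper, and the sample text is a handful of lines copied from the log above.

```python
import re

# A few lines copied from the log above.
LOG = """Dec 6 13:16:23 kms1 openais[8161]: [CLM ] r(0) ip(192.168.170.20)
Dec 6 13:16:24 kms1 openais[8161]: [CLM ] got nodejoin message 192.168.170.20
Dec 6 13:18:10 kms1 fenced[8180]: kms2 not a cluster member after 60 sec post_join_delay
Dec 6 13:18:10 kms1 fenced[8180]: fencing node "kms2"
Dec 6 13:18:24 kms1 fenced[8180]: fence "kms2" success
"""


def summarize(log_text):
    """Collect ring members, join messages, and fencing events from the log."""
    # Addresses listed in CLM membership lines, e.g. "r(0) ip(192.168.170.20)".
    members = set(re.findall(r"r\(0\) ip\(([\d.]+)\)", log_text))
    # Addresses that announced themselves with a nodejoin message.
    joined = set(re.findall(r"got nodejoin message ([\d.]+)", log_text))
    # Nodes that fenced gave up waiting for, with the delay in seconds.
    never_joined = [
        (name, int(delay))
        for name, delay in re.findall(
            r"(\S+) not a cluster member after (\d+) sec post_join_delay", log_text
        )
    ]
    # Nodes that fenced actually tried to fence.
    fenced = re.findall(r'fencing node "([^"]+)"', log_text)
    return {
        "members": members,
        "joined": joined,
        "never_joined": never_joined,
        "fenced": fenced,
    }


if __name__ == "__main__":
    for key, value in summarize(LOG).items():
        print(key, value)
```

On the lines above this reports 192.168.170.20 as the only member and kms2 as never joined and then fenced, which matches the symptom: kms1 forms a ring by itself and never sees kms2 on the cluster network before the fence fires.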