Chinaunix

标题: heartbeat crm模块如何使用 [打印本页]

作者: applegump    时间: 2012-08-20 17:27
标题: heartbeat crm模块如何使用
现状: 两台apache httpd服务器,各自单独的对外提供服务


需求:希望一号机作为主机,二号机作为备机,使用一个对外的IP地址对外提供服务

正常运行情况下,该IP在一号机上,当一号机宕机,网线断网,或者httpd服务down掉时,

IP漂移到二号机上。

目前我使用heartbeat可以做到当一号机宕机、网线断网时,IP能飘到二号机,但是没法监控到

httpd这个应用的运行情况,网上有些文章介绍了heartbeat 的crm模块,可我不会使用

不知道哪位用过这东西,能给点意见,谢谢


我注意到有些文章说要自己写一个脚本,脚本能支持 start stop status 参数,能返回固定格式的字符串

小弟真是才疏学浅,不知道对于apache httpd 服务而言,这个脚本怎么写,哪位能给个例子吗?


作者: applegump    时间: 2012-08-21 17:21
目前配置了两个节点

kf28-1    IP  192.1.101.128     固定地址   主节点
oradb2-1   IP 192.1.101.12     固定地址   备节点

虚拟IP  192.1.101.212

两台机器上均安装配置好了apache httpd  

在没有配置crm之前 , 两个节点分别做 heartbeat  start ,随后两个节点的httpd服务均能被拉起来,然后192.1.101.212 能被分配到 kf28-1节点上

在kf28-1 上做heartbeat stop之后, 该节点上的httpd能被down掉,随后192.1.101.212 漂移至oradb2-1上


随后再在kf28-1 上做heartbeat start之后, 该节点上的httpd能被启动,随后192.1.101.212能够漂移回到kf28-1上


在oradb2-1上做heartbeat stop之后, 该节点上的httpd能被down掉。

如上所述,表现一切正常。


接着我希望heartbeat能监视httpd的运行情况,希望在某节点的httpd服务down掉之后,另一节点能接管192.1.101.212这个ip

按照网上的资料描述,我自己写了一个符合lsb规范的脚本myhttpd,并参考网上的crm教程做了相关配置,现在发现出了问题,现象描述如下:

1、kf28-1  上启动heartbeat   ,能在一段时候后拉起httpd服务,192.1.101.212这个ip也能分配到这个节点上,

     在kf28-1上使用crm_mon命令查看,结果如下:

      Refresh in 10s...

      ============
      Last updated: Tue Aug 21 16:50:18 2012
      Current DC: kf28-1 (b275747d-b787-43c1-b05e-15a84603ebbf)
      2 Nodes configured.
      1 Resources configured.
       ============

      Node: oradb2-1 (34bd366d-3b86-4cb1-9bb1-b901f0e4e08b): OFFLINE
      Node: kf28-1 (b275747d-b787-43c1-b05e-15a84603ebbf): online

     Resource Group: group_1
                 IPaddr_192_1_101_212        (heartbeat:cf:IPaddr):        Started kf28-1
                  myhttpd_2   (lsb:myhttpd):  Started kf28-1


2、在oradb2-1 上启动heartbeat  ,无法拉起httpd应用,使用crm_mon查看,结果如下:

      Not connected:Refresh in 3s...

      
下面我将两个节点的相关配置文件列示如下

   ha.cf (该文件在两个节点基本相同):

    # ha.cf
    debugfile /var/log/ha-debug
    logfile /var/log/ha-log
    keepalive 2
    deadtime 30
    warntime 10
    initdead 120
    udpport 694
    ucast eth0 192.1.101.12  (oradb2-1节点上这一行写的是  ucast  eth0 192.1.101.12
    auto_failback on
    crm    yes
   node   kf28-1
   node   oradb2-1


authkeys (该文件在两个节点相同):

auth 1
1 crc
#2 sha1 HI!
#3 md5 Hello!


haresources(该文件在两个节点上相同):

kf28-1  192.1.101.212 myhttpd


myhttpd是我写的一个脚本,放在/etc/init.d中 ,内容如下(两个节点均放置了这个脚本):

#!/bin/sh
ARGV="$@"
#echo $ARGV
#echo "------------"
Start="start"
#echo $Start
Stop="stop"
#echo $Stop
Status="status"

if [ "$ARGV" == "$Start" ];then
        echo " starting httpd ..............."
        /users/ems/apache/bin/apachectl start

elif [ "$ARGV" = "$Stop" ];then
        echo " stopping httpd ..............."
        /users/ems/apache/bin/apachectl stop

elif [ "$ARGV" = "$Status" ];then
        http=`ps -A |grep httpd |wc -l`
        two=2
        let "http=http-two"
        if [ $http -gt 0 ];then
                echo "OK."
        else
                echo "stopped"
        fi
else
        echo " can not accept the input parameter......"
fi

该脚本中对status的判断中减去了一个2,是因为在使用status选项启动的时候,即使系统没有启动httpd服务,脚本也报有2个,不知道怎么回事,所以我就强行减去了个2,这样就符合了lsb的规范,哪位知道是什么原因,请不吝赐教。


生成的cib.xml文件分别如下:

kf28-1:  下面有好几个类似的文件

cib.xml内容为:

<cib admin_epoch="0" epoch="4" num_updates="61" generated="true" have_quorum="true" ignore_dtd="false" num_peers="2" cib_feature_revision="1.3" cib-last-written="Tue Aug 21 17:06:18 2012" ccm_transition="1" dc_uuid="b275747d-b787-43c1-b05e-15a84603ebbf">
   <configuration>
     <crm_config>
       <cluster_property_set id="cib-bootstrap-options">
         <attributes>
           <nvpair id="cib-bootstrap-options-symmetric-cluster" name="symmetric-cluster" value="true"/>
           <nvpair id="cib-bootstrap-options-no_quorum-policy" name="no_quorum-policy" value="stop"/>
           <nvpair id="cib-bootstrap-options-default-resource-stickiness" name="default-resource-stickiness" value="0"/>
           <nvpair id="cib-bootstrap-options-default-resource-failure-stickiness" name="default-resource-failure-stickiness" value="0"/>
           <nvpair id="cib-bootstrap-options-stonith-enabled" name="stonith-enabled" value="false"/>
           <nvpair id="cib-bootstrap-options-stonith-action" name="stonith-action" value="reboot"/>
           <nvpair id="cib-bootstrap-options-stop-orphan-resources" name="stop-orphan-resources" value="true"/>
           <nvpair id="cib-bootstrap-options-stop-orphan-actions" name="stop-orphan-actions" value="true"/>
           <nvpair id="cib-bootstrap-options-remove-after-stop" name="remove-after-stop" value="false"/>
           <nvpair id="cib-bootstrap-options-short-resource-names" name="short-resource-names" value="true"/>
           <nvpair id="cib-bootstrap-options-transition-idle-timeout" name="transition-idle-timeout" value="5min"/>
           <nvpair id="cib-bootstrap-options-default-action-timeout" name="default-action-timeout" value="5s"/>
           <nvpair id="cib-bootstrap-options-is-managed-default" name="is-managed-default" value="true"/>
         </attributes>
       </cluster_property_set>
     </crm_config>
     <nodes>
       <node id="34bd366d-3b86-4cb1-9bb1-b901f0e4e08b" uname="oradb2-1" type="normal"/>
       <node id="b275747d-b787-43c1-b05e-15a84603ebbf" uname="kf28-1" type="normal"/>
     </nodes>
     <resources>
       <group id="group_1">
         <primitive class="ocf" id="IPaddr_192_1_101_212" provider="heartbeat" type="IPaddr">
           <operations>
             <op id="IPaddr_192_1_101_212_mon" interval="5s" name="monitor" timeout="5s"/>
           </operations>
           <instance_attributes id="IPaddr_192_1_101_212_inst_attr">
             <attributes>
               <nvpair id="IPaddr_192_1_101_212_attr_0" name="ip" value="192.1.101.212"/>
             </attributes>
           </instance_attributes>
         </primitive>
         <primitive class="lsb" id="myhttpd_2" provider="heartbeat" type="myhttpd">
           <operations>
             <op id="myhttpd_2_mon" interval="120s" name="monitor" timeout="60s"/>
           </operations>
         </primitive>
       </group>
     </resources>
     <constraints>
       <rsc_location id="rsc_location_group_1" rsc="group_1">
         <rule id="prefered_location_group_1" score="100">
           <expression attribute="#uname" id="prefered_location_group_1_expr" operation="eq" value="kf28-1"/>
         </rule>
       </rsc_location>
     </constraints>
</configuration>
</cib>




























作者: applegump    时间: 2012-08-21 17:22
cib.xml.last内容为:

<cib admin_epoch="0" epoch="4" num_updates="61" generated="true" have_quorum="true" ignore_dtd="false" num_peers="2" cib_feature_revision="1.3" cib-last-written="Tue Aug 21 17:06:15 2012" ccm_transition="1" dc_uuid="b275747d-b787-43c1-b05e-15a84603ebbf">
   <configuration>
     <crm_config>
       <cluster_property_set id="cib-bootstrap-options">
         <attributes>
           <nvpair id="cib-bootstrap-options-symmetric-cluster" name="symmetric-cluster" value="true"/>
           <nvpair id="cib-bootstrap-options-no_quorum-policy" name="no_quorum-policy" value="stop"/>
           <nvpair id="cib-bootstrap-options-default-resource-stickiness" name="default-resource-stickiness" value="0"/>
           <nvpair id="cib-bootstrap-options-default-resource-failure-stickiness" name="default-resource-failure-stickiness" value="0"/>
           <nvpair id="cib-bootstrap-options-stonith-enabled" name="stonith-enabled" value="false"/>
           <nvpair id="cib-bootstrap-options-stonith-action" name="stonith-action" value="reboot"/>
           <nvpair id="cib-bootstrap-options-stop-orphan-resources" name="stop-orphan-resources" value="true"/>
           <nvpair id="cib-bootstrap-options-stop-orphan-actions" name="stop-orphan-actions" value="true"/>
           <nvpair id="cib-bootstrap-options-remove-after-stop" name="remove-after-stop" value="false"/>
           <nvpair id="cib-bootstrap-options-short-resource-names" name="short-resource-names" value="true"/>
           <nvpair id="cib-bootstrap-options-transition-idle-timeout" name="transition-idle-timeout" value="5min"/>
           <nvpair id="cib-bootstrap-options-default-action-timeout" name="default-action-timeout" value="5s"/>
           <nvpair id="cib-bootstrap-options-is-managed-default" name="is-managed-default" value="true"/>
         </attributes>
       </cluster_property_set>
     </crm_config>
     <nodes>
       <node id="34bd366d-3b86-4cb1-9bb1-b901f0e4e08b" uname="oradb2-1" type="normal"/>
       <node id="b275747d-b787-43c1-b05e-15a84603ebbf" uname="kf28-1" type="normal"/>
     </nodes>
     <resources>
       <group id="group_1">
         <primitive class="ocf" id="IPaddr_192_1_101_212" provider="heartbeat" type="IPaddr">
           <operations>
             <op id="IPaddr_192_1_101_212_mon" interval="5s" name="monitor" timeout="5s"/>
           </operations>
           <instance_attributes id="IPaddr_192_1_101_212_inst_attr">
             <attributes>
               <nvpair id="IPaddr_192_1_101_212_attr_0" name="ip" value="192.1.101.212"/>
             </attributes>
           </instance_attributes>
         </primitive>
         <primitive class="lsb" id="myhttpd_2" provider="heartbeat" type="myhttpd">
           <operations>
             <op id="myhttpd_2_mon" interval="120s" name="monitor" timeout="60s"/>
           </operations>
         </primitive>
       </group>
     </resources>
     <constraints>
       <rsc_location id="rsc_location_group_1" rsc="group_1">
         <rule id="prefered_location_group_1" score="100">
           <expression attribute="#uname" id="prefered_location_group_1_expr" operation="eq" value="kf28-1"/>
         </rule>
       </rsc_location>
     </constraints>
</configuration>
</cib>
作者: applegump    时间: 2012-08-21 17:24
cib.xml.sig内容为:
cef71bb53253e460c4fd704f70c80258

cib.xml.sig.last内容为:
fd885cb2a132302679fdc28d0af89dc3

oradb2-1中只有一个cib.xml文件,内容为:

?xml version="1.0" ?>
<cib admin_epoch="0" epoch="0" num_updates="0">
        <configuration>
                <crm_config>
                        <cluster_property_set id="cib-bootstrap-options">
                                <attributes>
                                        <nvpair id="cib-bootstrap-options-symmetric-cluster" name="symmetric-cluster" value="true"/>
                                        <nvpair id="cib-bootstrap-options-no_quorum-policy" name="no_quorum-policy" value="stop"/>
                                        <nvpair id="cib-bootstrap-options-default-resource-stickiness" name="default-resource-stickiness" value="0"/>
                                        <nvpair id="cib-bootstrap-options-default-resource-failure-stickiness" name="default-resource-failure-stickiness" value="0"/>
                                        <nvpair id="cib-bootstrap-options-stonith-enabled" name="stonith-enabled" value="false"/>
                                        <nvpair id="cib-bootstrap-options-stonith-action" name="stonith-action" value="reboot"/>
                                        <nvpair id="cib-bootstrap-options-stop-orphan-resources" name="stop-orphan-resources" value="true"/>
                                        <nvpair id="cib-bootstrap-options-stop-orphan-actions" name="stop-orphan-actions" value="true"/>
                                        <nvpair id="cib-bootstrap-options-remove-after-stop" name="remove-after-stop" value="false"/>
                                        <nvpair id="cib-bootstrap-options-short-resource-names" name="short-resource-names" value="true"/>
                                        <nvpair id="cib-bootstrap-options-transition-idle-timeout" name="transition-idle-timeout" value="5min"/>
                                        <nvpair id="cib-bootstrap-options-default-action-timeout" name="default-action-timeout" value="5s"/>
                                        <nvpair id="cib-bootstrap-options-is-managed-default" name="is-managed-default" value="true"/>
                                </attributes>
                        </cluster_property_set>
                </crm_config>
                <nodes/>
                <resources>
                        <group id="group_1">
                                <primitive class="ocf" id="IPaddr_192_1_101_212" provider="heartbeat" type="IPaddr">
                                        <operations>
                                                <op id="IPaddr_192_1_101_212_mon" interval="5s" name="monitor" timeout="5s"/>
                                        </operations>
                                        <instance_attributes id="IPaddr_192_1_101_212_inst_attr">
                                                <attributes>
                                                        <nvpair id="IPaddr_192_1_101_212_attr_0" name="ip" value="192.1.101.212"/>
                                                </attributes>
                                        </instance_attributes>
                                </primitive>
                                <primitive class="lsb" id="myhttpd_2" provider="heartbeat" type="myhttpd">
                                        <operations>
                                                <op id="myhttpd_2_mon" interval="120s" name="monitor" timeout="60s"/>
                                        </operations>
                                </primitive>
                        </group>
                </resources>
                <constraints>
                        <rsc_location id="rsc_location_group_1" rsc="group_1">
                                     <rule id="prefered_location_group_1" score="100">
                                        <expression attribute="#uname" id="prefered_location_group_1_expr" operation="eq" value="kf28-1"/>                          
                                </rule>
                        </rsc_location>
                </constraints>
        </configuration>
        <status/>
</cib>



作者: applegump    时间: 2012-08-21 17:27
oradb2-1的日志文件/var/log/ha-debug内容如下(由于比较多,我只列出有代表性的几段)


......................

cib[9163]: 2012/08/21_17:25:28 WARN: init_start: CCM Activation failed
cib[9163]: 2012/08/21_17:25:28 ERROR: init_start: CCM Activation failed 30 (max) times
cib[9163]: 2012/08/21_17:25:28 ERROR: init_start: Couldnt start all communication channels, exiting.
mgmtd[8963]: 2012/08/21_17:25:28 ERROR: cib_native_signon: No reply message - disconnected - 0
mgmtd[8963]: 2012/08/21_17:25:28 info: login to cib: 3, ret:-3
heartbeat[26040]: 2012/08/21_17:25:28 info: Exiting /usr/local/lib/heartbeat/cib process 9163 returned rc 0.
heartbeat[26040]: 2012/08/21_17:25:28 ERROR: Respawning client "/usr/local/lib/heartbeat/cib":
crmd[6397]: 2012/08/21_17:25:28 ERROR: cib_native_signon: No reply message - disconnected - 0
heartbeat[26040]: 2012/08/21_17:25:28 info: Starting child client "/usr/local/lib/heartbeat/cib" (503,501)
crmd[6397]: 2012/08/21_17:25:28 WARN: cib_native_signon: Connection to CIB failed: not connected
crmd[6397]: 2012/08/21_17:25:28 WARN: do_cib_control: Couldn't complete CIB registration 20 times... pause and retry
crmd[6397]: 2012/08/21_17:25:28 WARN: crm_fsa_trigger: FSA took 29220ms to complete
heartbeat[9414]: 2012/08/21_17:25:28 info: Starting "/usr/local/lib/heartbeat/cib" as uid 503  gid 501 (pid 9414)
cib[9414]: 2012/08/21_17:25:28 info: G_main_add_SignalHandler: Added signal handler for signal 15
cib[9414]: 2012/08/21_17:25:28 info: G_main_add_TriggerHandler: Added signal manual handler
cib[9414]: 2012/08/21_17:25:28 info: G_main_add_SignalHandler: Added signal handler for signal 17
cib[9414]: 2012/08/21_17:25:28 info: main: Retrieval of a per-action CIB: disabled
cib[9414]: 2012/08/21_17:25:28 info: cib_register_ha: Signing in with Heartbeat
cib[9414]: 2012/08/21_17:25:29 info: cib_register_ha: FSA Hostname: oradb2-1
cib[9414]: 2012/08/21_17:25:29 info: readCibXmlFile: Reading cluster configuration from: /usr/local/var/lib/heartbeat/crm/cib.xml
cib[9414]: 2012/08/21_17:25:29 WARN: validate_cib_digest: No on-disk digest present
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk] <cib admin_epoch="0" epoch="0" num_updates="0" generated="false" have_quorum="false">
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]   <configuration>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]     <crm_config>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]       <cluster_property_set id="cib-bootstrap-options">
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]         <attributes>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-symmetric-cluster" name="symmetric-cluster" value="true"/>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-no_quorum-policy" name="no_quorum-policy" value="stop"/>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-resource-stickiness" name="default-resource-stickiness" value="0"/>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-resource-failure-stickiness" name="default-resource-failure-stickiness" value="0"/>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stonith-enabled" name="stonith-enabled" value="false"/>
crmd[6397]: 2012/08/21_17:25:29 info: crm_timer_popped: Wait Timer (I_NULL) just popped!
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stonith-action" name="stonith-action" value="reboot"/>
crmd[6397]: 2012/08/21_17:25:29 WARN: cib_native_signon: Connection to CIB failed: connection failed
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stop-orphan-resources" name="stop-orphan-resources" value="true"/>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stop-orphan-actions" name="stop-orphan-actions" value="true"/>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-remove-after-stop" name="remove-after-stop" value="false"/>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-short-resource-names" name="short-resource-names" value="true"/>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-transition-idle-timeout" name="transition-idle-timeout" value="5min"/>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-action-timeout" name="default-action-timeout" value="5s"/>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-is-managed-default" name="is-managed-default" value="true"/>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]         </attributes>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]       </cluster_property_set>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]     </crm_config>
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]     <nodes/>
mgmtd[8963]: 2012/08/21_17:25:29 info: login to cib: 4, ret:-10
cib[9414]: 2012/08/21_17:25:29 info: log_data_element: readCibXmlFile: [on-disk]     <resources>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]       <group id="group_1">
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]         <primitive class="ocf" id="IPaddr_192_1_101_212" provider="heartbeat" type="IPaddr">
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]           <operations>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]             <op id="IPaddr_192_1_101_212_mon" interval="5s" name="monitor" timeout="5s"/>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]           </operations>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]           <instance_attributes id="IPaddr_192_1_101_212_inst_attr">
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]             <attributes>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]               <nvpair id="IPaddr_192_1_101_212_attr_0" name="ip" value="192.1.101.212"/>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]             </attributes>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]           </instance_attributes>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]         </primitive>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]         <primitive class="lsb" id="myhttpd_2" provider="heartbeat" type="myhttpd">
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]           <operations>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]             <op id="myhttpd_2_mon" interval="120s" name="monitor" timeout="60s"/>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]           </operations>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]         </primitive>
crmd[6397]: 2012/08/21_17:25:30 WARN: cib_native_signon: Connection to CIB failed: connection failed
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]       </group>
crmd[6397]: 2012/08/21_17:25:30 WARN: do_cib_control: Couldn't complete CIB registration 21 times... pause and retry
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]     </resources>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]     <constraints>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]       <rsc_location id="rsc_location_group_1" rsc="group_1">
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]         <rule id="prefered_location_group_1" score="100">
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]           <expression attribute="#uname" id="prefered_location_group_1_expr" operation="eq" value="kf28-1"/>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]         </rule>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]       </rsc_location>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]     </constraints>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]   </configuration>
cib[9414]: 2012/08/21_17:25:30 info: log_data_element: readCibXmlFile: [on-disk]   <status/>
mgmtd[8963]: 2012/08/21_17:25:31 info: login to cib failed
cib[9414]: 2012/08/21_17:25:31 info: log_data_element: readCibXmlFile: [on-disk] </cib>
mgmtd[8963]: 2012/08/21_17:25:31 ERROR: Can't initialize management library.Shutting down.(-1)
heartbeat[26040]: 2012/08/21_17:25:31 WARN: Exiting /usr/local/lib/heartbeat/mgmtd -v process 8963 returned rc 1.
heartbeat[26040]: 2012/08/21_17:25:31 ERROR: Respawning client "/usr/local/lib/heartbeat/mgmtd -v":
heartbeat[26040]: 2012/08/21_17:25:31 info: Starting child client "/usr/local/lib/heartbeat/mgmtd -v" (0,0)
heartbeat[9466]: 2012/08/21_17:25:31 info: Starting "/usr/local/lib/heartbeat/mgmtd -v" as uid 0  gid 0 (pid 9466)
cib[9414]: 2012/08/21_17:25:31 notice: readCibXmlFile: Enabling DTD validation on the existing (sane) configuration
mgmtd[9466]: 2012/08/21_17:25:31 info: G_main_add_SignalHandler: Added signal handler for signal 15
cib[9414]: 2012/08/21_17:25:31 info: startCib: CIB Initialization completed successfully
crmd[6397]: 2012/08/21_17:25:31 info: crm_timer_popped: Wait Timer (I_NULL) just popped!
mgmtd[9466]: 2012/08/21_17:25:31 debug: Enabling coredumps
cib[9414]: 2012/08/21_17:25:31 WARN: init_start: CCM Activation failed
mgmtd[9466]: 2012/08/21_17:25:31 info: G_main_add_SignalHandler: Added signal handler for signal 10
cib[9414]: 2012/08/21_17:25:31 WARN: init_start: CCM Connection failed 1 times (30 max)
mgmtd[9466]: 2012/08/21_17:25:31 info: G_main_add_SignalHandler: Added signal handler for signal 12
mgmtd[9466]: 2012/08/21_17:25:31 info: init_crm
cib[9414]: 2012/08/21_17:25:32 WARN: init_start: CCM Activation failed
cib[9414]: 2012/08/21_17:25:32 WARN: init_start: CCM Connection failed 2 times (30 max)
cib[9414]: 2012/08/21_17:25:33 WARN: init_start: CCM Activation failed
cib[9414]: 2012/08/21_17:25:33 WARN: init_start: CCM Connection failed 3 times (30 max)
cib[9414]: 2012/08/21_17:25:34 WARN: init_start: CCM Activation failed
cib[9414]: 2012/08/21_17:25:34 WARN: init_start: CCM Connection failed 4 times (30 max)
cib[9414]: 2012/08/21_17:25:35 WARN: init_start: CCM Activation failed
cib[9414]: 2012/08/21_17:25:35 WARN: init_start: CCM Connection failed 5 times (30 max)
cib[9414]: 2012/08/21_17:25:36 WARN: init_start: CCM Activation failed
cib[9414]: 2012/08/21_17:25:36 WARN: init_start: CCM Connection failed 6 times (30 max)
cib[9414]: 2012/08/21_17:25:37 WARN: init_start: CCM Activation failed
cib[9414]: 2012/08/21_17:25:37 WARN: init_start: CCM Connection failed 7 times (30 max)
cib[9414]: 2012/08/21_17:25:38 WARN: init_start: CCM Activation failed
cib[9414]: 2012/08/21_17:25:38 WARN: init_start: CCM Connection failed 8 times (30 max)
cib[9414]: 2012/08/21_17:25:39 WARN: init_start: CCM Activation failed
cib[9414]: 2012/08/21_17:25:39 WARN: init_start: CCM Connection failed 9 times (30 max)
cib[9414]: 2012/08/21_17:25:40 WARN: init_start: CCM Activation failed
cib[9414]: 2012/08/21_17:25:40 WARN: init_start: CCM Connection failed 10 times (30 max)
cib[9414]: 2012/08/21_17:25:41 WARN: init_start: CCM Activation failed
cib[9414]: 2012/08/21_17:25:41 WARN: init_start: CCM Connection failed 11 times (30 max)
cib[9414]: 2012/08/21_17:25:42 WARN: init_start: CCM Activation failed
cib[9414]: 2012/08/21_17:25:42 WARN: init_start: CCM Connection failed 12 times (30 max)
cib[9414]: 2012/08/21_17:25:43 WARN: init_start: CCM Activation failed
cib[9414]: 2012/08/21_17:25:43 WARN: init_start: CCM Connection failed 13 times (30 max)
cib[9414]: 2012/08/21_17:25:44 WARN: init_start: CCM Activation failed
cib[9414]: 2012/08/21_17:25:44 WARN: init_start: CCM Connection failed 14 times (30 max)
cib[9414]: 2012/08/21_17:25:45 WARN: init_start: CCM Activation failed
作者: applegump    时间: 2012-08-21 17:29
cib[8631]: 2012/08/21_17:24:25 ERROR: init_start: Couldnt start all communication channels, exiting.
crmd[6397]: 2012/08/21_17:24:25 ERROR: cib_native_signon: No reply message - disconnected - 0
heartbeat[26040]: 2012/08/21_17:24:25 info: Exiting /usr/local/lib/heartbeat/cib process 8631 returned rc 0.
crmd[6397]: 2012/08/21_17:24:25 WARN: cib_native_signon: Connection to CIB failed: not connected
heartbeat[26040]: 2012/08/21_17:24:25 ERROR: Respawning client "/usr/local/lib/heartbeat/cib":
heartbeat[26040]: 2012/08/21_17:24:25 info: Starting child client "/usr/local/lib/heartbeat/cib" (503,501)
mgmtd[8386]: 2012/08/21_17:24:25 ERROR: cib_native_signon: No reply message - disconnected - 0
mgmtd[8386]: 2012/08/21_17:24:25 info: login to cib: 3, ret:-3
heartbeat[8918]: 2012/08/21_17:24:25 info: Starting "/usr/local/lib/heartbeat/cib" as uid 503  gid 501 (pid 891
cib[8918]: 2012/08/21_17:24:25 info: G_main_add_SignalHandler: Added signal handler for signal 15
cib[8918]: 2012/08/21_17:24:25 info: G_main_add_TriggerHandler: Added signal manual handler
cib[8918]: 2012/08/21_17:24:25 info: G_main_add_SignalHandler: Added signal handler for signal 17
cib[8918]: 2012/08/21_17:24:25 info: main: Retrieval of a per-action CIB: disabled
cib[8918]: 2012/08/21_17:24:25 info: cib_register_ha: Signing in with Heartbeat
cib[8918]: 2012/08/21_17:24:25 info: cib_register_ha: FSA Hostname: oradb2-1
cib[8918]: 2012/08/21_17:24:25 info: readCibXmlFile: Reading cluster configuration from: /usr/local/var/lib/heartbeat/crm/cib.xml
cib[8918]: 2012/08/21_17:24:26 WARN: validate_cib_digest: No on-disk digest present
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk] <cib admin_epoch="0" epoch="0" num_updates="0" generated="false" have_quorum="false">
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]   <configuration>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]     <crm_config>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]       <cluster_property_set id="cib-bootstrap-options">
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]         <attributes>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-symmetric-cluster" name="symmetric-cluster" value="true"/>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-no_quorum-policy" name="no_quorum-policy" value="stop"/>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-resource-stickiness" name="default-resource-stickiness" value="0"/>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-resource-failure-stickiness" name="default-resource-failure-stickiness" value="0"/>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stonith-enabled" name="stonith-enabled" value="false"/>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stonith-action" name="stonith-action" value="reboot"/>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stop-orphan-resources" name="stop-orphan-resources" value="true"/>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stop-orphan-actions" name="stop-orphan-actions" value="true"/>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-remove-after-stop" name="remove-after-stop" value="false"/>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-short-resource-names" name="short-resource-names" value="true"/>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-transition-idle-timeout" name="transition-idle-timeout" value="5min"/>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-action-timeout" name="default-action-timeout" value="5s"/>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-is-managed-default" name="is-managed-default" value="true"/>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]         </attributes>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]       </cluster_property_set>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]     </crm_config>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]     <nodes/>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]     <resources>
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]       <group id="group_1">
cib[8918]: 2012/08/21_17:24:26 info: log_data_element: readCibXmlFile: [on-disk]         <primitive class="ocf" id="IPaddr_192_1_101_212" provider="heartbeat" type="IPaddr">
crmd[6397]: 2012/08/21_17:24:26 WARN: cib_native_signon: Connection to CIB failed: connection failed
mgmtd[8386]: 2012/08/21_17:24:26 info: login to cib: 4, ret:-10
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]           <operations>
crmd[6397]: 2012/08/21_17:24:27 WARN: do_cib_control: Couldn't complete CIB registration 17 times... pause and retry
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]             <op id="IPaddr_192_1_101_212_mon" interval="5s" name="monitor" timeout="5s"/>
crmd[6397]: 2012/08/21_17:24:27 ERROR: crm_fsa_trigger: FSA took 30120ms to complete
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]           </operations>
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]           <instance_attributes id="IPaddr_192_1_101_212_inst_attr">
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]             <attributes>
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]               <nvpair id="IPaddr_192_1_101_212_attr_0" name="ip" value="192.1.101.212"/>
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]             </attributes>
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]           </instance_attributes>
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]         </primitive>
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]         <primitive class="lsb" id="myhttpd_2" provider="heartbeat" type="myhttpd">
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]           <operations>
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]             <op id="myhttpd_2_mon" interval="120s" name="monitor" timeout="60s"/>
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]           </operations>
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]         </primitive>
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]       </group>
crmd[6397]: 2012/08/21_17:24:27 info: crm_timer_popped: Wait Timer (I_NULL) just popped!
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]     </resources>
crmd[6397]: 2012/08/21_17:24:27 WARN: cib_native_signon: Connection to CIB failed: connection failed
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]     <constraints>
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]       <rsc_location id="rsc_location_group_1" rsc="group_1">
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]         <rule id="prefered_location_group_1" score="100">
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]           <expression attribute="#uname" id="prefered_location_group_1_expr" operation="eq" value="kf28-1"/>
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]         </rule>
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]       </rsc_location>
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]     </constraints>
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]   </configuration>
cib[8918]: 2012/08/21_17:24:27 info: log_data_element: readCibXmlFile: [on-disk]   <status/>
cib[8918]: 2012/08/21_17:24:28 info: log_data_element: readCibXmlFile: [on-disk] </cib>
mgmtd[8386]: 2012/08/21_17:24:28 info: login to cib failed
cib[8918]: 2012/08/21_17:24:28 notice: readCibXmlFile: Enabling DTD validation on the existing (sane) configuration
mgmtd[8386]: 2012/08/21_17:24:28 ERROR: Can't initialize management library.Shutting down.(-1)
heartbeat[26040]: 2012/08/21_17:24:28 WARN: Exiting /usr/local/lib/heartbeat/mgmtd -v process 8386 returned rc 1.
heartbeat[26040]: 2012/08/21_17:24:28 ERROR: Respawning client "/usr/local/lib/heartbeat/mgmtd -v":
heartbeat[26040]: 2012/08/21_17:24:28 info: Starting child client "/usr/local/lib/heartbeat/mgmtd -v" (0,0)
heartbeat[8963]: 2012/08/21_17:24:28 info: Starting "/usr/local/lib/heartbeat/mgmtd -v" as uid 0  gid 0 (pid 8963)
cib[8918]: 2012/08/21_17:24:28 info: startCib: CIB Initialization completed successfully
mgmtd[8963]: 2012/08/21_17:24:28 info: G_main_add_SignalHandler: Added signal handler for signal 15
cib[8918]: 2012/08/21_17:24:28 WARN: init_start: CCM Activation failed
mgmtd[8963]: 2012/08/21_17:24:28 debug: Enabling coredumps
cib[8918]: 2012/08/21_17:24:28 WARN: init_start: CCM Connection failed 1 times (30 max)
mgmtd[8963]: 2012/08/21_17:24:28 info: G_main_add_SignalHandler: Added signal handler for signal 10
mgmtd[8963]: 2012/08/21_17:24:28 info: G_main_add_SignalHandler: Added signal handler for signal 12
mgmtd[8963]: 2012/08/21_17:24:28 info: init_crm
cib[8918]: 2012/08/21_17:24:29 WARN: init_start: CCM Activation failed
cib[8918]: 2012/08/21_17:24:29 WARN: init_start: CCM Connection failed 2 times (30 max)
作者: applegump    时间: 2012-08-21 17:38
下面是一个完整的错误日志:

heartbeat[13720]: 2012/08/21_17:34:10 WARN: File /usr/local/etc/ha.d/haresources exists.
heartbeat[13720]: 2012/08/21_17:34:10 WARN: This file is not used because crm is enabled
heartbeat[13720]: 2012/08/21_17:34:11 WARN: Logging daemon is disabled --enabling logging daemon is recommended
heartbeat[13720]: 2012/08/21_17:34:11 info: **************************
heartbeat[13720]: 2012/08/21_17:34:11 info: Configuration validated. Starting heartbeat 2.0.8
heartbeat[13722]: 2012/08/21_17:34:11 info: heartbeat: version 2.0.8
heartbeat[13722]: 2012/08/21_17:34:11 info: Heartbeat generation: 6
heartbeat[13722]: 2012/08/21_17:34:11 info: G_main_add_TriggerHandler: Added signal manual handler
heartbeat[13722]: 2012/08/21_17:34:11 info: G_main_add_TriggerHandler: Added signal manual handler
heartbeat[13722]: 2012/08/21_17:34:11 info: Removing /usr/local/var/run/heartbeat/rsctmp failed, recreating.
heartbeat[13722]: 2012/08/21_17:34:11 info: glib: ucast: write socket priority set to IPTOS_LOWDELAY on eth0
heartbeat[13722]: 2012/08/21_17:34:11 info: glib: ucast: bound send socket to device: eth0
heartbeat[13722]: 2012/08/21_17:34:11 info: glib: ucast: bound receive socket to device: eth0
heartbeat[13722]: 2012/08/21_17:34:11 info: glib: ucast: started on port 694 interface eth0 to 192.1.101.128
heartbeat[13722]: 2012/08/21_17:34:11 info: G_main_add_SignalHandler: Added signal handler for signal 17
heartbeat[13722]: 2012/08/21_17:34:11 info: Local status now set to: 'up'
heartbeat[13722]: 2012/08/21_17:34:12 info: Link kf28-1:eth0 up.
heartbeat[13722]: 2012/08/21_17:34:12 info: Status update for node kf28-1: status active
heartbeat[13722]: 2012/08/21_17:34:13 info: Comm_now_up(): updating status to active
heartbeat[13722]: 2012/08/21_17:34:13 info: Local status now set to: 'active'
heartbeat[13722]: 2012/08/21_17:34:13 info: Starting child client "/usr/local/lib/heartbeat/ccm" (503,501)
heartbeat[13722]: 2012/08/21_17:34:13 info: Starting child client "/usr/local/lib/heartbeat/cib" (503,501)
heartbeat[13722]: 2012/08/21_17:34:13 info: Starting child client "/usr/local/lib/heartbeat/lrmd -r" (0,0)
heartbeat[13722]: 2012/08/21_17:34:13 info: Starting child client "/usr/local/lib/heartbeat/stonithd" (0,0)
heartbeat[13740]: 2012/08/21_17:34:13 info: Starting "/usr/local/lib/heartbeat/cib" as uid 503  gid 501 (pid 13740)
heartbeat[13722]: 2012/08/21_17:34:13 info: Starting child client "/usr/local/lib/heartbeat/attrd" (503,501)
heartbeat[13722]: 2012/08/21_17:34:13 info: Starting child client "/usr/local/lib/heartbeat/crmd" (503,501)
heartbeat[13739]: 2012/08/21_17:34:13 info: Starting "/usr/local/lib/heartbeat/ccm" as uid 503  gid 501 (pid 13739)
heartbeat[13722]: 2012/08/21_17:34:13 info: Starting child client "/usr/local/lib/heartbeat/mgmtd -v" (0,0)
heartbeat[13742]: 2012/08/21_17:34:13 info: Starting "/usr/local/lib/heartbeat/stonithd" as uid 0  gid 0 (pid 13742)
heartbeat[13741]: 2012/08/21_17:34:13 info: Starting "/usr/local/lib/heartbeat/lrmd -r" as uid 0  gid 0 (pid 13741)
heartbeat[13722]: 2012/08/21_17:34:13 WARN: G_CH_dispatch_int: Dispatch function for read child took too long to execute: 340 ms (> 50 ms) (GSource: 0x872472
heartbeat[13743]: 2012/08/21_17:34:13 info: Starting "/usr/local/lib/heartbeat/attrd" as uid 503  gid 501 (pid 13743)
heartbeat[13744]: 2012/08/21_17:34:13 info: Starting "/usr/local/lib/heartbeat/crmd" as uid 503  gid 501 (pid 13744)
heartbeat[13745]: 2012/08/21_17:34:13 info: Starting "/usr/local/lib/heartbeat/mgmtd -v" as uid 0  gid 0 (pid 13745)
cib[13740]: 2012/08/21_17:34:13 info: G_main_add_SignalHandler: Added signal handler for signal 15
cib[13740]: 2012/08/21_17:34:13 info: G_main_add_TriggerHandler: Added signal manual handler
cib[13740]: 2012/08/21_17:34:13 info: G_main_add_SignalHandler: Added signal handler for signal 17
cib[13740]: 2012/08/21_17:34:13 info: main: Retrieval of a per-action CIB: disabled
cib[13740]: 2012/08/21_17:34:13 info: cib_register_ha: Signing in with Heartbeat
lrmd[13741]: 2012/08/21_17:34:13 info: G_main_add_SignalHandler: Added signal handler for signal 15
cib[13740]: 2012/08/21_17:34:13 info: cib_register_ha: FSA Hostname: oradb2-1
cib[13740]: 2012/08/21_17:34:13 info: readCibXmlFile: Reading cluster configuration from: /usr/local/var/lib/heartbeat/crm/cib.xml
cib[13740]: 2012/08/21_17:34:13 WARN: validate_cib_digest: No on-disk digest present
cib[13740]: 2012/08/21_17:34:13 info: log_data_element: readCibXmlFile: [on-disk] <cib admin_epoch="0" epoch="0" num_updates="0" generated="false" have_quorum="false">
cib[13740]: 2012/08/21_17:34:13 info: log_data_element: readCibXmlFile: [on-disk]   <configuration>
cib[13740]: 2012/08/21_17:34:13 info: log_data_element: readCibXmlFile: [on-disk]     <crm_config>
cib[13740]: 2012/08/21_17:34:13 info: log_data_element: readCibXmlFile: [on-disk]       <cluster_property_set id="cib-bootstrap-options">
mgmtd[13745]: 2012/08/21_17:34:13 info: G_main_add_SignalHandler: Added signal handler for signal 15
attrd[13743]: 2012/08/21_17:34:13 info: G_main_add_SignalHandler: Added signal handler for signal 15
stonithd[13742]: 2012/08/21_17:34:13 info: G_main_add_SignalHandler: Added signal handler for signal 10
lrmd[13741]: 2012/08/21_17:34:13 info: G_main_add_SignalHandler: Added signal handler for signal 17
crmd[13744]: 2012/08/21_17:34:13 info: main: CRM Hg Version: 2d298bca0d0a:STABLE-2.0.8

cib[13740]: 2012/08/21_17:34:13 info: log_data_element: readCibXmlFile: [on-disk]         <attributes>
mgmtd[13745]: 2012/08/21_17:34:13 debug: Enabling coredumps
ccm[13739]: 2012/08/21_17:34:13 info: Hostname: oradb2-1
attrd[13743]: 2012/08/21_17:34:13 info: register_with_ha: Hostname: oradb2-1
stonithd[13742]: 2012/08/21_17:34:13 info: G_main_add_SignalHandler: Added signal handler for signal 12
lrmd[13741]: 2012/08/21_17:34:13 info: G_main_add_SignalHandler: Added signal handler for signal 10
crmd[13744]: 2012/08/21_17:34:13 info: init_start: Starting crmd
cib[13740]: 2012/08/21_17:34:13 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-symmetric-cluster" name="symmetric-cluster" value="true"/>
mgmtd[13745]: 2012/08/21_17:34:13 info: G_main_add_SignalHandler: Added signal handler for signal 10
attrd[13743]: 2012/08/21_17:34:13 info: register_with_ha: UUID: 34bd366d-3b86-4cb1-9bb1-b901f0e4e08b
stonithd[13742]: 2012/08/21_17:34:13 info: Signing in with heartbeat.
lrmd[13741]: 2012/08/21_17:34:13 info: G_main_add_SignalHandler: Added signal handler for signal 12
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-no_quorum-policy" name="no_quorum-policy" value="stop"/>
crmd[13744]: 2012/08/21_17:34:14 info: G_main_add_SignalHandler: Added signal handler for signal 15
mgmtd[13745]: 2012/08/21_17:34:14 info: G_main_add_SignalHandler: Added signal handler for signal 12
attrd[13743]: 2012/08/21_17:34:14 WARN: cib_native_signon: Connection to CIB failed: connection failed
stonithd[13742]: 2012/08/21_17:34:14 notice: /usr/local/lib/heartbeat/stonithd start up successfully.
lrmd[13741]: 2012/08/21_17:34:14 info: Started.
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-resource-stickiness" name="default-resource-stickiness" value="0"/>
crmd[13744]: 2012/08/21_17:34:14 info: G_main_add_TriggerHandler: Added signal manual handler
ccm[13739]: 2012/08/21_17:34:14 info: G_main_add_SignalHandler: Added signal handler for signal 15
stonithd[13742]: 2012/08/21_17:34:14 info: G_main_add_SignalHandler: Added signal handler for signal 17
mgmtd[13745]: 2012/08/21_17:34:14 info: init_crm
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-resource-failure-stickiness" name="default-resource-failure-stickiness" value="0"/>
crmd[13744]: 2012/08/21_17:34:14 info: G_main_add_SignalHandler: Added signal handler for signal 17
ccm[13739]: 2012/08/21_17:34:14 ERROR: socket_wait_conn_new: unlink failure(/usr/local/var/run/heartbeat/ccm/ccm): Permission denied
mgmtd[13745]: 2012/08/21_17:34:14 info: login to cib: 0, ret:-10
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stonith-enabled" name="stonith-enabled" value="false"/>
crmd[13744]: 2012/08/21_17:34:14 WARN: cib_native_signon: Connection to CIB failed: connection failed
ccm[13739]: 2012/08/21_17:34:14 ERROR: socket_wait_conn_new: trying to create in /usr/local/var/run/heartbeat/ccm/ccm bind:: Permission denied
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stonith-action" name="stonith-action" value="reboot"/>
ccm[13739]: 2012/08/21_17:34:14 ERROR: Can't create wait channel: Permission denied
heartbeat[13722]: 2012/08/21_17:34:14 WARN: Exiting /usr/local/lib/heartbeat/ccm process 13739 returned rc 1.
heartbeat[13722]: 2012/08/21_17:34:14 ERROR: Respawning client "/usr/local/lib/heartbeat/ccm":
heartbeat[13722]: 2012/08/21_17:34:14 info: Starting child client "/usr/local/lib/heartbeat/ccm" (503,501)
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stop-orphan-resources" name="stop-orphan-resources" value="true"/>
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stop-orphan-actions" name="stop-orphan-actions" value="true"/>
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-remove-after-stop" name="remove-after-stop" value="false"/>
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-short-resource-names" name="short-resource-names" value="true"/>
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-transition-idle-timeout" name="transition-idle-timeout" value="5min"/>
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-action-timeout" name="default-action-timeout" value="5s"/>
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-is-managed-default" name="is-managed-default" value="true"/>
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]         </attributes>
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]       </cluster_property_set>
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]     </crm_config>
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]     <nodes/>
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]     <resources>
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]       <group id="group_1">
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]         <primitive class="ocf" id="IPaddr_192_1_101_212" provider="heartbeat" type="IPaddr">
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]           <operations>
cib[13740]: 2012/08/21_17:34:14 info: log_data_element: readCibXmlFile: [on-disk]             <op id="IPaddr_192_1_101_212_mon" interval="5s" name="monitor" timeout="5s"/>
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]           </operations>
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]           <instance_attributes id="IPaddr_192_1_101_212_inst_attr">
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]             <attributes>
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]               <nvpair id="IPaddr_192_1_101_212_attr_0" name="ip" value="192.1.101.212"/>
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]             </attributes>
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]           </instance_attributes>
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]         </primitive>
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]         <primitive class="lsb" id="myhttpd_2" provider="heartbeat" type="myhttpd">
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]           <operations>
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]             <op id="myhttpd_2_mon" interval="120s" name="monitor" timeout="60s"/>
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]           </operations>
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]         </primitive>
mgmtd[13745]: 2012/08/21_17:34:15 info: login to cib: 1, ret:-10
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]       </group>
crmd[13744]: 2012/08/21_17:34:15 WARN: cib_native_signon: Connection to CIB failed: connection failed
heartbeat[13756]: 2012/08/21_17:34:15 info: Starting "/usr/local/lib/heartbeat/ccm" as uid 503  gid 501 (pid 13756)
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]     </resources>
crmd[13744]: 2012/08/21_17:34:15 WARN: do_cib_control: Couldn't complete CIB registration 1 times... pause and retry
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]     <constraints>
crmd[13744]: 2012/08/21_17:34:15 info: init_start: Starting crmd's mainloop
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]       <rsc_location id="rsc_location_group_1" rsc="group_1">
ccm[13756]: 2012/08/21_17:34:15 info: Hostname: oradb2-1
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]         <rule id="prefered_location_group_1" score="100">
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]           <expression attribute="#uname" id="prefered_location_group_1_expr" operation="eq" value="kf28-1"/>
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]         </rule>
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]       </rsc_location>
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]     </constraints>
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]   </configuration>
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk]   <status/>
cib[13740]: 2012/08/21_17:34:15 info: log_data_element: readCibXmlFile: [on-disk] </cib>
cib[13740]: 2012/08/21_17:34:15 notice: readCibXmlFile: Enabling DTD validation on the existing (sane) configuration
cib[13740]: 2012/08/21_17:34:16 info: startCib: CIB Initialization completed successfully
cib[13740]: 2012/08/21_17:34:16 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:16 WARN: init_start: CCM Connection failed 1 times (30 max)
crmd[13744]: 2012/08/21_17:34:16 info: crm_timer_popped: Wait Timer (I_NULL) just popped!
ccm[13756]: 2012/08/21_17:34:16 info: G_main_add_SignalHandler: Added signal handler for signal 15
ccm[13756]: 2012/08/21_17:34:16 ERROR: socket_wait_conn_new: unlink failure(/usr/local/var/run/heartbeat/ccm/ccm): Permission denied
ccm[13756]: 2012/08/21_17:34:16 ERROR: socket_wait_conn_new: trying to create in /usr/local/var/run/heartbeat/ccm/ccm bind:: Permission denied
ccm[13756]: 2012/08/21_17:34:16 ERROR: Can't create wait channel: Permission denied
heartbeat[13722]: 2012/08/21_17:34:16 WARN: Exiting /usr/local/lib/heartbeat/ccm process 13756 returned rc 1.
heartbeat[13722]: 2012/08/21_17:34:16 ERROR: Respawning client "/usr/local/lib/heartbeat/ccm":
heartbeat[13722]: 2012/08/21_17:34:16 info: Starting child client "/usr/local/lib/heartbeat/ccm" (503,501)
cib[13740]: 2012/08/21_17:34:17 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:17 WARN: init_start: CCM Connection failed 2 times (30 max)
heartbeat[13803]: 2012/08/21_17:34:17 info: Starting "/usr/local/lib/heartbeat/ccm" as uid 503  gid 501 (pid 13803)
作者: applegump    时间: 2012-08-21 17:39
续上:

cib[13740]: 2012/08/21_17:34:18 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:18 WARN: init_start: CCM Connection failed 3 times (30 max)
ccm[13803]: 2012/08/21_17:34:18 info: Hostname: oradb2-1
cib[13740]: 2012/08/21_17:34:19 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:19 WARN: init_start: CCM Connection failed 4 times (30 max)
cib[13740]: 2012/08/21_17:34:20 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:20 WARN: init_start: CCM Connection failed 5 times (30 max)
ccm[13803]: 2012/08/21_17:34:20 info: G_main_add_SignalHandler: Added signal handler for signal 15
ccm[13803]: 2012/08/21_17:34:20 ERROR: socket_wait_conn_new: unlink failure(/usr/local/var/run/heartbeat/ccm/ccm): Permission denied
ccm[13803]: 2012/08/21_17:34:20 ERROR: socket_wait_conn_new: trying to create in /usr/local/var/run/heartbeat/ccm/ccm bind:: Permission denied
ccm[13803]: 2012/08/21_17:34:20 ERROR: Can't create wait channel: Permission denied
heartbeat[13722]: 2012/08/21_17:34:20 WARN: Exiting /usr/local/lib/heartbeat/ccm process 13803 returned rc 1.
heartbeat[13722]: 2012/08/21_17:34:20 ERROR: Respawning client "/usr/local/lib/heartbeat/ccm":
heartbeat[13722]: 2012/08/21_17:34:20 info: Starting child client "/usr/local/lib/heartbeat/ccm" (503,501)
cib[13740]: 2012/08/21_17:34:21 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:21 WARN: init_start: CCM Connection failed 6 times (30 max)
heartbeat[13821]: 2012/08/21_17:34:21 info: Starting "/usr/local/lib/heartbeat/ccm" as uid 503  gid 501 (pid 13821)
ccm[13821]: 2012/08/21_17:34:21 info: Hostname: oradb2-1
cib[13740]: 2012/08/21_17:34:22 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:22 WARN: init_start: CCM Connection failed 7 times (30 max)
cib[13740]: 2012/08/21_17:34:23 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:23 WARN: init_start: CCM Connection failed 8 times (30 max)
cib[13740]: 2012/08/21_17:34:24 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:24 WARN: init_start: CCM Connection failed 9 times (30 max)
cib[13740]: 2012/08/21_17:34:25 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:25 WARN: init_start: CCM Connection failed 10 times (30 max)
cib[13740]: 2012/08/21_17:34:26 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:26 WARN: init_start: CCM Connection failed 11 times (30 max)
ccm[13821]: 2012/08/21_17:34:26 info: G_main_add_SignalHandler: Added signal handler for signal 15
ccm[13821]: 2012/08/21_17:34:26 ERROR: socket_wait_conn_new: unlink failure(/usr/local/var/run/heartbeat/ccm/ccm): Permission denied
ccm[13821]: 2012/08/21_17:34:26 ERROR: socket_wait_conn_new: trying to create in /usr/local/var/run/heartbeat/ccm/ccm bind:: Permission denied
ccm[13821]: 2012/08/21_17:34:26 ERROR: Can't create wait channel: Permission denied
heartbeat[13722]: 2012/08/21_17:34:26 WARN: Exiting /usr/local/lib/heartbeat/ccm process 13821 returned rc 1.
heartbeat[13722]: 2012/08/21_17:34:26 ERROR: Respawning client "/usr/local/lib/heartbeat/ccm":
heartbeat[13722]: 2012/08/21_17:34:26 info: Starting child client "/usr/local/lib/heartbeat/ccm" (503,501)
cib[13740]: 2012/08/21_17:34:27 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:27 WARN: init_start: CCM Connection failed 12 times (30 max)
heartbeat[13886]: 2012/08/21_17:34:27 info: Starting "/usr/local/lib/heartbeat/ccm" as uid 503  gid 501 (pid 13886)
cib[13740]: 2012/08/21_17:34:28 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:28 WARN: init_start: CCM Connection failed 13 times (30 max)
ccm[13886]: 2012/08/21_17:34:28 info: Hostname: oradb2-1
cib[13740]: 2012/08/21_17:34:29 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:29 WARN: init_start: CCM Connection failed 14 times (30 max)
ccm[13886]: 2012/08/21_17:34:29 info: G_main_add_SignalHandler: Added signal handler for signal 15
ccm[13886]: 2012/08/21_17:34:29 ERROR: socket_wait_conn_new: unlink failure(/usr/local/var/run/heartbeat/ccm/ccm): Permission denied
ccm[13886]: 2012/08/21_17:34:29 ERROR: socket_wait_conn_new: trying to create in /usr/local/var/run/heartbeat/ccm/ccm bind:: Permission denied
ccm[13886]: 2012/08/21_17:34:29 ERROR: Can't create wait channel: Permission denied
heartbeat[13722]: 2012/08/21_17:34:29 WARN: Exiting /usr/local/lib/heartbeat/ccm process 13886 returned rc 1.
heartbeat[13722]: 2012/08/21_17:34:29 ERROR: Respawning client "/usr/local/lib/heartbeat/ccm":
heartbeat[13722]: 2012/08/21_17:34:29 info: Starting child client "/usr/local/lib/heartbeat/ccm" (503,501)
cib[13740]: 2012/08/21_17:34:30 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:30 WARN: init_start: CCM Connection failed 15 times (30 max)
heartbeat[13896]: 2012/08/21_17:34:30 info: Starting "/usr/local/lib/heartbeat/ccm" as uid 503  gid 501 (pid 13896)
ccm[13896]: 2012/08/21_17:34:30 info: Hostname: oradb2-1
cib[13740]: 2012/08/21_17:34:31 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:31 WARN: init_start: CCM Connection failed 16 times (30 max)
cib[13740]: 2012/08/21_17:34:32 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:32 WARN: init_start: CCM Connection failed 17 times (30 max)
cib[13740]: 2012/08/21_17:34:33 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:33 WARN: init_start: CCM Connection failed 18 times (30 max)
cib[13740]: 2012/08/21_17:34:34 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:34 WARN: init_start: CCM Connection failed 19 times (30 max)
cib[13740]: 2012/08/21_17:34:35 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:35 WARN: init_start: CCM Connection failed 20 times (30 max)
ccm[13896]: 2012/08/21_17:34:35 info: G_main_add_SignalHandler: Added signal handler for signal 15
ccm[13896]: 2012/08/21_17:34:35 ERROR: socket_wait_conn_new: unlink failure(/usr/local/var/run/heartbeat/ccm/ccm): Permission denied
ccm[13896]: 2012/08/21_17:34:35 ERROR: socket_wait_conn_new: trying to create in /usr/local/var/run/heartbeat/ccm/ccm bind:: Permission denied
ccm[13896]: 2012/08/21_17:34:35 ERROR: Can't create wait channel: Permission denied
heartbeat[13722]: 2012/08/21_17:34:35 WARN: Exiting /usr/local/lib/heartbeat/ccm process 13896 returned rc 1.
heartbeat[13722]: 2012/08/21_17:34:35 ERROR: Respawning client "/usr/local/lib/heartbeat/ccm":
heartbeat[13722]: 2012/08/21_17:34:35 info: Starting child client "/usr/local/lib/heartbeat/ccm" (503,501)
cib[13740]: 2012/08/21_17:34:36 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:36 WARN: init_start: CCM Connection failed 21 times (30 max)
heartbeat[13959]: 2012/08/21_17:34:36 info: Starting "/usr/local/lib/heartbeat/ccm" as uid 503  gid 501 (pid 13959)
ccm[13959]: 2012/08/21_17:34:36 info: Hostname: oradb2-1
cib[13740]: 2012/08/21_17:34:37 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:37 WARN: init_start: CCM Connection failed 22 times (30 max)
cib[13740]: 2012/08/21_17:34:38 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:38 WARN: init_start: CCM Connection failed 23 times (30 max)
ccm[13959]: 2012/08/21_17:34:38 info: G_main_add_SignalHandler: Added signal handler for signal 15
ccm[13959]: 2012/08/21_17:34:38 ERROR: socket_wait_conn_new: unlink failure(/usr/local/var/run/heartbeat/ccm/ccm): Permission denied
ccm[13959]: 2012/08/21_17:34:38 ERROR: socket_wait_conn_new: trying to create in /usr/local/var/run/heartbeat/ccm/ccm bind:: Permission denied
ccm[13959]: 2012/08/21_17:34:38 ERROR: Can't create wait channel: Permission denied
heartbeat[13722]: 2012/08/21_17:34:38 WARN: Exiting /usr/local/lib/heartbeat/ccm process 13959 returned rc 1.
heartbeat[13722]: 2012/08/21_17:34:38 ERROR: Respawning client "/usr/local/lib/heartbeat/ccm":
heartbeat[13722]: 2012/08/21_17:34:38 info: Starting child client "/usr/local/lib/heartbeat/ccm" (503,501)
cib[13740]: 2012/08/21_17:34:39 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:39 WARN: init_start: CCM Connection failed 24 times (30 max)
heartbeat[13977]: 2012/08/21_17:34:39 info: Starting "/usr/local/lib/heartbeat/ccm" as uid 503  gid 501 (pid 13977)
ccm[13977]: 2012/08/21_17:34:39 info: Hostname: oradb2-1
cib[13740]: 2012/08/21_17:34:40 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:40 WARN: init_start: CCM Connection failed 25 times (30 max)
cib[13740]: 2012/08/21_17:34:41 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:41 WARN: init_start: CCM Connection failed 26 times (30 max)
cib[13740]: 2012/08/21_17:34:42 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:42 WARN: init_start: CCM Connection failed 27 times (30 max)
cib[13740]: 2012/08/21_17:34:43 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:43 WARN: init_start: CCM Connection failed 28 times (30 max)
cib[13740]: 2012/08/21_17:34:44 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:44 WARN: init_start: CCM Connection failed 29 times (30 max)
ccm[13977]: 2012/08/21_17:34:44 info: G_main_add_SignalHandler: Added signal handler for signal 15
ccm[13977]: 2012/08/21_17:34:44 ERROR: socket_wait_conn_new: unlink failure(/usr/local/var/run/heartbeat/ccm/ccm): Permission denied
ccm[13977]: 2012/08/21_17:34:44 ERROR: socket_wait_conn_new: trying to create in /usr/local/var/run/heartbeat/ccm/ccm bind:: Permission denied
ccm[13977]: 2012/08/21_17:34:44 ERROR: Can't create wait channel: Permission denied
heartbeat[13722]: 2012/08/21_17:34:44 WARN: Exiting /usr/local/lib/heartbeat/ccm process 13977 returned rc 1.
heartbeat[13722]: 2012/08/21_17:34:44 ERROR: Respawning client "/usr/local/lib/heartbeat/ccm":
heartbeat[13722]: 2012/08/21_17:34:44 info: Starting child client "/usr/local/lib/heartbeat/ccm" (503,501)
cib[13740]: 2012/08/21_17:34:45 WARN: init_start: CCM Activation failed
cib[13740]: 2012/08/21_17:34:45 ERROR: init_start: CCM Activation failed 30 (max) times
cib[13740]: 2012/08/21_17:34:45 ERROR: init_start: Couldnt start all communication channels, exiting.
attrd[13743]: 2012/08/21_17:34:45 ERROR: cib_native_signon: No reply message - disconnected - 0
heartbeat[13722]: 2012/08/21_17:34:45 info: Exiting /usr/local/lib/heartbeat/cib process 13740 returned rc 0.
attrd[13743]: 2012/08/21_17:34:45 WARN: cib_native_signon: Connection to CIB failed: not connected
heartbeat[13722]: 2012/08/21_17:34:45 ERROR: Respawning client "/usr/local/lib/heartbeat/cib":
heartbeat[13722]: 2012/08/21_17:34:45 info: Starting child client "/usr/local/lib/heartbeat/cib" (503,501)
mgmtd[13745]: 2012/08/21_17:34:45 ERROR: cib_native_signon: No reply message - disconnected - 0
mgmtd[13745]: 2012/08/21_17:34:45 info: login to cib: 2, ret:-3
crmd[13744]: 2012/08/21_17:34:45 ERROR: cib_native_signon: No reply message - disconnected - 0
crmd[13744]: 2012/08/21_17:34:45 WARN: cib_native_signon: Connection to CIB failed: not connected
heartbeat[14008]: 2012/08/21_17:34:45 info: Starting "/usr/local/lib/heartbeat/cib" as uid 503  gid 501 (pid 1400
cib[14008]: 2012/08/21_17:34:45 info: G_main_add_SignalHandler: Added signal handler for signal 15
cib[14008]: 2012/08/21_17:34:45 info: G_main_add_TriggerHandler: Added signal manual handler
cib[14008]: 2012/08/21_17:34:45 info: G_main_add_SignalHandler: Added signal handler for signal 17
cib[14008]: 2012/08/21_17:34:45 info: main: Retrieval of a per-action CIB: disabled
cib[14008]: 2012/08/21_17:34:45 info: cib_register_ha: Signing in with Heartbeat
cib[14008]: 2012/08/21_17:34:45 info: cib_register_ha: FSA Hostname: oradb2-1
cib[14008]: 2012/08/21_17:34:45 info: readCibXmlFile: Reading cluster configuration from: /usr/local/var/lib/heartbeat/crm/cib.xml
cib[14008]: 2012/08/21_17:34:45 WARN: validate_cib_digest: No on-disk digest present
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk] <cib admin_epoch="0" epoch="0" num_updates="0" generated="false" have_quorum="false">
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]   <configuration>
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]     <crm_config>
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]       <cluster_property_set id="cib-bootstrap-options">
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]         <attributes>
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-symmetric-cluster" name="symmetric-cluster" value="true"/>
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-no_quorum-policy" name="no_quorum-policy" value="stop"/>
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-resource-stickiness" name="default-resource-stickiness" value="0"/>
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-resource-failure-stickiness" name="default-resource-failure-stickiness" value="0"/>
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stonith-enabled" name="stonith-enabled" value="false"/>
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stonith-action" name="stonith-action" value="reboot"/>
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stop-orphan-resources" name="stop-orphan-resources" value="true"/>
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stop-orphan-actions" name="stop-orphan-actions" value="true"/>
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-remove-after-stop" name="remove-after-stop" value="false"/>
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-short-resource-names" name="short-resource-names" value="true"/>
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-transition-idle-timeout" name="transition-idle-timeout" value="5min"/>
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-action-timeout" name="default-action-timeout" value="5s"/>
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-is-managed-default" name="is-managed-default" value="true"/>
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]         </attributes>
heartbeat[14007]: 2012/08/21_17:34:45 info: Starting "/usr/local/lib/heartbeat/ccm" as uid 503  gid 501 (pid 14007)
cib[14008]: 2012/08/21_17:34:45 info: log_data_element: readCibXmlFile: [on-disk]       </cluster_property_set>
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]     </crm_config>
作者: applegump    时间: 2012-08-21 17:40
再续上:

cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]     <nodes/>
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]     <resources>
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]       <group id="group_1">
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]         <primitive class="ocf" id="IPaddr_192_1_101_212" provider="heartbeat" type="IPaddr">
mgmtd[13745]: 2012/08/21_17:34:46 info: login to cib: 3, ret:-10
crmd[13744]: 2012/08/21_17:34:46 WARN: cib_native_signon: Connection to CIB failed: connection failed
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]           <operations>
crmd[13744]: 2012/08/21_17:34:46 WARN: do_cib_control: Couldn't complete CIB registration 2 times... pause and retry
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]             <op id="IPaddr_192_1_101_212_mon" interval="5s" name="monitor" timeout="5s"/>
crmd[13744]: 2012/08/21_17:34:46 ERROR: crm_fsa_trigger: FSA took 30160ms to complete
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]           </operations>
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]           <instance_attributes id="IPaddr_192_1_101_212_inst_attr">
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]             <attributes>
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]               <nvpair id="IPaddr_192_1_101_212_attr_0" name="ip" value="192.1.101.212"/>
ccm[14007]: 2012/08/21_17:34:46 info: Hostname: oradb2-1
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]             </attributes>
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]           </instance_attributes>
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]         </primitive>
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]         <primitive class="lsb" id="myhttpd_2" provider="heartbeat" type="myhttpd">
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]           <operations>
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]             <op id="myhttpd_2_mon" interval="120s" name="monitor" timeout="60s"/>
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]           </operations>
crmd[13744]: 2012/08/21_17:34:46 info: crm_timer_popped: Wait Timer (I_NULL) just popped!
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]         </primitive>
crmd[13744]: 2012/08/21_17:34:46 WARN: cib_native_signon: Connection to CIB failed: connection failed
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]       </group>
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]     </resources>
cib[14008]: 2012/08/21_17:34:46 info: log_data_element: readCibXmlFile: [on-disk]     <constraints>
cib[14008]: 2012/08/21_17:34:47 info: log_data_element: readCibXmlFile: [on-disk]       <rsc_location id="rsc_location_group_1" rsc="group_1">
cib[14008]: 2012/08/21_17:34:47 info: log_data_element: readCibXmlFile: [on-disk]         <rule id="prefered_location_group_1" score="100">
cib[14008]: 2012/08/21_17:34:47 info: log_data_element: readCibXmlFile: [on-disk]           <expression attribute="#uname" id="prefered_location_group_1_expr" operation="eq" value="kf28-1"/>
cib[14008]: 2012/08/21_17:34:47 info: log_data_element: readCibXmlFile: [on-disk]         </rule>
cib[14008]: 2012/08/21_17:34:47 info: log_data_element: readCibXmlFile: [on-disk]       </rsc_location>
cib[14008]: 2012/08/21_17:34:47 info: log_data_element: readCibXmlFile: [on-disk]     </constraints>
mgmtd[13745]: 2012/08/21_17:34:47 info: login to cib: 4, ret:-10
cib[14008]: 2012/08/21_17:34:47 info: log_data_element: readCibXmlFile: [on-disk]   </configuration>
cib[14008]: 2012/08/21_17:34:47 info: log_data_element: readCibXmlFile: [on-disk]   <status/>
cib[14008]: 2012/08/21_17:34:47 info: log_data_element: readCibXmlFile: [on-disk] </cib>
cib[14008]: 2012/08/21_17:34:47 notice: readCibXmlFile: Enabling DTD validation on the existing (sane) configuration
cib[14008]: 2012/08/21_17:34:47 info: startCib: CIB Initialization completed successfully
cib[14008]: 2012/08/21_17:34:47 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:34:47 WARN: init_start: CCM Connection failed 1 times (30 max)
ccm[14007]: 2012/08/21_17:34:47 info: G_main_add_SignalHandler: Added signal handler for signal 15
ccm[14007]: 2012/08/21_17:34:47 ERROR: socket_wait_conn_new: unlink failure(/usr/local/var/run/heartbeat/ccm/ccm): Permission denied
ccm[14007]: 2012/08/21_17:34:47 ERROR: socket_wait_conn_new: trying to create in /usr/local/var/run/heartbeat/ccm/ccm bind:: Permission denied
ccm[14007]: 2012/08/21_17:34:47 ERROR: Can't create wait channel: Permission denied
heartbeat[13722]: 2012/08/21_17:34:47 WARN: Exiting /usr/local/lib/heartbeat/ccm process 14007 returned rc 1.
heartbeat[13722]: 2012/08/21_17:34:47 ERROR: Respawning client "/usr/local/lib/heartbeat/ccm":
heartbeat[13722]: 2012/08/21_17:34:47 info: Starting child client "/usr/local/lib/heartbeat/ccm" (503,501)
mgmtd[13745]: 2012/08/21_17:34:48 info: login to cib failed
mgmtd[13745]: 2012/08/21_17:34:48 ERROR: Can't initialize management library.Shutting down.(-1)
heartbeat[13722]: 2012/08/21_17:34:48 WARN: Exiting /usr/local/lib/heartbeat/mgmtd -v process 13745 returned rc 1.
heartbeat[13722]: 2012/08/21_17:34:48 ERROR: Respawning client "/usr/local/lib/heartbeat/mgmtd -v":
heartbeat[13722]: 2012/08/21_17:34:48 info: Starting child client "/usr/local/lib/heartbeat/mgmtd -v" (0,0)
heartbeat[14063]: 2012/08/21_17:34:48 info: Starting "/usr/local/lib/heartbeat/mgmtd -v" as uid 0  gid 0 (pid 14063)
mgmtd[14063]: 2012/08/21_17:34:48 info: G_main_add_SignalHandler: Added signal handler for signal 15
mgmtd[14063]: 2012/08/21_17:34:48 debug: Enabling coredumps
mgmtd[14063]: 2012/08/21_17:34:48 info: G_main_add_SignalHandler: Added signal handler for signal 10
mgmtd[14063]: 2012/08/21_17:34:48 info: G_main_add_SignalHandler: Added signal handler for signal 12
mgmtd[14063]: 2012/08/21_17:34:48 info: init_crm
cib[14008]: 2012/08/21_17:34:48 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:34:48 WARN: init_start: CCM Connection failed 2 times (30 max)
heartbeat[14054]: 2012/08/21_17:34:48 info: Starting "/usr/local/lib/heartbeat/ccm" as uid 503  gid 501 (pid 14054)
ccm[14054]: 2012/08/21_17:34:48 info: Hostname: oradb2-1
cib[14008]: 2012/08/21_17:34:49 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:34:49 WARN: init_start: CCM Connection failed 3 times (30 max)
cib[14008]: 2012/08/21_17:34:50 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:34:50 WARN: init_start: CCM Connection failed 4 times (30 max)
cib[14008]: 2012/08/21_17:34:51 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:34:51 WARN: init_start: CCM Connection failed 5 times (30 max)
cib[14008]: 2012/08/21_17:34:52 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:34:52 WARN: init_start: CCM Connection failed 6 times (30 max)
cib[14008]: 2012/08/21_17:34:53 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:34:53 WARN: init_start: CCM Connection failed 7 times (30 max)
ccm[14054]: 2012/08/21_17:34:53 info: G_main_add_SignalHandler: Added signal handler for signal 15
ccm[14054]: 2012/08/21_17:34:53 ERROR: socket_wait_conn_new: unlink failure(/usr/local/var/run/heartbeat/ccm/ccm): Permission denied
ccm[14054]: 2012/08/21_17:34:53 ERROR: socket_wait_conn_new: trying to create in /usr/local/var/run/heartbeat/ccm/ccm bind:: Permission denied
ccm[14054]: 2012/08/21_17:34:53 ERROR: Can't create wait channel: Permission denied
heartbeat[13722]: 2012/08/21_17:34:53 WARN: Exiting /usr/local/lib/heartbeat/ccm process 14054 returned rc 1.
heartbeat[13722]: 2012/08/21_17:34:53 ERROR: Respawning client "/usr/local/lib/heartbeat/ccm":
heartbeat[13722]: 2012/08/21_17:34:53 info: Starting child client "/usr/local/lib/heartbeat/ccm" (503,501)
cib[14008]: 2012/08/21_17:34:54 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:34:54 WARN: init_start: CCM Connection failed 8 times (30 max)
heartbeat[14084]: 2012/08/21_17:34:54 info: Starting "/usr/local/lib/heartbeat/ccm" as uid 503  gid 501 (pid 14084)
ccm[14084]: 2012/08/21_17:34:54 info: Hostname: oradb2-1
cib[14008]: 2012/08/21_17:34:55 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:34:55 WARN: init_start: CCM Connection failed 9 times (30 max)
cib[14008]: 2012/08/21_17:34:56 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:34:56 WARN: init_start: CCM Connection failed 10 times (30 max)
cib[14008]: 2012/08/21_17:34:57 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:34:57 WARN: init_start: CCM Connection failed 11 times (30 max)
ccm[14084]: 2012/08/21_17:34:58 info: G_main_add_SignalHandler: Added signal handler for signal 15
ccm[14084]: 2012/08/21_17:34:58 ERROR: socket_wait_conn_new: unlink failure(/usr/local/var/run/heartbeat/ccm/ccm): Permission denied
ccm[14084]: 2012/08/21_17:34:58 ERROR: socket_wait_conn_new: trying to create in /usr/local/var/run/heartbeat/ccm/ccm bind:: Permission denied
ccm[14084]: 2012/08/21_17:34:58 ERROR: Can't create wait channel: Permission denied
heartbeat[13722]: 2012/08/21_17:34:58 WARN: Exiting /usr/local/lib/heartbeat/ccm process 14084 returned rc 1.
heartbeat[13722]: 2012/08/21_17:34:58 ERROR: Client /usr/local/lib/heartbeat/ccm "respawning too fast"
cib[14008]: 2012/08/21_17:34:58 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:34:58 WARN: init_start: CCM Connection failed 12 times (30 max)
cib[14008]: 2012/08/21_17:34:59 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:34:59 WARN: init_start: CCM Connection failed 13 times (30 max)
cib[14008]: 2012/08/21_17:35:00 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:00 WARN: init_start: CCM Connection failed 14 times (30 max)
cib[14008]: 2012/08/21_17:35:01 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:01 WARN: init_start: CCM Connection failed 15 times (30 max)
cib[14008]: 2012/08/21_17:35:02 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:02 WARN: init_start: CCM Connection failed 16 times (30 max)
cib[14008]: 2012/08/21_17:35:03 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:03 WARN: init_start: CCM Connection failed 17 times (30 max)
cib[14008]: 2012/08/21_17:35:04 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:04 WARN: init_start: CCM Connection failed 18 times (30 max)
cib[14008]: 2012/08/21_17:35:05 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:05 WARN: init_start: CCM Connection failed 19 times (30 max)
cib[14008]: 2012/08/21_17:35:06 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:06 WARN: init_start: CCM Connection failed 20 times (30 max)
cib[14008]: 2012/08/21_17:35:07 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:07 WARN: init_start: CCM Connection failed 21 times (30 max)
cib[14008]: 2012/08/21_17:35:08 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:08 WARN: init_start: CCM Connection failed 22 times (30 max)
cib[14008]: 2012/08/21_17:35:09 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:09 WARN: init_start: CCM Connection failed 23 times (30 max)
cib[14008]: 2012/08/21_17:35:10 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:10 WARN: init_start: CCM Connection failed 24 times (30 max)
cib[14008]: 2012/08/21_17:35:11 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:11 WARN: init_start: CCM Connection failed 25 times (30 max)
cib[14008]: 2012/08/21_17:35:12 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:12 WARN: init_start: CCM Connection failed 26 times (30 max)
cib[14008]: 2012/08/21_17:35:13 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:13 WARN: init_start: CCM Connection failed 27 times (30 max)
cib[14008]: 2012/08/21_17:35:14 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:14 WARN: init_start: CCM Connection failed 28 times (30 max)
cib[14008]: 2012/08/21_17:35:15 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:15 WARN: init_start: CCM Connection failed 29 times (30 max)
cib[14008]: 2012/08/21_17:35:16 WARN: init_start: CCM Activation failed
cib[14008]: 2012/08/21_17:35:16 ERROR: init_start: CCM Activation failed 30 (max) times
cib[14008]: 2012/08/21_17:35:16 ERROR: init_start: Couldnt start all communication channels, exiting.
mgmtd[14063]: 2012/08/21_17:35:16 ERROR: cib_native_signon: No reply message - disconnected - 0
crmd[13744]: 2012/08/21_17:35:16 ERROR: cib_native_signon: No reply message - disconnected - 0
attrd[13743]: 2012/08/21_17:35:16 ERROR: cib_native_signon: No reply message - disconnected - 0
heartbeat[13722]: 2012/08/21_17:35:16 info: Exiting /usr/local/lib/heartbeat/cib process 14008 returned rc 0.
crmd[13744]: 2012/08/21_17:35:16 WARN: cib_native_signon: Connection to CIB failed: not connected
mgmtd[14063]: 2012/08/21_17:35:16 info: login to cib: 0, ret:-3
heartbeat[13722]: 2012/08/21_17:35:16 ERROR: Respawning client "/usr/local/lib/heartbeat/cib":
attrd[13743]: 2012/08/21_17:35:16 WARN: cib_native_signon: Connection to CIB failed: not connected
crmd[13744]: 2012/08/21_17:35:16 WARN: do_cib_control: Couldn't complete CIB registration 3 times... pause and retry
heartbeat[13722]: 2012/08/21_17:35:16 info: Starting child client "/usr/local/lib/heartbeat/cib" (503,501)
crmd[13744]: 2012/08/21_17:35:16 WARN: crm_fsa_trigger: FSA took 29660ms to complete
heartbeat[14291]: 2012/08/21_17:35:16 info: Starting "/usr/local/lib/heartbeat/cib" as uid 503  gid 501 (pid 14291)
cib[14291]: 2012/08/21_17:35:16 info: G_main_add_SignalHandler: Added signal handler for signal 15
cib[14291]: 2012/08/21_17:35:16 info: G_main_add_TriggerHandler: Added signal manual handler
作者: applegump    时间: 2012-08-21 17:42
再续上:

cib[14291]: 2012/08/21_17:35:16 info: G_main_add_SignalHandler: Added signal handler for signal 17
cib[14291]: 2012/08/21_17:35:16 info: main: Retrieval of a per-action CIB: disabled
cib[14291]: 2012/08/21_17:35:16 info: cib_register_ha: Signing in with Heartbeat
cib[14291]: 2012/08/21_17:35:16 info: cib_register_ha: FSA Hostname: oradb2-1
cib[14291]: 2012/08/21_17:35:16 info: readCibXmlFile: Reading cluster configuration from: /usr/local/var/lib/heartbeat/crm/cib.xml
cib[14291]: 2012/08/21_17:35:16 WARN: validate_cib_digest: No on-disk digest present
cib[14291]: 2012/08/21_17:35:16 info: log_data_element: readCibXmlFile: [on-disk] <cib admin_epoch="0" epoch="0" num_updates="0" generated="false" have_quorum="false">
cib[14291]: 2012/08/21_17:35:16 info: log_data_element: readCibXmlFile: [on-disk]   <configuration>
cib[14291]: 2012/08/21_17:35:16 info: log_data_element: readCibXmlFile: [on-disk]     <crm_config>
cib[14291]: 2012/08/21_17:35:16 info: log_data_element: readCibXmlFile: [on-disk]       <cluster_property_set id="cib-bootstrap-options">
cib[14291]: 2012/08/21_17:35:16 info: log_data_element: readCibXmlFile: [on-disk]         <attributes>
cib[14291]: 2012/08/21_17:35:16 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-symmetric-cluster" name="symmetric-cluster" value="true"/>
cib[14291]: 2012/08/21_17:35:16 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-no_quorum-policy" name="no_quorum-policy" value="stop"/>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-resource-stickiness" name="default-resource-stickiness" value="0"/>
crmd[13744]: 2012/08/21_17:35:17 info: crm_timer_popped: Wait Timer (I_NULL) just popped!
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-resource-failure-stickiness" name="default-resource-failure-stickiness" value="0"/>
crmd[13744]: 2012/08/21_17:35:17 WARN: cib_native_signon: Connection to CIB failed: connection failed
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stonith-enabled" name="stonith-enabled" value="false"/>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stonith-action" name="stonith-action" value="reboot"/>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stop-orphan-resources" name="stop-orphan-resources" value="true"/>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stop-orphan-actions" name="stop-orphan-actions" value="true"/>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-remove-after-stop" name="remove-after-stop" value="false"/>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-short-resource-names" name="short-resource-names" value="true"/>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-transition-idle-timeout" name="transition-idle-timeout" value="5min"/>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-action-timeout" name="default-action-timeout" value="5s"/>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-is-managed-default" name="is-managed-default" value="true"/>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]         </attributes>
mgmtd[14063]: 2012/08/21_17:35:17 info: login to cib: 1, ret:-10
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]       </cluster_property_set>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]     </crm_config>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]     <nodes/>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]     <resources>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]       <group id="group_1">
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]         <primitive class="ocf" id="IPaddr_192_1_101_212" provider="heartbeat" type="IPaddr">
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]           <operations>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]             <op id="IPaddr_192_1_101_212_mon" interval="5s" name="monitor" timeout="5s"/>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]           </operations>
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]           <instance_attributes id="IPaddr_192_1_101_212_inst_attr">
cib[14291]: 2012/08/21_17:35:17 info: log_data_element: readCibXmlFile: [on-disk]             <attributes>
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]               <nvpair id="IPaddr_192_1_101_212_attr_0" name="ip" value="192.1.101.212"/>
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]             </attributes>
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]           </instance_attributes>
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]         </primitive>
crmd[13744]: 2012/08/21_17:35:18 WARN: cib_native_signon: Connection to CIB failed: connection failed
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]         <primitive class="lsb" id="myhttpd_2" provider="heartbeat" type="myhttpd">
crmd[13744]: 2012/08/21_17:35:18 WARN: do_cib_control: Couldn't complete CIB registration 4 times... pause and retry
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]           <operations>
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]             <op id="myhttpd_2_mon" interval="120s" name="monitor" timeout="60s"/>
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]           </operations>
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]         </primitive>
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]       </group>
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]     </resources>
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]     <constraints>
mgmtd[14063]: 2012/08/21_17:35:18 info: login to cib: 2, ret:-10
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]       <rsc_location id="rsc_location_group_1" rsc="group_1">
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]         <rule id="prefered_location_group_1" score="100">
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]           <expression attribute="#uname" id="prefered_location_group_1_expr" operation="eq" value="kf28-1"/>
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]         </rule>
crmd[13744]: 2012/08/21_17:35:18 info: crm_timer_popped: Wait Timer (I_NULL) just popped!
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]       </rsc_location>
crmd[13744]: 2012/08/21_17:35:18 WARN: cib_native_signon: Connection to CIB failed: connection failed
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]     </constraints>
cib[14291]: 2012/08/21_17:35:18 info: log_data_element: readCibXmlFile: [on-disk]   </configuration>
cib[14291]: 2012/08/21_17:35:19 info: log_data_element: readCibXmlFile: [on-disk]   <status/>
cib[14291]: 2012/08/21_17:35:19 info: log_data_element: readCibXmlFile: [on-disk] </cib>
cib[14291]: 2012/08/21_17:35:19 notice: readCibXmlFile: Enabling DTD validation on the existing (sane) configuration
cib[14291]: 2012/08/21_17:35:19 info: startCib: CIB Initialization completed successfully
cib[14291]: 2012/08/21_17:35:19 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:19 WARN: init_start: CCM Connection failed 1 times (30 max)
cib[14291]: 2012/08/21_17:35:20 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:20 WARN: init_start: CCM Connection failed 2 times (30 max)
cib[14291]: 2012/08/21_17:35:21 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:21 WARN: init_start: CCM Connection failed 3 times (30 max)
cib[14291]: 2012/08/21_17:35:22 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:22 WARN: init_start: CCM Connection failed 4 times (30 max)
cib[14291]: 2012/08/21_17:35:23 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:23 WARN: init_start: CCM Connection failed 5 times (30 max)
cib[14291]: 2012/08/21_17:35:24 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:24 WARN: init_start: CCM Connection failed 6 times (30 max)
cib[14291]: 2012/08/21_17:35:25 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:25 WARN: init_start: CCM Connection failed 7 times (30 max)
cib[14291]: 2012/08/21_17:35:26 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:26 WARN: init_start: CCM Connection failed 8 times (30 max)
cib[14291]: 2012/08/21_17:35:27 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:27 WARN: init_start: CCM Connection failed 9 times (30 max)
cib[14291]: 2012/08/21_17:35:28 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:28 WARN: init_start: CCM Connection failed 10 times (30 max)
cib[14291]: 2012/08/21_17:35:29 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:29 WARN: init_start: CCM Connection failed 11 times (30 max)
cib[14291]: 2012/08/21_17:35:30 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:30 WARN: init_start: CCM Connection failed 12 times (30 max)
cib[14291]: 2012/08/21_17:35:31 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:31 WARN: init_start: CCM Connection failed 13 times (30 max)
cib[14291]: 2012/08/21_17:35:32 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:32 WARN: init_start: CCM Connection failed 14 times (30 max)
cib[14291]: 2012/08/21_17:35:33 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:33 WARN: init_start: CCM Connection failed 15 times (30 max)
cib[14291]: 2012/08/21_17:35:34 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:34 WARN: init_start: CCM Connection failed 16 times (30 max)
cib[14291]: 2012/08/21_17:35:35 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:35 WARN: init_start: CCM Connection failed 17 times (30 max)
cib[14291]: 2012/08/21_17:35:36 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:36 WARN: init_start: CCM Connection failed 18 times (30 max)
cib[14291]: 2012/08/21_17:35:37 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:37 WARN: init_start: CCM Connection failed 19 times (30 max)
cib[14291]: 2012/08/21_17:35:38 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:38 WARN: init_start: CCM Connection failed 20 times (30 max)
cib[14291]: 2012/08/21_17:35:39 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:39 WARN: init_start: CCM Connection failed 21 times (30 max)
cib[14291]: 2012/08/21_17:35:40 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:40 WARN: init_start: CCM Connection failed 22 times (30 max)
cib[14291]: 2012/08/21_17:35:41 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:41 WARN: init_start: CCM Connection failed 23 times (30 max)
cib[14291]: 2012/08/21_17:35:42 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:42 WARN: init_start: CCM Connection failed 24 times (30 max)
cib[14291]: 2012/08/21_17:35:43 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:43 WARN: init_start: CCM Connection failed 25 times (30 max)
cib[14291]: 2012/08/21_17:35:44 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:44 WARN: init_start: CCM Connection failed 26 times (30 max)
cib[14291]: 2012/08/21_17:35:45 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:45 WARN: init_start: CCM Connection failed 27 times (30 max)
cib[14291]: 2012/08/21_17:35:46 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:46 WARN: init_start: CCM Connection failed 28 times (30 max)
cib[14291]: 2012/08/21_17:35:47 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:47 WARN: init_start: CCM Connection failed 29 times (30 max)
cib[14291]: 2012/08/21_17:35:48 WARN: init_start: CCM Activation failed
cib[14291]: 2012/08/21_17:35:48 ERROR: init_start: CCM Activation failed 30 (max) times
cib[14291]: 2012/08/21_17:35:48 ERROR: init_start: Couldnt start all communication channels, exiting.
crmd[13744]: 2012/08/21_17:35:48 ERROR: cib_native_signon: No reply message - disconnected - 0
attrd[13743]: 2012/08/21_17:35:48 ERROR: cib_native_signon: No reply message - disconnected - 0
heartbeat[13722]: 2012/08/21_17:35:48 info: Exiting /usr/local/lib/heartbeat/cib process 14291 returned rc 0.
crmd[13744]: 2012/08/21_17:35:48 WARN: cib_native_signon: Connection to CIB failed: not connected
attrd[13743]: 2012/08/21_17:35:48 WARN: cib_native_signon: Connection to CIB failed: not connected
heartbeat[13722]: 2012/08/21_17:35:48 ERROR: Respawning client "/usr/local/lib/heartbeat/cib":
crmd[13744]: 2012/08/21_17:35:48 WARN: do_cib_control: Couldn't complete CIB registration 5 times... pause and retry
heartbeat[13722]: 2012/08/21_17:35:48 info: Starting child client "/usr/local/lib/heartbeat/cib" (503,501)
crmd[13744]: 2012/08/21_17:35:48 WARN: crm_fsa_trigger: FSA took 29370ms to complete
mgmtd[14063]: 2012/08/21_17:35:48 ERROR: cib_native_signon: No reply message - disconnected - 0
mgmtd[14063]: 2012/08/21_17:35:48 info: login to cib: 3, ret:-3
heartbeat[14549]: 2012/08/21_17:35:48 info: Starting "/usr/local/lib/heartbeat/cib" as uid 503  gid 501 (pid 14549)
cib[14549]: 2012/08/21_17:35:48 info: G_main_add_SignalHandler: Added signal handler for signal 15
cib[14549]: 2012/08/21_17:35:48 info: G_main_add_TriggerHandler: Added signal manual handler
cib[14549]: 2012/08/21_17:35:48 info: G_main_add_SignalHandler: Added signal handler for signal 17
cib[14549]: 2012/08/21_17:35:48 info: main: Retrieval of a per-action CIB: disabled
cib[14549]: 2012/08/21_17:35:48 info: cib_register_ha: Signing in with Heartbeat
cib[14549]: 2012/08/21_17:35:48 info: cib_register_ha: FSA Hostname: oradb2-1
cib[14549]: 2012/08/21_17:35:48 info: readCibXmlFile: Reading cluster configuration from: /usr/local/var/lib/heartbeat/crm/cib.xml
cib[14549]: 2012/08/21_17:35:48 WARN: validate_cib_digest: No on-disk digest present
cib[14549]: 2012/08/21_17:35:48 info: log_data_element: readCibXmlFile: [on-disk] <cib admin_epoch="0" epoch="0" num_updates="0" generated="false" have_quorum="false">
cib[14549]: 2012/08/21_17:35:48 info: log_data_element: readCibXmlFile: [on-disk]   <configuration>
作者: applegump    时间: 2012-08-21 17:43
再续上:


cib[14549]: 2012/08/21_17:35:48 info: log_data_element: readCibXmlFile: [on-disk]     <crm_config>
cib[14549]: 2012/08/21_17:35:48 info: log_data_element: readCibXmlFile: [on-disk]       <cluster_property_set id="cib-bootstrap-options">
cib[14549]: 2012/08/21_17:35:48 info: log_data_element: readCibXmlFile: [on-disk]         <attributes>
cib[14549]: 2012/08/21_17:35:48 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-symmetric-cluster" name="symmetric-cluster" value="true"/>
cib[14549]: 2012/08/21_17:35:48 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-no_quorum-policy" name="no_quorum-policy" value="stop"/>
crmd[13744]: 2012/08/21_17:35:48 info: crm_timer_popped: Wait Timer (I_NULL) just popped!
cib[14549]: 2012/08/21_17:35:48 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-resource-stickiness" name="default-resource-stickiness" value="0"/>
crmd[13744]: 2012/08/21_17:35:48 WARN: cib_native_signon: Connection to CIB failed: connection failed
cib[14549]: 2012/08/21_17:35:48 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-resource-failure-stickiness" name="default-resource-failure-stickiness" value="0"/>
cib[14549]: 2012/08/21_17:35:48 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stonith-enabled" name="stonith-enabled" value="false"/>
cib[14549]: 2012/08/21_17:35:48 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stonith-action" name="stonith-action" value="reboot"/>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stop-orphan-resources" name="stop-orphan-resources" value="true"/>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-stop-orphan-actions" name="stop-orphan-actions" value="true"/>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-remove-after-stop" name="remove-after-stop" value="false"/>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-short-resource-names" name="short-resource-names" value="true"/>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-transition-idle-timeout" name="transition-idle-timeout" value="5min"/>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-default-action-timeout" name="default-action-timeout" value="5s"/>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]           <nvpair id="cib-bootstrap-options-is-managed-default" name="is-managed-default" value="true"/>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]         </attributes>
mgmtd[14063]: 2012/08/21_17:35:49 info: login to cib: 4, ret:-10
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]       </cluster_property_set>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]     </crm_config>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]     <nodes/>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]     <resources>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]       <group id="group_1">
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]         <primitive class="ocf" id="IPaddr_192_1_101_212" provider="heartbeat" type="IPaddr">
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]           <operations>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]             <op id="IPaddr_192_1_101_212_mon" interval="5s" name="monitor" timeout="5s"/>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]           </operations>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]           <instance_attributes id="IPaddr_192_1_101_212_inst_attr">
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]             <attributes>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]               <nvpair id="IPaddr_192_1_101_212_attr_0" name="ip" value="192.1.101.212"/>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]             </attributes>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]           </instance_attributes>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]         </primitive>
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]         <primitive class="lsb" id="myhttpd_2" provider="heartbeat" type="myhttpd">
crmd[13744]: 2012/08/21_17:35:49 WARN: cib_native_signon: Connection to CIB failed: connection failed
cib[14549]: 2012/08/21_17:35:49 info: log_data_element: readCibXmlFile: [on-disk]           <operations>
crmd[13744]: 2012/08/21_17:35:50 WARN: do_cib_control: Couldn't complete CIB registration 6 times... pause and retry
cib[14549]: 2012/08/21_17:35:50 info: log_data_element: readCibXmlFile: [on-disk]             <op id="myhttpd_2_mon" interval="120s" name="monitor" timeout="60s"/>
cib[14549]: 2012/08/21_17:35:50 info: log_data_element: readCibXmlFile: [on-disk]           </operations>
cib[14549]: 2012/08/21_17:35:50 info: log_data_element: readCibXmlFile: [on-disk]         </primitive>
cib[14549]: 2012/08/21_17:35:50 info: log_data_element: readCibXmlFile: [on-disk]       </group>
cib[14549]: 2012/08/21_17:35:50 info: log_data_element: readCibXmlFile: [on-disk]     </resources>
cib[14549]: 2012/08/21_17:35:50 info: log_data_element: readCibXmlFile: [on-disk]     <constraints>
cib[14549]: 2012/08/21_17:35:50 info: log_data_element: readCibXmlFile: [on-disk]       <rsc_location id="rsc_location_group_1" rsc="group_1">
cib[14549]: 2012/08/21_17:35:50 info: log_data_element: readCibXmlFile: [on-disk]         <rule id="prefered_location_group_1" score="100">
mgmtd[14063]: 2012/08/21_17:35:50 info: login to cib failed
cib[14549]: 2012/08/21_17:35:50 info: log_data_element: readCibXmlFile: [on-disk]           <expression attribute="#uname" id="prefered_location_group_1_expr" operation="eq" value="kf28-1"/>
mgmtd[14063]: 2012/08/21_17:35:50 ERROR: Can't initialize management library.Shutting down.(-1)
heartbeat[13722]: 2012/08/21_17:35:50 WARN: Exiting /usr/local/lib/heartbeat/mgmtd -v process 14063 returned rc 1.
heartbeat[13722]: 2012/08/21_17:35:50 ERROR: Respawning client "/usr/local/lib/heartbeat/mgmtd -v":
heartbeat[13722]: 2012/08/21_17:35:50 info: Starting child client "/usr/local/lib/heartbeat/mgmtd -v" (0,0)
heartbeat[14561]: 2012/08/21_17:35:50 info: Starting "/usr/local/lib/heartbeat/mgmtd -v" as uid 0  gid 0 (pid 14561)
cib[14549]: 2012/08/21_17:35:50 info: log_data_element: readCibXmlFile: [on-disk]         </rule>
mgmtd[14561]: 2012/08/21_17:35:50 info: G_main_add_SignalHandler: Added signal handler for signal 15
cib[14549]: 2012/08/21_17:35:50 info: log_data_element: readCibXmlFile: [on-disk]       </rsc_location>
mgmtd[14561]: 2012/08/21_17:35:50 debug: Enabling coredumps
crmd[13744]: 2012/08/21_17:35:50 info: crm_timer_popped: Wait Timer (I_NULL) just popped!
cib[14549]: 2012/08/21_17:35:50 info: log_data_element: readCibXmlFile: [on-disk]     </constraints>
mgmtd[14561]: 2012/08/21_17:35:50 info: G_main_add_SignalHandler: Added signal handler for signal 10
crmd[13744]: 2012/08/21_17:35:50 WARN: cib_native_signon: Connection to CIB failed: connection failed
cib[14549]: 2012/08/21_17:35:50 info: log_data_element: readCibXmlFile: [on-disk]   </configuration>
mgmtd[14561]: 2012/08/21_17:35:50 info: G_main_add_SignalHandler: Added signal handler for signal 12
cib[14549]: 2012/08/21_17:35:50 info: log_data_element: readCibXmlFile: [on-disk]   <status/>
mgmtd[14561]: 2012/08/21_17:35:50 info: init_crm
cib[14549]: 2012/08/21_17:35:50 info: log_data_element: readCibXmlFile: [on-disk] </cib>
mgmtd[14561]: 2012/08/21_17:35:50 info: login to cib: 0, ret:-10
cib[14549]: 2012/08/21_17:35:50 notice: readCibXmlFile: Enabling DTD validation on the existing (sane) configuration
cib[14549]: 2012/08/21_17:35:50 info: startCib: CIB Initialization completed successfully
cib[14549]: 2012/08/21_17:35:50 WARN: init_start: CCM Activation failed
cib[14549]: 2012/08/21_17:35:51 WARN: init_start: CCM Connection failed 1 times (30 max)
cib[14549]: 2012/08/21_17:35:52 WARN: init_start: CCM Activation failed
cib[14549]: 2012/08/21_17:35:52 WARN: init_start: CCM Connection failed 2 times (30 max)

作者: applegump    时间: 2012-08-21 18:34
工程的背景,配置,出错信息已经全部附上了,请各位不吝赐教,谢谢大家
作者: sacry    时间: 2012-08-21 19:41
本帖最后由 sacry 于 2012-08-21 19:41 编辑

瞅了两眼没看出root case。
不过有几点

1,使用的是什么版本的...
2,oradb2-1上crm_mon出现[Not connected:Refresh in 3s...]的话,是oradb2-1上crm还没启动起来吧。
3,版本不明,所以这里也不太确定。不过ha.cf里配置了crm yes,那haresources文件应该没用了(也许还有效,但是不推荐再在那里配置)。
4,http有community版脚本的,不需要自己写。
4a,ps | grep会把grep这个进程也算进去,所以会比实际的多。
4b,rh的话可以引入/etc/rc.d/init.d/functions, 里面有status函数。 不是rh也可以参照下其他脚本怎么写的,你现在写的这个用在ha里有点纠结...
5,如果有你用的版本有crm命令的话,可以用crm configure show贴出配置。xml的配置看起来还是有点麻烦的。
5a,cib配置似乎没有什么问题。

以上所说不太能解决你描述的问题的样子,唯一像一点的原因还是2,oradb2-1上crm根本没有启动起来
看log里有:
ccm[13739]: 2012/08/21_17:34:14 ERROR: socket_wait_conn_new: unlink failure(/usr/local/var/run/heartbeat/ccm/ccm): Permission denied

heartbeat[13821]: 2012/08/21_17:34:21 info: Starting "/usr/local/lib/heartbeat/ccm" as uid 503  gid 501

503 501是你的ha用户组?有/usr/local/var/run/heartbeat/ccm/的访问权限吗?
作者: applegump    时间: 2012-08-21 19:58
drwxr-x--- 2        17       65 4096 08-16 19:25 ccm
drwxr-xr-t 2 root      root     4096 08-21 17:34 rsctmp
srwxrwxrwx 1 root      root        0 08-21 17:34 stonithd_callback
srwxrwxrwx 1 root      root        0 08-21 17:34 stonithd
srwxrwxrwx 1 root      root        0 08-21 17:34 register
srwxrwxrwx 1 root      root        0 08-21 17:34 lrm_cmd_sock
srwxrwxrwx 1 root      root        0 08-21 17:34 lrm_callback_sock
drwxr-x--- 2 hacluster haclient 4096 08-21 19:40 crm
作者: applegump    时间: 2012-08-21 20:00
是这个有问题吧,请教怎么解决呢?

直接chown -R hacluster ccm

然后 chgrp -R haclient  ccm

这样吗?

我想应该是我在安装的时候没有注意到什么吧。kf28-1上这个是好的,请教在安装的时候如何规避这个问题呢?

谢谢楼上的
作者: beyondfly    时间: 2012-08-21 21:45
我最近也碰到类似的问题,顶起
作者: applegump    时间: 2012-08-22 09:05
我设置了权限,现在两个节点的crm应该是都起来了。现在两台机器的crm_mon运行情况如下


============
Last updated: Wed Aug 22 08:50:00 2012
Current DC: oradb2-1 (34bd366d-3b86-4cb1-9bb1-b901f0e4e08b)
2 Nodes configured.
1 Resources configured.
============

Node: oradb2-1 (34bd366d-3b86-4cb1-9bb1-b901f0e4e08b): online
Node: kf28-1 (b275747d-b787-43c1-b05e-15a84603ebbf): online

Resource Group: group_1
    IPaddr_192_1_101_212        (heartbeat:cf:IPaddr):        Started kf28-1
    myhttpd_2   (lsb:myhttpd):  Started kf28-1




============
Last updated: Wed Aug 22 08:42:18 2012
Current DC: oradb2-1 (34bd366d-3b86-4cb1-9bb1-b901f0e4e08b)
2 Nodes configured.
1 Resources configured.
============

Node: oradb2-1 (34bd366d-3b86-4cb1-9bb1-b901f0e4e08b): online
Node: kf28-1 (b275747d-b787-43c1-b05e-15a84603ebbf): online

Resource Group: group_1
    IPaddr_192_1_101_212        (heartbeat:cf:IPaddr):        Started kf28-1
    myhttpd_2   (lsb:myhttpd):  Started kf28-1


主节点的heartbeat能正常拉起httpd,能分配到 192.1.101.212这个IP,但是备节点oradb2-1还是不能拉起httpd,主节点的ha-debug日志看不出异常,备节点oradb2-1


的日志里面有少量错误如下:

pengine[2945]: 2012/08/22_08:47:56 notice: cluster_option: Using default value 'stop' for cluster option 'no-quorum-policy'
pengine[2945]: 2012/08/22_08:47:56 notice: cluster_option: Using default value '60s' for cluster option 'cluster-delay'
pengine[2945]: 2012/08/22_08:47:56 notice: cluster_option: Using default value '-1' for cluster option 'pe-error-series-max'
pengine[2945]: 2012/08/22_08:47:56 notice: cluster_option: Using default value '-1' for cluster option 'pe-warn-series-max'
pengine[2945]: 2012/08/22_08:47:56 notice: cluster_option: Using default value '-1' for cluster option 'pe-input-series-max'
pengine[2945]: 2012/08/22_08:47:56 notice: cluster_option: Using default value 'true' for cluster option 'startup-fencing'
pengine[2945]: 2012/08/22_08:47:56 info: determine_online_status: Node oradb2-1 is online
pengine[2945]: 2012/08/22_08:47:56 info: determine_online_status: Node kf28-1 is online
pengine[2945]: 2012/08/22_08:47:56 ERROR: native_add_running: Resource lsb::myhttpd:myhttpd_2 appears to be active on 2 nodes.
pengine[2945]: 2012/08/22_08:47:56 ERROR: See http://linux-ha.org/v2/faq/resource_too_active for more information.
pengine[2945]: 2012/08/22_08:47:56 info: group_print: Resource Group: group_1
pengine[2945]: 2012/08/22_08:47:56 info: native_print:     IPaddr_192_1_101_212 (heartbeat:cf:IPaddr):        Stopped
pengine[2945]: 2012/08/22_08:47:56 info: native_print:     myhttpd_2    (lsb:myhttpd)
pengine[2945]: 2012/08/22_08:47:56 info: native_print:  0 : oradb2-1
pengine[2945]: 2012/08/22_08:47:56 info: native_print:  1 : kf28-1
pengine[2945]: 2012/08/22_08:47:56 info: native_color: Combine scores from myhttpd_2 and IPaddr_192_1_101_212
pengine[2945]: 2012/08/22_08:47:56 notice: StartRsc:  kf28-1    Start IPaddr_192_1_101_212
pengine[2945]: 2012/08/22_08:47:57 notice: Recurring: kf28-1       IPaddr_192_1_101_212_monitor_5000
pengine[2945]: 2012/08/22_08:47:57 ERROR: native_create_actions: Attempting recovery of resource myhttpd_2
pengine[2945]: 2012/08/22_08:47:57 notice: StopRsc:   oradb2-1  Stop myhttpd_2
pengine[2945]: 2012/08/22_08:47:57 notice: StopRsc:   kf28-1    Stop myhttpd_2
pengine[2945]: 2012/08/22_08:47:57 notice: StartRsc:  kf28-1    Start myhttpd_2
pengine[2945]: 2012/08/22_08:47:57 notice: Recurring: kf28-1       myhttpd_2_monitor_120000


我在网上查了一下“ERROR: native_add_running: Resource lsb::myhttpd:myhttpd_2 appears to be active on 2 nodes”这个错误,找到了下面这个页面

http://www.gossamer-threads.com/lists/linuxha/users/65560

这个页面描述的问题跟我的好像比较类似,可是我英文不是太好,背景知识也不够,看不懂是什么意思,请教大家能不能给我一点指教,谢谢大家






作者: applegump    时间: 2012-08-22 09:07
前述的朋友提到软件版本,我安装的heartbeat是heartbeat-2.0.8
作者: sacry    时间: 2012-08-22 09:29
http://www.clusterlabs.org/wiki/FAQ#Resource_is_Too_Active
作者: sacry    时间: 2012-08-22 09:31
主节点的heartbeat能正常拉起httpd,能分配到 192.1.101.212这个IP,但是备节点oradb2-1还是不能拉起httpd


primitive资源本来就不能同时启动,只能Failover。
如果有crm命令的话,贴一下crm configure show的结果,xml看起来麻烦。
不过多半不是资源配置的问题,你那lsb脚本....
作者: applegump    时间: 2012-08-22 09:35
Pacemaker will try and determine what resources are active on a machine when it starts. To do this, it sends what we call a probe which uses the monitor operation of your ResourceAgent.
There are two common reasons for seeing this message:
Your resource really is active on more than one node
Check you are _not_ starting it on boot
Did Pacemaker suffer an internal failure? If so, please check the Help:Contents page and report it
Your resource doesn't implement the monitor operation correctly
Make sure your Resource Agent conforms to the OCF-spec by using the ocf-tester script


我查到了这段FAQ,看这个意思,在正常情况下

httpd似乎本来就不应该在oradb2-1上被启动,只有kf28-1上的httpd服务停掉了,随着vip转移到oradb2-1上,oradb2-1上的httpd才被启动。

这样理解对吗?

那我应该怎么做配置呢?





作者: sacry    时间: 2012-08-22 09:43
本帖最后由 sacry 于 2012-08-22 09:52 编辑

如果你贴出来的脚本是全部的话,那铁定错了。

heartbeat要的lsb脚本是遵循linux standard base的脚本(主要是指返回值)。
http://linux-ha.org/wiki/LSB_Resource_Agents
http://refspecs.linuxbase.org/LS ... ic/iniscrptact.html

你写的脚本是给人看,不是给程序看的。
不管启动停止都是echo,返回值都是0,
两台机器,不管自己的http有没有启动,读到的都是0(program is running or service is OK),所以会悲剧。

PS;判断程序是否启动不是这么判断的..
一个简单通用的做法是
  1. if [ -f $pidfile ] ; then
  2.     pid=`cat $pidfile`
  3.     if [ "x"$pid != "x" ] && kill -0 $pid > /dev/null 2>&1 ; then
  4.         return 0
  5.     else
  6.         return 1
  7.     fi
  8. else
  9.     return 3
  10. fi
复制代码

作者: applegump    时间: 2012-08-22 09:43
找不到crm命令啊。我find了一下,这个版本没有crm命令,只有crm_mon

我再把两台机器上的cib.xml贴出来,请各位看看

<cib admin_epoch="0" epoch="0" num_updates="0">
        <configuration>
                <crm_config>
                        <cluster_property_set id="cib-bootstrap-options">
                                <attributes>
                                        <nvpair id="cib-bootstrap-options-symmetric-cluster" name="symmetric-cluster" value="true"/>
                                        <nvpair id="cib-bootstrap-options-no_quorum-policy" name="no_quorum-policy" value="stop"/>
                                        <nvpair id="cib-bootstrap-options-default-resource-stickiness" name="default-resource-stickiness" value="0"/>
                                        <nvpair id="cib-bootstrap-options-default-resource-failure-stickiness" name="default-resource-failure-stickiness" value="0"/>
                                        <nvpair id="cib-bootstrap-options-stonith-enabled" name="stonith-enabled" value="false"/>
                                        <nvpair id="cib-bootstrap-options-stonith-action" name="stonith-action" value="reboot"/>
                                        <nvpair id="cib-bootstrap-options-stop-orphan-resources" name="stop-orphan-resources" value="true"/>
                                        <nvpair id="cib-bootstrap-options-stop-orphan-actions" name="stop-orphan-actions" value="true"/>
                                        <nvpair id="cib-bootstrap-options-remove-after-stop" name="remove-after-stop" value="false"/>
                                        <nvpair id="cib-bootstrap-options-short-resource-names" name="short-resource-names" value="true"/>
                                        <nvpair id="cib-bootstrap-options-transition-idle-timeout" name="transition-idle-timeout" value="5min"/>
                                        <nvpair id="cib-bootstrap-options-default-action-timeout" name="default-action-timeout" value="5s"/>
                                        <nvpair id="cib-bootstrap-options-is-managed-default" name="is-managed-default" value="true"/>
                                </attributes>
                        </cluster_property_set>
                </crm_config>
                <nodes/>
                <resources>
                        <group id="group_1">
                                <primitive class="ocf" id="IPaddr_192_1_101_212" provider="heartbeat" type="IPaddr">
                                        <operations>
                                                <op id="IPaddr_192_1_101_212_mon" interval="5s" name="monitor" timeout="5s"/>
                                        </operations>
                                        <instance_attributes id="IPaddr_192_1_101_212_inst_attr">
                                                <attributes>
                                                        <nvpair id="IPaddr_192_1_101_212_attr_0" name="ip" value="192.1.101.212"/>
                                                </attributes>
                                        </instance_attributes>
                                </primitive>
                                <primitive class="lsb" id="myhttpd_2" provider="heartbeat" type="myhttpd">
                                        <operations>
                                                <op id="myhttpd_2_mon" interval="120s" name="monitor" timeout="60s"/>
                                        </operations>
                                </primitive>
                        </group>
                </resources>
                <constraints>
                        <rsc_location id="rsc_location_group_1" rsc="group_1">
                                <rule id="prefered_location_group_1" score="100">
                                        <expression attribute="#uname" id="prefered_location_group_1_expr" operation="eq" value="kf28-1"/>
                                </rule>
                        </rsc_location>
                </constraints>
        </configuration>
        <status/>
</cib>


作者: applegump    时间: 2012-08-22 09:47
22楼的兄弟,我先改改脚本,受教了,谢谢
作者: applegump    时间: 2012-08-22 10:52
脚本不会写,哪位能指点指点啊
作者: applegump    时间: 2012-08-22 12:59
sacry:


您好

我现在在配heartbeat的crm模式。目前的状况还是启动脚本不太会写,就是符合lsb规范的apache httpd脚本。

我使用的apache httpd是自己安装的,安装路径是/users/ems/apache

您是否能给我一个例子?我对shell 编程了解不多,麻烦您了
作者: sacry    时间: 2012-08-22 13:43
回复 26# applegump

apache httpd这种很流行的软件,community肯定会提供脚本的。
事实上推荐的是OCF标准的脚本而不是LSB标准的,你设置的IP资源就是用的OCF的脚本
<primitive class="ocf" id="IPaddr_192_1_101_212" provider="heartbeat" type="IPaddr">

可以试着用这个命令找一下ocf资源放哪个文件夹的,默认提供了哪些脚本。
    locate resource.d

想自己写的话,即使不熟悉shell,照着规范努力下也能敲出来。但是测试环境用用可以,生产环境就.....


=====

为什么还要使用2.0.8版本啊,应该没什么特别的好处。


作者: applegump    时间: 2012-08-22 13:58
您好!

你说我设的IP资源是用OCF脚本,这个我还真没仔细看,我这个礼拜才接触heartbeat,很多东西还不是很明白,见笑了。

查了一下 /usr/lib/ocf/resource.d/heartbeat下面确实有个名叫apache的脚本,但是使用这个脚本 ./apache start,启动的好像不是我自己安装的apache,而是系统默认自带的,我的apache安装在/users/ems/apache下。

而且,我这个apache是通过源码编译过来的,自定义了一些功能,它加入了ajp模块,能向后端的tomcat转发请求。我试着改了一下/usr/lib/ocf/resource.d/heartbeat/apache这个脚本,把我安装的apache httpd加了进去,

再./apache start ,报了错。

不知道您有什么建议

2.0.8是我随手在网上搜了一个安装教程,随手找到的一个安装版本,我也不知道这个版本怎样,现在heartbeat最新是什么版本,哪个版本比较稳定,我随手下了这个2.0.8,安装是正常的,在crm功能之前,它表现也比较正常

我就没理会版本的问题了,毕竟接触这东西也才三四天,很多还不明白,见笑了
作者: applegump    时间: 2012-08-22 14:18
刚才改了一下/usr/lib/ocf/resource.d/heartbeat/apache这个脚本  

把  HTTPDLIST="/sbin/httpd2 /usr/sbin/httpd2 /sbin/httpd /usr/sbin/httpd $IBMHTTPD" 注掉了

新加了  HTTPDLIST="/users/ems/apache/bin/httpd $IBMHTTPD"

把 DEFAULT_NORMCONFIG="/etc/apache2/httpd.conf" 注掉了

新加了DEFAULT_NORMCONFIG="/users/ems/apache/conf/httpd.conf"

现在使用./apache start 能启动我自己安装的apache httpd

但是使用./apache stop  不能停止

使用./apache status   报  2012/08/22_14:04:23 INFO: apache is stopped.   可是这时apache正在运行着

使用./apache status  报  2012/08/22_14:04:04 ERROR: Monitoring not supported by /users/ems/apache/conf/httpd.conf

不知道是怎么回事


作者: sacry    时间: 2012-08-22 15:00
本帖最后由 sacry 于 2012-08-22 15:01 编辑

回复 28# applegump

===================================
》我真正想说的

ocf脚本不是这么启动的,这又是另外一个挺长的话题。

不知道你是学习,还是急着就要部署。
如果时间充分的话重头来比较好,重装一个新点的版本什么的。
不是稳定不稳定的问题,而是2.0.x其实文档较少(整个heartbeat文档就不多),依赖MaillingList,别人的经验。
相比之下,3.x的crm其实就是pacemaker,资料全一点(虽然是英文的)。

===================================
》 可能对你有用的。

LSB是Linux Starnd Base,其实就是/etc/init.d下面的东西,一般可以用来被集群使用,
但是有一些不足,比如不能传参数,只能启动一个实例等。
OCF是open cluster framework,集群专用的标准,对使用来说最大的特点就是【参数】。

想命令行启动的话
export OCF_ROOT=/usr/lib/ocf
export OCF_RESKEY_httpd=/apache/bin/httpd
export OCF_RESKEY_configfile=/apache/conf/httpd.conf
注:我记得要配置PIDFILE的样子。
然后再使用脚本。

想配置到集群的话,没有crm命令只能用cibadmin命令改xml,参照pacemaker文档配吧,
简单点的只要照着ip那个来就可以了。

===================================
》 补充说明

-Hearbeat 1.0
    只支持2台机器...
    没有资源管理。(也就是不可以Apache在节点A上跑MySQL在节点B上跑,要么全A要么全B。)

-Heartbeat 2.0
   多了CRM(集群资源管理) 同时支持旧的haresources和新的cib.xml配置。

-Heartbeat 3.0
   CRM从Heartbeat项目分离成Pacemaker,成为主要要使用的东西,
   Heartbeat本身只是个通信层而已,man一下ha.cf各种deprecated。

heartbeat2.0的最后版本是2.0.14的样子,已经是很久以前的事了(估计四五年是有的了)。
目前heartbeat社区这么个东西已经废了,pacemaker的还挺活跃的,文档还在更新。

所以如果没什么特别要求,现在开始学的话学3(其实是学Pacemaker了)可能好点,资料相对较多。

===================================
》 其他

问道有先后,术业有专攻,人人都是“在路上”,没什么见笑。

我不懂,因为我不想懂不需要懂!
我不懂,但是我明天会懂!!
学,必有所得!
作者: applegump    时间: 2012-08-28 13:40
明白了,谢谢。
作者: hustlxf    时间: 2012-09-03 15:17
可以加我qq:693093804吗?我也遇到crm的一个问题了




欢迎光临 Chinaunix (http://bbs.chinaunix.net/) Powered by Discuz! X3.2