- 论坛徽章:
- 0
|
os :redhat linux 4.3
cluster:cluster 4.3
手工切换的时候,由于在b机上不能umount某个共享的文件系统,导致不能进行接管?
为什么啊?
没有使用gfs
cluster.conf文件
<?xml version="1.0"?>
<cluster config_version="10" name="alpha_cluster">
<fence_daemon post_fail_delay="0" post_join_delay="3"/>
<clusternodes>
<clusternode name="mainserver1" votes="1">
<fence/>
</clusternode>
<clusternode name="mainserver2" votes="1">
<fence/>
</clusternode>
</clusternodes>
<cman expected_votes="1" two_node="1"/>
<fencedevices>
<fencedevice agent="fence_manual" name="test"/>
</fencedevices>
<rm>
<failoverdomains>
<failoverdomain name="oracle" ordered="1" restricted="1">
<failoverdomainnode name="mainserver1" priority="1"/>
<failoverdomainnode name="mainserver2" priority="2"/>
</failoverdomain>
</failoverdomains>
<resources>
<ip address="138.148.221.3" monitor_link="1"/>
<fs device="/dev/mapper/VolGroupArray-lv_oracle_data" force_fsck="0" force_unmount="1" fsid="15674" fstype="ext3" mountpoint="/data/oradata" name="oradata" options="" self_fence="1"/>
<fs device="/dev/mapper/VolGroupArray-lv_oracle_log" force_fsck="0" force_unmount="1" fsid="36402" fstype="ext3" mountpoint="/data/oralog" name="oralog" options="" self_fence="0"/>
<fs device="/dev/mapper/VolGroupArray-lv_images" force_fsck="0" force_unmount="1" fsid="48746" fstype="ext3" mountpoint="/data/images" name="images" options="" self_fence="0"/>
<script file="/etc/init.d/hongsy.sh" name="orace"/>
<script file="/etc/init.d/cluster_svr" name="cluster_svr"/>
</resources>
<service autostart="1" domain="oracle" name="oracle">
<ip ref="138.148.221.3"/>
<fs ref="oradata"/>
<fs ref="oralog"/>
<fs ref="images"/>
<script ref="orace"/>
</service>
</rm>
</cluster>
相关操作系统日志
Sep 28 03:24:46 mainserver1 ccsd[6494]: Starting ccsd 1.0.3:
Sep 28 03:24:46 mainserver1 ccsd[6494]: Built: Jan 25 2006 16:54:55
Sep 28 03:24:46 mainserver1 ccsd[6494]: Copyright (C) Red Hat, Inc. 2004 All rights reserved.
Sep 28 03:24:47 mainserver1 ccsd: startup succeeded
Sep 28 03:24:52 mainserver1 kernel: CMAN 2.6.9-43.8 (built Feb 26 2006 21:06:1 installed
Sep 28 03:24:52 mainserver1 kernel: NET: Registered protocol family 30
Sep 28 03:24:52 mainserver1 ccsd[6494]: cluster.conf (cluster name = alpha_cluster, version = found.
Sep 28 03:24:52 mainserver1 ccsd[6494]: Remote copy of cluster.conf is from quorate node.
Sep 28 03:24:52 mainserver1 ccsd[6494]: Local version # : 8
Sep 28 03:24:52 mainserver1 ccsd[6494]: Remote version #: 9
Sep 28 03:24:52 mainserver1 ccsd[6494]: Switching to remote copy.
Sep 28 03:24:52 mainserver1 kernel: CMAN: Waiting to join or form a Linux-cluster
Sep 28 03:24:52 mainserver1 ccsd[6494]: Connected to cluster infrastruture via: CMAN/SM Plugin v1.1.5
Sep 28 03:24:52 mainserver1 ccsd[6494]: Initial status:: Inquorate
Sep 28 03:24:52 mainserver1 kernel: CMAN: sending membership request
Sep 28 03:24:52 mainserver1 kernel: CMAN: got node mainserver2
Sep 28 03:24:52 mainserver1 kernel: CMAN: quorum regained, resuming activity
Sep 28 03:24:52 mainserver1 ccsd[6494]: Cluster is quorate. Allowing connections.
Sep 28 03:24:53 mainserver1 kernel: DLM 2.6.9-41.7 (built Feb 26 2006 21:30:10) installed
Sep 28 03:24:53 mainserver1 cman: startup succeeded
Sep 28 03:24:59 mainserver1 fenced: startup succeeded
Sep 28 03:25:13 mainserver1 lock_gulmd: no <gulm> section detected in /etc/cluster/cluster.conf succeeded
Sep 28 03:25:18 mainserver1 clurgmgrd[6580]: <notice> Resource Group Manager Starting
Sep 28 03:25:18 mainserver1 clurgmgrd[6580]: <info> Loading Service Data
Sep 28 03:25:18 mainserver1 rgmanager: clurgmgrd 启动 succeeded
Sep 28 03:25:19 mainserver1 clurgmgrd[6580]: <info> Initializing Services
Sep 28 03:25:19 mainserver1 clurgmgrd: [6580]: <info> Executing /etc/init.d/hongsy.sh stop
Sep 28 03:25:19 mainserver1 su(pam_unix)[6691]: session opened for user oracle by (uid=0)
Sep 28 03:25:19 mainserver1 su(pam_unix)[6691]: session closed for user oracle
Sep 28 03:25:19 mainserver1 su(pam_unix)[6737]: session opened for user oracle by (uid=0)
Sep 28 03:25:22 mainserver1 su(pam_unix)[6737]: session closed for user oracle
Sep 28 03:25:22 mainserver1 clurgmgrd: [6580]: <info> /dev/mapper/VolGroupArray-lv_oracle_data is not mounted
Sep 28 03:25:24 mainserver1 clurgmgrd: [6580]: <info> /dev/mapper/VolGroupArray-lv_oracle_log is not mounted
Sep 28 03:25:26 mainserver1 clurgmgrd: [6580]: <info> /dev/mapper/VolGroupArray-lv_images is not mounted
Sep 28 03:25:28 mainserver1 clurgmgrd[6580]: <info> Services Initialized
Sep 28 03:25:28 mainserver1 clurgmgrd[6580]: <info> Logged in SG "usrm::manager"
Sep 28 03:25:28 mainserver1 clurgmgrd[6580]: <info> Magma Event: Membership Change
Sep 28 03:25:28 mainserver1 clurgmgrd[6580]: <info> State change: Local UP
Sep 28 03:25:30 mainserver1 clurgmgrd[6580]: <notice> Starting stopped service oracle
Sep 28 03:25:30 mainserver1 clurgmgrd: [6580]: <info> mounting /dev/mapper/VolGroupArray-lv_oracle_data on /data/oradata
Sep 28 03:25:30 mainserver1 kernel: kjournald starting. Commit interval 5 seconds
Sep 28 03:25:30 mainserver1 kernel: EXT3-fs warning: maximal mount count reached, running e2fsck is recommended
Sep 28 03:25:30 mainserver1 kernel: EXT3 FS on dm-2, internal journal
Sep 28 03:25:30 mainserver1 kernel: EXT3-fs: mounted filesystem with ordered data mode.
Sep 28 03:25:30 mainserver1 clurgmgrd: [6580]: <info> mounting /dev/mapper/VolGroupArray-lv_oracle_log on /data/oralog
Sep 28 03:25:31 mainserver1 kernel: kjournald starting. Commit interval 5 seconds
Sep 28 03:25:31 mainserver1 kernel: EXT3-fs warning: checktime reached, running e2fsck is recommended
Sep 28 03:25:31 mainserver1 kernel: EXT3 FS on dm-4, internal journal
Sep 28 03:25:31 mainserver1 kernel: EXT3-fs: recovery complete.
Sep 28 03:25:31 mainserver1 kernel: EXT3-fs: mounted filesystem with ordered data mode.
Sep 28 03:25:31 mainserver1 clurgmgrd: [6580]: <info> mounting /dev/mapper/VolGroupArray-lv_images on /data/images
Sep 28 03:25:31 mainserver1 kernel: kjournald starting. Commit interval 5 seconds
Sep 28 03:25:31 mainserver1 kernel: EXT3-fs warning: mounting unchecked fs, running e2fsck is recommended
Sep 28 03:25:31 mainserver1 kernel: EXT3 FS on dm-3, internal journal
Sep 28 03:25:31 mainserver1 kernel: EXT3-fs: mounted filesystem with ordered data mode.
Sep 28 03:25:31 mainserver1 clurgmgrd: [6580]: <info> Adding IPv4 address 138.148.221.3 to eth2
Sep 28 03:25:32 mainserver1 clurgmgrd: [6580]: <info> Executing /etc/init.d/hongsy.sh start
Sep 28 03:25:32 mainserver1 su(pam_unix)[7120]: session opened for user oracle by (uid=0)
Sep 28 03:25:42 mainserver1 clurgmgrd[6580]: <info> Magma Event: Membership Change
Sep 28 03:25:42 mainserver1 clurgmgrd[6580]: <info> State change: mainserver2 UP
Sep 28 03:27:16 mainserver1 kernel: oracle(719 : floating-point assist fault at ip 4000000009f5e0e2, isr 0000020000001001
Sep 28 03:27:16 mainserver1 last message repeated 3 times
Sep 28 03:27:17 mainserver1 su(pam_unix)[7120]: session closed for user oracle
Sep 28 03:27:17 mainserver1 su(pam_unix)[7207]: session opened for user oracle by (uid=0)
Sep 28 03:27:19 mainserver1 su(pam_unix)[7207]: session closed for user oracle
Sep 28 03:27:26 mainserver1 clurgmgrd[6580]: <notice> Service oracle started
Sep 28 03:28:02 mainserver1 clurgmgrd: [6580]: <info> Executing /etc/init.d/hongsy.sh status
Sep 28 03:28:04 mainserver1 su(pam_unix)[7820]: session opened for user oracle by root(uid=0)
Sep 28 03:28:25 mainserver1 su(pam_unix)[7820]: session closed for user oracle
Sep 28 03:28:32 mainserver1 clurgmgrd: [6580]: <info> Executing /etc/init.d/hongsy.sh status
Sep 28 03:29:33 mainserver1 last message repeated 2 times
Sep 28 03:29:37 mainserver1 clurgmgrd[6580]: <notice> Stopping service oracle
Sep 28 03:29:37 mainserver1 clurgmgrd: [6580]: <info> Executing /etc/init.d/hongsy.sh stop
Sep 28 03:29:40 mainserver1 su(pam_unix)[8927]: session opened for user oracle by (uid=0)
Sep 28 03:29:41 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 1)
Sep 28 03:29:52 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 2)
Sep 28 03:29:55 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 1)
Sep 28 03:29:57 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 2)
Sep 28 03:29:59 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 1)
Sep 28 03:30:01 mainserver1 crond(pam_unix)[8975]: session opened for user root by (uid=0)
Sep 28 03:30:01 mainserver1 crond(pam_unix)[8974]: session opened for user root by (uid=0)
Sep 28 03:30:01 mainserver1 su(pam_unix)[8976]: session opened for user oracle by (uid=0)
Sep 28 03:30:09 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 2)
Sep 28 03:30:10 mainserver1 su(pam_unix)[8927]: session closed for user oracle
Sep 28 03:30:10 mainserver1 su(pam_unix)[9008]: session opened for user oracle by (uid=0)
Sep 28 03:30:10 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 1)
Sep 28 03:30:11 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 2)
Sep 28 03:30:13 mainserver1 su(pam_unix)[9008]: session closed for user oracle
Sep 28 03:30:14 mainserver1 clurgmgrd: [6580]: <info> Removing IPv4 address 138.148.221.3 from eth2
Sep 28 03:30:15 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 1)
Sep 28 03:30:22 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 2)
Sep 28 03:30:23 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 1)
Sep 28 03:30:26 mainserver1 clurgmgrd: [6580]: <info> unmounting /data/oradata
Sep 28 03:30:27 mainserver1 clurgmgrd: [6580]: <info> unmounting /data/oralog
Sep 28 03:30:27 mainserver1 clurgmgrd: [6580]: <notice> Forcefully unmounting /data/oralog
Sep 28 03:30:28 mainserver1 clurgmgrd: [6580]: <warning> killing process 4644 (root gam_serve /data/oralog)
Sep 28 03:30:29 mainserver1 clurgmgrd: [6580]: <crit> Could not clean up mountpoint /data/oralog
Sep 28 03:30:32 mainserver1 su(pam_unix)[8976]: session closed for user oracle
Sep 28 03:30:34 mainserver1 clurgmgrd: [6580]: <info> unmounting /data/oralog
Sep 28 03:30:34 mainserver1 clurgmgrd: [6580]: <notice> Forcefully unmounting /data/oralog
Sep 28 03:30:34 mainserver1 clurgmgrd: [6580]: <warning> killing process 9203 (root gam_serve /data/oralog)
Sep 28 03:30:35 mainserver1 clurgmgrd: [6580]: <crit> Could not clean up mountpoint /data/oralog
Sep 28 03:30:35 mainserver1 clurgmgrd: [6580]: <err> 'umount /data/oralog' failed, error=0
Sep 28 03:30:35 mainserver1 clurgmgrd[6580]: <notice> stop on fs "oralog" returned 2 (invalid argument(s))
Sep 28 03:30:35 mainserver1 clurgmgrd[6580]: <crit> #12: RG oracle failed to stop; intervention required
Sep 28 03:30:35 mainserver1 clurgmgrd[6580]: <notice> Service oracle is failed
Sep 28 03:30:36 mainserver1 clurgmgrd[6580]: <warning> #70: Attempting to restart service oracle locally.
Sep 28 03:30:36 mainserver1 clurgmgrd[6580]: <err> #43: Service oracle has failed; can not start.
Sep 28 03:30:36 mainserver1 clurgmgrd[6580]: <alert> #2: Service oracle returned failure code. Last Owner: mainserver1
Sep 28 03:30:36 mainserver1 clurgmgrd[6580]: <alert> #4: Administrator intervention required.
Sep 28 03:30:44 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 2)
Sep 28 03:30:47 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 1)
Sep 28 03:31:01 mainserver1 crond(pam_unix)[8974]: session closed for user root
Sep 28 03:31:01 mainserver1 crond(pam_unix)[8975]: session closed for user root
Sep 28 03:31:33 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 1)
Sep 28 03:31:33 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 2)
Sep 28 03:31:36 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 1)
Sep 28 03:31:42 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 2)
Sep 28 03:32:36 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 1)
Sep 28 03:33:08 mainserver1 htt_server[3175]: status has not been enabled yet. (1, 2)
Sep 28 03:38:11 mainserver1 kernel: clurgmgrd(9343): unaligned access to 0x2000000001b2e904, ip=0x4000000000010091
Sep 28 03:38:11 mainserver1 kernel: clurgmgrd(9343): unaligned access to 0x2000000001b2e904, ip=0x40000000000100b0
Sep 28 03:38:11 mainserver1 kernel: clurgmgrd(9343): unaligned access to 0x2000000001b2e90c, ip=0x40000000000100f1
Sep 28 03:38:11 mainserver1 kernel: clurgmgrd(9343): unaligned access to 0x2000000001b2e90c, ip=0x4000000000010110
请兄弟们说道说道? |
|