免费注册 查看新帖 |

Chinaunix

  平台 论坛 博客 文库
最近访问板块 发新帖
楼主: SUNfan
打印 上一主题 下一主题

redhat AS 4用RHCS做HA,断掉第一台机器网线,服务不能切换! [复制链接]

论坛徽章:
0
21 [报告]
发表于 2006-12-28 16:03 |只看该作者
嗯,好的,我马上去看看添加fence关于brocade switch的。
您说AS4的RHCS里面添加了”quorum disk“,是不是在共享磁盘里面添加两个共享裸分区,用于fence?
但是我不知道共享磁盘,在图里面什么地方添加?

123.JPG (65.54 KB, 下载次数: 42)

123.JPG

论坛徽章:
0
22 [报告]
发表于 2006-12-28 16:06 |只看该作者
Red Hat Cluster Suite 4 RHEL4 U4 Release Notes


Copyright(c) 2006 Red Hat, Inc.
        -------------------------------------------------------

September 19, 2006

Introduction

   The following topics are covered in this document:
   
     o Changes to Red Hat Cluster Suite 4

     o Important Notes
     
     o Bugs Fixed in the Release

     o Related Documentation
     
   
Changes to Red Hat Cluster Suite 4

  Quorum Disk
   
   Quorum Disk is a new feature available with this release. The
   Quorum Disk feature (also known as qdisk) allows you to configure
   arbitrary heuristics so that each cluster member can determine its
   fitness for participating in a cluster. The fitness information is
   communicated to other cluster members via a "quorum disk" residing
   on shared storage.
   
   With properly configured heuristics, you could define the following
   cluster behavior:

   * In the event of a network-partition failure, provide a method to
     decide which member wins the fence race in a two-node cluster.

   * Allow continued cluster operation after a majority failure without
     manual intervention.

  Quorum Disk communicates with CMAN, ccsd (the Cluster Configuration
  System daemon), and shared storage. It communicates with CMAN to
  advertise quorum-device availability. It communicates with ccsd to
  obtain configuration information. It communicates with shared storage to
  check and record states.

  You can find more information about Quorum Disk in the following man
  pages: mkqdisk(, qdiskd(, and qdisk(5).

  NOTE: For this release, you must configure Quorum Disk by editing
  the cluster configuration file, /etc/cluster/cluster.conf, directly
  rather than by using the cluster configuration graphical user
  interface (system-config-cluster).


ccs_tool Enhancements
  
  The ccs_tool includes new commands for this release. The new
  commands provide the ability to configure certain portions of the
  cluster configuration file (/etc/cluster/cluster.conf). In previous
  releases, the only tool available for creating and managing the
  cluster configuration file was the Cluster Configuration GUI
  (system-config-cluster). For more information about the new commands
  and usage examples, refer to the ccs_tool man page, ccs_tool(.


Important Notes

  The up2date command has changed for RHEL4 U4. When installing
  cluster suite software, use this syntax:

  up2date --installall=<channel-label>

论坛徽章:
0
23 [报告]
发表于 2006-12-28 16:12 |只看该作者
好像没有看到如何添加仲裁磁盘到配置文件中啊?

论坛徽章:
0
24 [报告]
发表于 2006-12-28 16:20 |只看该作者
这种APC,Brocade Switch的fence,都是自动的把,看了一下brocade的fence配置,好像就配这四项:
配置的信息,是不是直接远程登陆到这台机器的IP地址的用户名和密码?
还有是不是这些地址是不是要和服务器的地址一个网段,或者至少是相互能ping通?

456.JPG (25.36 KB, 下载次数: 53)

456.JPG

论坛徽章:
0
25 [报告]
发表于 2006-12-28 16:50 |只看该作者
You can find more information about Quorum Disk in the following man
  pages: mkqdisk(, qdiskd(, and qdisk(5).

论坛徽章:
0
26 [报告]
发表于 2006-12-28 16:59 |只看该作者
通过man看的东西太混乱,有没有直接说如何设置共享的仲裁空间的?

论坛徽章:
0
27 [报告]
发表于 2006-12-29 19:18 |只看该作者
断网之后,sybase服务切换,切换一直不过去,看了第二台机器的日志信息如下:
Dec 29 17:30:21 web kernel: bnx2: eth1: using MSI
Dec 29 17:30:21 web kernel: bonding: bond0: enslaving eth1 as a backup interface with a down link.
Dec 29 17:30:21 web kernel: ip_tables: (C) 2000-2002 Netfilter core team
Dec 29 17:30:21 web kernel: bnx2: eth0: using MSI
Dec 29 17:30:21 web kernel: bnx2: eth1 NIC Link is Up, 1000 Mbps full duplex, receive & transmit flow control ON
Dec 29 17:30:21 web kernel: bonding: bond0: link status definitely up for interface eth1.
Dec 29 17:30:21 web kernel: bonding: bond0: making interface eth1 the new active one.
Dec 29 17:30:21 web kernel: bnx2: eth0 NIC Link is Up, 1000 Mbps full duplex, receive & transmit flow control ON
Dec 29 17:30:21 web kernel: ip_tables: (C) 2000-2002 Netfilter core team
Dec 29 17:30:21 web kernel: NET: Registered protocol family 10
Dec 29 17:30:21 web kernel: Disabled Privacy Extensions on device c0344160(lo)
Dec 29 17:30:21 web kernel: IPv6 over IPv4 tunneling driver
Dec 29 17:30:21 web kernel: CMAN 2.6.9-45.2 (built Jul 13 2006 11:42:36) installed
Dec 29 17:30:22 web kernel: NET: Registered protocol family 30
Dec 29 17:30:22 web kernel: DLM 2.6.9-42.10 (built Jul 13 2006 11:48:04) installed
Dec 29 17:30:22 web kernel: CMAN: Waiting to join or form a Linux-cluster
Dec 29 17:30:22 web kernel: CMAN: sending membership request
Dec 29 17:30:22 web kernel: CMAN: sending membership request
Dec 29 17:30:22 web kernel: CMAN: got node sybase
Dec 29 17:30:22 web kernel: CMAN: quorum regained, resuming activity
Dec 29 17:31:43 web fenced: startup succeeded
Dec 29 17:31:43 web kernel: Attached scsi generic sg0 at scsi0, channel 0, id 8, lun 0,  type 13
Dec 29 17:31:43 web kernel: Attached scsi generic sg1 at scsi0, channel 2, id 0, lun 0,  type 0
Dec 29 17:31:43 web kernel: Attached scsi generic sg2 at scsi1, channel 0, id 0, lun 0,  type 0
Dec 29 17:31:43 web kernel: Attached scsi generic sg3 at scsi1, channel 0, id 0, lun 1,  type 0
Dec 29 17:31:43 web kernel: Attached scsi generic sg4 at scsi1, channel 0, id 0, lun 2,  type 0
Dec 29 17:31:46 web Navisphere Agent[4243]: Agent initializing with pid 4243
Dec 29 17:31:46 web EV_AGENT[4254]: Agent daemon process created, pid 4254
Dec 29 17:31:46 web EV_AGENT[4254]: Agent has started up.
Dec 29 17:31:46 web naviagent: naviagent startup succeeded
Dec 29 17:31:46 web netfs: Mounting other filesystems:  succeeded
Dec 29 17:31:46 web kernel: i2c /dev entries driver
Dec 29 17:31:46 web rc: Starting lm_sensors:  succeeded
Dec 29 17:31:46 web autofs: automount startup succeeded
Dec 29 17:31:46 web smartd[4335]: smartd version 5.33 [i386-redhat-linux-gnu] Copyright (C) 2002-4 Bruce Allen
Dec 29 17:31:46 web smartd[4335]: Home page is [url]http://smartmontools.sourceforge.net/[/url]  
Dec 29 17:31:46 web smartd[4335]: Opened configuration file /etc/smartd.conf
Dec 29 17:31:46 web smartd[4335]: Configuration file /etc/smartd.conf parsed.
Dec 29 17:31:46 web smartd[4335]: Device: /dev/sda, opened
Dec 29 17:31:46 web smartd[4335]: Device: /dev/sda, Bad IEC (SMART) mode page, err=-5, skip device
Dec 29 17:31:46 web smartd[4335]: Unable to register SCSI device /dev/sda at line 30 of file /etc/smartd.conf
Dec 29 17:31:46 web smartd[4335]: Unable to register device /dev/sda (no Directive -d removable). Exiting.
Dec 29 17:31:46 web smartd: smartd startup failed
Dec 29 17:31:46 web acpid: acpid startup succeeded
Dec 29 17:31:47 web kernel: lp: driver loaded but no devices found
Dec 29 17:31:48 web cups: cupsd startup succeeded
Dec 29 17:31:48 web sshd:  succeeded
Dec 29 17:31:48 web xinetd: xinetd startup succeeded
Dec 29 17:31:48 web gpm[4420]: *** info [startup.c(95)]:
Dec 29 17:31:48 web gpm[4420]: Started gpm successfully. Entered daemon mode.
Dec 29 17:31:48 web xinetd[4410]: xinetd Version 2.3.13 started with libwrap loadavg options compiled in.
Dec 29 17:31:48 web xinetd[4410]: Started working: 0 available services
Dec 29 17:31:48 web gpm[4420]: *** info [mice.c(1766)]:
Dec 29 17:31:48 web gpm[4420]: imps2: Auto-detected intellimouse PS/2
Dec 29 17:31:48 web gpm: gpm startup succeeded
Dec 29 17:31:49 web iiim: htt startup succeeded
Dec 29 17:31:49 web crond: crond startup succeeded
Dec 29 17:31:49 web htt_server[4452]: started.
Dec 29 17:31:50 web xfs: xfs startup succeeded
Dec 29 17:31:50 web anacron: anacron startup succeeded
Dec 29 17:31:50 web atd: atd startup succeeded
Dec 29 17:31:50 web messagebus: messagebus startup succeeded
Dec 29 17:31:51 web cups-config-daemon: cups-config-daemon startup succeeded
Dec 29 17:31:51 web haldaemon: haldaemon startup succeeded
Dec 29 17:31:51 web clurgmgrd[4556]: <notice> Resource Group Manager Starting
Dec 29 17:31:51 web clurgmgrd[4556]: <info> Loading Service Data
Dec 29 17:31:51 web rgmanager: clurgmgrd startup succeeded
Dec 29 17:31:51 web fstab-sync[5131]: removed all generated mount points
Dec 29 17:31:51 web fstab-sync[5231]: added mount point /media/cdrom for /dev/hda
Dec 29 17:31:51 web clurgmgrd[4556]: <info> Initializing Services
Dec 29 17:31:51 web clurgmgrd: [4556]: <info> /dev/sdb1 is not mounted
Dec 29 17:31:51 web clurgmgrd: [4556]: <info> /dev/sdc1 is not mounted
Dec 29 17:31:51 web clurgmgrd: [4556]: <info> /dev/sdd1 is not mounted
Dec 29 17:31:52 web fstab-sync[5659]: added mount point /media/floppy for /dev/fd0
Dec 29 17:31:56 web clurgmgrd: [4556]: <info> Executing /etc/rc.d/init.d/sybaseHA.sh stop
Dec 29 17:31:56 web clurgmgrd: [4556]: <info> Executing /etc/rc.d/init.d/webHA.sh stop
Dec 29 17:31:56 web sybaseHA.sh: dataserver shutdown failed
Dec 29 17:31:56 web clurgmgrd[4556]: <notice> stop on script "cms-content" returned 5 (program not installed)
Dec 29 17:31:57 web clurgmgrd[4556]: <info> Services Initialized
Dec 29 17:31:58 web clurgmgrd[4556]: <info> Logged in SG "usrm::manager"
Dec 29 17:31:58 web clurgmgrd[4556]: <info> Magma Event: Membership Change
Dec 29 17:31:58 web clurgmgrd[4556]: <info> State change: Local UP
Dec 29 17:31:58 web clurgmgrd[4556]: <info> State change: sybase UP
Dec 29 17:31:59 web clurgmgrd[4556]: <info> Magma Event: Membership Change
Dec 29 17:31:59 web clurgmgrd[4556]: <info> State change: cms UP
Dec 29 17:32:01 web clurgmgrd[4556]: <notice> Starting stopped service webservice
Dec 29 17:32:01 web clurgmgrd: [4556]: <info> Adding IPv4 address 61.160.65.10 to eth0
Dec 29 17:32:02 web clurgmgrd: [4556]: <info> mounting /dev/sdc1 on /export/home/web
Dec 29 17:32:03 web kernel: kjournald starting.  Commit interval 5 seconds
Dec 29 17:32:03 web kernel: EXT3-fs warning: maximal mount count reached, running e2fsck is recommended
Dec 29 17:32:03 web kernel: EXT3 FS on sdc1, internal journal
Dec 29 17:32:03 web kernel: EXT3-fs: recovery complete.
Dec 29 17:32:03 web kernel: EXT3-fs: mounted filesystem with ordered data mode.
Dec 29 17:32:03 web clurgmgrd: [4556]: <info> Executing /etc/rc.d/init.d/webHA.sh start
Dec 29 17:33:20 web login(pam_unix)[4561]: session opened for user root by LOGIN(uid=0)
Dec 29 17:33:20 web  -- root[4561]: ROOT LOGIN ON tty1
Dec 29 17:34:29 web kernel: CMAN: removing node sybase from the cluster : Missed too many heartbeats
Dec 29 17:34:29 web fenced[4117]: sybase not a cluster member after 0 sec post_fail_delay
Dec 29 17:34:29 web fenced[4117]: fencing node "sybase"
Dec 29 17:34:31 web fenced[4117]: agent "fence_brocade" reports: failed: portshow 80 does not show DISABLED  
Dec 29 17:34:31 web fenced[4117]: fence "sybase" failed
Dec 29 17:34:36 web fenced[4117]: fencing node "sybase"
Dec 29 17:34:37 web fenced[4117]: agent "fence_brocade" reports: failed: portshow 80 does not show DISABLED  
Dec 29 17:34:37 web fenced[4117]: fence "sybase" failed
Dec 29 17:34:42 web fenced[4117]: fencing node "sybase"
Dec 29 17:34:44 web fenced[4117]: agent "fence_brocade" reports: failed: portshow 80 does not show DISABLED  
Dec 29 17:34:44 web fenced[4117]: fence "sybase" failed
Dec 29 17:34:49 web fenced[4117]: fencing node "sybase"
Dec 29 17:34:51 web fenced[4117]: agent "fence_brocade" reports: failed: portshow 80 does not show DISABLED  
Dec 29 17:34:51 web fenced[4117]: fence "sybase" failed
Dec 29 17:34:56 web fenced[4117]: fencing node "sybase"
Dec 29 17:34:57 web fenced[4117]: agent "fence_brocade" reports: failed: portshow 80 does not show DISABLED  
Dec 29 17:34:57 web fenced[4117]: fence "sybase" failed
Dec 29 17:35:02 web fenced[4117]: fencing node "sybase"

123.JPG (87.99 KB, 下载次数: 43)

123.JPG

论坛徽章:
0
28 [报告]
发表于 2006-12-30 09:30 |只看该作者
添加Fence对于SW200e,选用什么端口?

论坛徽章:
0
29 [报告]
发表于 2006-12-30 14:42 |只看该作者
在线等待fence方面的回答!

论坛徽章:
0
30 [报告]
发表于 2006-12-30 16:59 |只看该作者
原帖由 SUNfan 于 2006-12-30 14:42 发表
在线等待fence方面的回答!


这种参数其实和技术无关,至少和linux方面的技术无关,

只有3种人能清楚的解答你的问题

配过同型号设备的人
这种fence设备品牌的厂商或代理
红帽的开发人员

即使把电话打到红帽800,也很难得到一个满意的答复,原因很简单红帽的800工程师也不可能配过所有的fence设备

你可以去红帽邮件列表里查这个设备的关键字,或者选择可靠的硬件厂商支持,google一般很难搜索到这种答案

当然如果是我作这项目,我觉得看硬件说明书然后自己尝试最快

[ 本帖最后由 fuumax 于 2006-12-30 17:02 编辑 ]
您需要登录后才可以回帖 登录 | 注册

本版积分规则 发表回复

  

北京盛拓优讯信息技术有限公司. 版权所有 京ICP备16024965号-6 北京市公安局海淀分局网监中心备案编号:11010802020122 niuxiaotong@pcpop.com 17352615567
未成年举报专区
中国互联网协会会员  联系我们:huangweiwei@itpub.net
感谢所有关心和支持过ChinaUnix的朋友们 转载本站内容请注明原作者名及出处

清除 Cookies - ChinaUnix - Archiver - WAP - TOP