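Strange problem on an M80; could anyone help analyze and resolve it?
Environment: two M80s + 7133, AIX 5300-03 + HA 5.1
Symptom: node A of the HA environment shuts itself down at irregular intervals
cluster.log contains the following entries: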
Nov 30 08:20:32 Test_A daemon:err|error topsvcs[516136]: (Recorded using libct_ffdc.a cv 2):::Error ID: 6o2Oo9/EDWP3/n51/P3.bW0...................:::Reference ID: :::Template ID: 4bd1a134:::Details File: :::Location: rsct,Hb_Rsock.C,1.57,1493 :::TS_CPU_USE_ER Using too much CPU: exiting CPU usage in milliseconds -1273798618 Interval in milliseconds where CPU usage was measured 32003
Nov 30 08:20:34 Test_A daemon:err|error grpsvcs[598080]: (Recorded using libct_ffdc.a cv 2):::Error ID: 62IcBY/GDWP3/5N7/P3.bW0...................:::Reference ID: :::Template ID: 64368504:::Details File: :::Location: RSCT,PMClient.C,1.72,1049 :::GS_TS_RETCODE_ER Connection failure between Group Services and Topology Services DIAGNOSTIC EXPLANATION topsvcs subsystem died with hb_errno = 16, grpsvcs will also exit.
Nov 30 08:20:34 Test_A local0:crit clstrmgrES[610358]: Thu Nov 30 08:20:34 announcementCb: Called, state=ST_STABLE
Nov 30 08:20:34 Test_A local0:crit clstrmgrES[610358]: Thu Nov 30 08:20:34 announcementCb: GRPSVCS announcment code=512; exiting
Nov 30 08:20:34 Test_A local0:crit clstrmgrES[610358]: Thu Nov 30 08:20:34 CHECK FOR FAILURE OF RSCT SUBSYSTEMS (topsvcs or grpsvcs)
Nov 30 08:20:34 Test_A local0:crit clstrmgrES[610358]: Thu Nov 30 08:20:34 clstrmgr on node 1 is exiting with code 4
Nov 30 08:20:34 Test_A daemon:notice snmpd[614488]: NOTICE: SMUX packet from (127.0.0.1+32771+2)
Nov 30 08:20:34 Test_A daemon:notice snmpd[614488]: NOTICE: SMUX trap: (6 10) (127.0.0.1+32771+2)
Nov 30 08:20:34 Test_A daemon:err|error haemd[606256]: LPP=PSSP,Fn=emd_gsi.c,SID=1.4.1.36,L#=1361, haemd: 2521-032 Cannot dispatch group services (1).
Nov 30 08:20:34 Test_A daemon:notice snmpd[614488]: NOTICE: SMUX packet from (127.0.0.1+32771+2)
Nov 30 08:20:34 Test_A daemon:notice snmpd[614488]: NOTICE: SMUX trap: (6 11) (127.0.0.1+32771+2)
Nov 30 08:20:34 Test_A daemon:notice snmpd[614488]: NOTICE: SMUX packet from (127.0.0.1+32771+2)
Nov 30 08:20:34 Test_A daemon:notice snmpd[614488]: NOTICE: SMUX trap: (6 15) (127.0.0.1+32771+2)
Nov 30 08:20:34 Test_A user:notice HACMP for AIX: clexit.rc : Unexpected termination of clstrmgrES.
Nov 30 08:20:35 Test_A user:notice HACMP for AIX: clexit.rc : Halting system immediately!!!
errpt shows no CPU-related errors.
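One detail in the topsvcs entry may be worth a closer look: the reported CPU usage is a negative number (-1273798618 ms) over a 32003 ms interval. The sketch below pulls those two fields out of the line and shows what the value would be if it were an unsigned 32-bit counter printed as signed. That reinterpretation is only an assumption for illustration, not something the log or RSCT confirms; `parse_cpu_use` and `as_unsigned32` are hypothetical helper names.

```python
import re

# Tail of the topsvcs entry from cluster.log (copied from the excerpt above).
LOG_LINE = (
    "TS_CPU_USE_ER Using too much CPU: exiting CPU usage in milliseconds "
    "-1273798618 Interval in milliseconds where CPU usage was measured 32003"
)

# Matches the two numeric fields of a TS_CPU_USE_ER message.
PATTERN = re.compile(
    r"CPU usage in milliseconds\s+(-?\d+)\s+"
    r"Interval in milliseconds where CPU usage was measured\s+(\d+)"
)


def parse_cpu_use(line):
    """Return (cpu_ms, interval_ms) from a TS_CPU_USE_ER line, or None."""
    m = PATTERN.search(line)
    if m is None:
        return None
    return int(m.group(1)), int(m.group(2))


def as_unsigned32(value):
    """Reinterpret a signed value as unsigned 32-bit (assumption: 32-bit wrap)."""
    return value & 0xFFFFFFFF


cpu_ms, interval_ms = parse_cpu_use(LOG_LINE)
print("cpu_ms:", cpu_ms, "interval_ms:", interval_ms)
print("negative reading:", cpu_ms < 0)
print("as unsigned 32-bit:", as_unsigned32(cpu_ms))
```

A negative CPU time cannot be a real measurement, so the value itself looks suspect rather than the CPU load, which would be consistent with errpt showing nothing CPU-related.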
Any help would be much appreciated.