Solaris 7 system reboots every day and I cannot find the cause. Urgent!

#1 | Posted 2003-04-25 08:43
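The system runs Solaris 7 on a Sun 420R with 4 CPUs and 4 GB of memory. Memory and CPU utilization are normally very low, but recently the system has been rebooting twice a day at fixed times, at about 1:00 and at around 7:00 to 8:00. The first reboot shows no abnormal message; see the messages output below. Could someone point out the cause? Thanks!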
Apr 24 01:04:35 server unix: ^MSunOS Release 5.7 Version Generic_106541-23 64-bit [UNIX(R) System V Release 4.0]
Apr 24 01:04:35 server unix: Copyright (c) 1983-1999, Sun Microsystems, Inc.
Apr 24 01:04:35 server unix: Ethernet address = 0:3:ba:12:70:57
Apr 24 01:04:35 server unix: mem = 4194304K (0x100000000)
Apr 24 01:04:35 server unix: avail mem = 4136648704
Apr 24 01:04:35 server unix: System booting after fatal error FATAL
Apr 24 01:04:35 server unix: root nexus = Sun Enterprise 420R (4 X UltraSPARC-II 450MHz)
Apr 24 01:04:36 server unix: pci0 at root: UPA 0x1f 0x4000
Apr 24 01:04:36 server unix: pci0 is /pci@1f,4000
Apr 24 01:04:36 server unix: /pci@1f,4000/scsi@3 (glm0):
Apr 24 01:04:36 server  Rev. 5 Symbios 53c875 found.
Apr 24 01:04:36 server unix: PCI-device: scsi@3, glm0
Apr 24 01:04:36 server unix: glm0 is /pci@1f,4000/scsi@3
Apr 24 01:04:36 server unix: /pci@1f,4000/scsi@3,1 (glm1):
Apr 24 01:04:36 server  Rev. 5 Symbios 53c875 found.
Apr 24 01:04:36 server unix: PCI-device: scsi@3,1, glm1
Apr 24 01:04:36 server unix: glm1 is /pci@1f,4000/scsi@3,1
Apr 24 01:04:36 server unix: sd0 at glm0:
Apr 24 01:04:36 server unix:  target 0 lun 0
Apr 24 01:04:36 server unix: sd0 is /pci@1f,4000/scsi@3/sd@0,0
Apr 24 01:04:36 server unix:    <SUN36G cyl 24620 alt 2 hd 27 sec 107>
Apr 24 01:04:36 server unix: sd1 at glm0:
Apr 24 01:04:36 server unix:  target 1 lun 0
Apr 24 01:04:36 server unix: sd1 is /pci@1f,4000/scsi@3/sd@1,0
Apr 24 01:04:36 server unix:    <SUN36G cyl 24620 alt 2 hd 27 sec 107>
Apr 24 01:04:37 server unix: sd6 at glm0:
Apr 24 01:04:37 server unix:  target 6 lun 0
Apr 24 01:04:37 server unix: sd6 is /pci@1f,4000/scsi@3/sd@6,0
Apr 24 01:04:44 server unix: root on /pseudo/md@0:0,30,blk fstype ufs
Apr 24 01:04:44 server unix: WARNING: forceload of misc/md_trans failed
Apr 24 01:04:44 server unix: WARNING: forceload of misc/md_raid failed
Apr 24 01:04:44 server unix: WARNING: forceload of misc/md_hotspares failed
Apr 24 01:04:44 server unix: pci1 at root: UPA 0x1f 0x2000
Apr 24 01:04:44 server unix: pci1 is /pci@1f,2000
Apr 24 01:04:46 server unix: PCI-device: ebus@1, ebus0
Apr 24 01:04:46 server unix: su0 at ebus0: offset 14,3083f8
Apr 24 01:04:46 server unix: su0 is /pci@1f,4000/ebus@1/su@14,3083f8
Apr 24 01:04:46 server unix: su1 at ebus0: offset 14,3062f8
Apr 24 01:04:46 server unix: su1 is /pci@1f,4000/ebus@1/su@14,3062f8
Apr 24 01:04:46 server unix: keyboard is </pci@1f,4000/ebus@1/su@14,3083f8> major <37> minor <0>
Apr 24 01:04:46 server unix: mouse is </pci@1f,4000/ebus@1/su@14,3062f8> major <37> minor <1>
Apr 24 01:04:46 server unix: stdin is </pci@1f,4000/ebus@1/su@14,3083f8> major <37> minor <0>
Apr 24 01:04:46 server unix: PCI-device: SUNW,m64B@1, m640
Apr 24 01:04:46 server unix: m640 is /pci@1f,2000/SUNW,m64B@1
Apr 24 01:04:46 server unix: m64#0: 1152x900, 2M mappable, rev 4752.27
Apr 24 01:04:46 server unix: stdout is </pci@1f,2000/SUNW,m64B@1> major <87> minor <0>
Apr 24 01:04:46 server unix: cpu0: SUNW,UltraSPARC-II (upaid 0 impl 0x11 ver 0xa0 clock 450 MHz)
Apr 24 01:04:46 server unix: cpu1: SUNW,UltraSPARC-II (upaid 1 impl 0x11 ver 0xa0 clock 450 MHz)
Apr 24 01:04:46 server unix: cpu 1 initialization complete - online
Apr 24 01:04:46 server unix: cpu2: SUNW,UltraSPARC-II (upaid 2 impl 0x11 ver 0xa0 clock 450 MHz)
Apr 24 01:04:46 server unix: cpu 2 initialization complete - online
Apr 24 01:04:46 server unix: cpu3: SUNW,UltraSPARC-II (upaid 3 impl 0x11 ver 0xa0 clock 450 MHz)
Apr 24 01:04:46 server unix: cpu 3 initialization complete - online
Apr 24 01:04:48 server unix: se0 at ebus0: offset 14,400000
Apr 24 01:04:48 server unix: se0 is /pci@1f,4000/ebus@1/se@14,400000
Apr 24 01:04:49 server unix: SUNW,hme0: CheerIO 2.0 (Rev Id = c1) Found
Apr 24 01:04:49 server unix: PCI-device: network@1,1, hme0
Apr 24 01:04:49 server unix: hme0 is /pci@1f,4000/network@1,1
Apr 24 01:04:49 server unix: SUNW,hme1: CheerIO 2.0 (Rev Id = c1) Found
Apr 24 01:04:49 server unix: SUNW,hme1: Local Ethernet address = 0:3:ba:1e:c8:b5
Apr 24 01:04:49 server unix: PCI-device: SUNW,hme@4,1, hme1
Apr 24 01:04:49 server unix: hme1 is /pci@1f,4000/SUNW,hme@4,1
Apr 24 01:04:52 server unix: SUNW,hme0: Using Internal Transceiver
Apr 24 01:04:52 server unix: SUNW,hme0: 100 Mbps full-duplex Link Up
Apr 24 01:04:53 server unix: dump on /dev/md/dsk/d31 size 2048 MB
Apr 24 01:06:24 server syslogd: line 24: WARNING: loghost could not be resolved
Apr 24 01:06:25 server unix: dump on /dev/dsk/c0t0d0s1 size 2048 MB
Apr 24 01:06:28 server unix: pseudo-device: pm0
Apr 24 01:06:28 server unix: pm0 is /pseudo/pm@0
Apr 24 01:06:28 server syslog: /usr/sbin/pmconfig: /etc/power.conf line (31) failed to convert mount point /dev/md/dsk/d30 to prom name
Apr 24 01:06:28 server unix: pseudo-device: tod0
Apr 24 01:06:28 server unix: tod0 is /pseudo/tod@0
Apr 24 01:06:29 server unix: power0 at ebus0: offset 14,724000
Apr 24 01:06:29 server unix: power0 is /pci@1f,4000/ebus@1/power@14,724000
Apr 24 01:06:29 server unix: pseudo-device: devinfo0
Apr 24 01:06:29 server unix: devinfo0 is /pseudo/devinfo@0
Apr 24 01:06:30 server unix: pseudo-device: vol0
Apr 24 01:06:30 server unix: vol0 is /pseudo/vol@0
Apr 24 08:08:22 server unix: WARNING: [AFT1] Uncorrectable Memory Error on CPU0 Data access at TL=0, errID 0x0000173d.3084a55f
Apr 24 08:08:22 server     AFSR 0x00000000.00200000<UE> AFAR 0x00000000.d7728820
Apr 24 08:08:22 server     AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00 Fault_PC 0xfeec5860
Apr 24 08:08:22 server     UDBH 0x0203<UE> UDBH.ESYND 0x03 UDBL 0x0000 UDBL.ESYND 0x00
Apr 24 08:08:22 server     UDBH Syndrome 0x3 Memory Module U1302 U0302 U1301 U0301
Apr 24 08:08:22 server unix: WARNING: [AFT1] errID 0x0000173d.3084a55f Syndrome 0x3 indicates that this may not be a memory module problem
Apr 24 08:08:22 server unix: [AFT2] errID 0x0000173d.3084a55f PA=0x00000000.d7728820
Apr 24 08:08:22 server     E$tag 0x00000000.0a401aee E$State: Shared E$parity 0x05
Apr 24 08:08:22 server unix: [AFT2] E$Data (0x00): 0x00000000.00000000
Apr 24 08:08:22 server unix: [AFT2] E$Data (0x08): 0x00000000.00000000
Apr 24 08:08:22 server unix: [AFT2] E$Data (0x10): 0x00000000.00000000
Apr 24 08:08:22 server unix: [AFT2] E$Data (0x18): 0x00000000.00000000
Apr 24 08:08:22 server unix: [AFT2] E$Data (0x20): 0x00000100.00000000 *Bad* PSYND=0xff00
Apr 24 08:08:22 server unix: [AFT2] E$Data (0x28): 0x00000000.00000000
Apr 24 08:08:22 server unix: [AFT2] E$Data (0x30): 0x00000000.00000000
Apr 24 08:08:22 server unix: [AFT2] E$Data (0x38): 0x00211360.0020a6c0
Apr 24 08:08:22 server unix: WARNING: [AFT1] CP event on CPU2 (caused Data access error on CPU0), errID 0x0000173d.3084a55f
Apr 24 08:08:22 server     AFSR 0x00000000.01002000<CP> AFAR 0x00000000.d7728820
Apr 24 08:08:22 server     AFSR.PSYND 0x2000(Score 95) AFSR.ETS 0x00
Apr 24 08:08:22 server     UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0000 UDBL.ESYND 0x00
Apr 24 08:08:22 server unix: [AFT2] errID 0x0000173d.3084a55f PA=0x00000000.d7728820
Apr 24 08:08:22 server     E$tag 0x00000000.1b401aee E$State: Owner E$parity 0x0d
Apr 24 08:08:22 server unix: [AFT2] E$Data (0x00): 0x00000000.00000000
Apr 24 08:08:22 server unix: [AFT2] E$Data (0x08): 0x00000000.00000000
Apr 24 08:08:22 server unix: [AFT2] E$Data (0x10): 0x00000000.00000000
Apr 24 08:08:22 server unix: [AFT2] E$Data (0x18): 0x00000000.00000000
Apr 24 08:08:22 server unix: [AFT2] E$Data (0x20): 0x00000100.00000000 *Bad* PSYND=0x2000
Apr 24 08:08:22 server unix: [AFT2] E$Data (0x28): 0x00000000.00000000
Apr 24 08:08:22 server unix: [AFT2] E$Data (0x30): 0x00000000.00000000
Apr 24 08:08:22 server unix: [AFT2] E$Data (0x38): 0x00211360.0020a6c0
Apr 24 08:08:22 server unix: NOTICE: Scheduling clearing of error on page 0x00000000.d7728000
Apr 24 08:08:22 server unix: [AFT3] errID 0x0000173d.3084a55f Above Error is in User Mode
Apr 24 08:08:22 server     and is fatal: will reboot
Apr 24 08:08:22 server unix: WARNING: [AFT1] initiating reboot due to above error in pid 19571 (java)
Apr 24 08:08:24 server syslogd: going down on signal 15
Apr 24 08:11:07 server unix: ^MSunOS Release 5.7 Version Generic_106541-23 64-bit [UNIX(R) System V Release 4.0]
Apr 24 08:11:07 server unix: Copyright (c) 1983-1999, Sun Microsystems, Inc.
Apr 24 08:11:07 server unix: Ethernet address = 0:3:ba:12:70:57
Apr 24 08:11:07 server unix: mem = 4194304K (0x100000000)
Apr 24 08:11:07 server unix: avail mem = 4136648704
Apr 24 08:11:07 server unix: root nexus = Sun Enterprise 420R (4 X UltraSPARC-II 450MHz)
Apr 24 08:11:07 server unix: pci0 at root: UPA 0x1f 0x4000
Apr 24 08:11:07 server unix: pci0 is /pci@1f,4000
Apr 24 08:11:07 server unix: /pci@1f,4000/scsi@3 (glm0):
Apr 24 08:11:07 server  Rev. 5 Symbios 53c875 found.
Apr 24 08:11:07 server unix: PCI-device: scsi@3, glm0
Apr 24 08:11:07 server unix: glm0 is /pci@1f,4000/scsi@3
Apr 24 08:11:07 server unix: /pci@1f,4000/scsi@3,1 (glm1):
Apr 24 08:11:07 server  Rev. 5 Symbios 53c875 found.
Apr 24 08:11:07 server unix: PCI-device: scsi@3,1, glm1
Apr 24 08:11:07 server unix: glm1 is /pci@1f,4000/scsi@3,1
Apr 24 08:11:07 server unix: sd0 at glm0:
Apr 24 08:11:07 server unix:  target 0 lun 0
Apr 24 08:11:07 server unix: sd0 is /pci@1f,4000/scsi@3/sd@0,0
Apr 24 08:11:07 server unix:    <SUN36G cyl 24620 alt 2 hd 27 sec 107>
Apr 24 08:11:07 server unix: sd1 at glm0:
Apr 24 08:11:07 server unix:  target 1 lun 0
Apr 24 08:11:07 server unix: sd1 is /pci@1f,4000/scsi@3/sd@1,0
Apr 24 08:11:07 server unix:    <SUN36G cyl 24620 alt 2 hd 27 sec 107>
Apr 24 08:11:08 server unix: sd6 at glm0:
Apr 24 08:11:08 server unix:  target 6 lun 0
Apr 24 08:11:08 server unix: sd6 is /pci@1f,4000/scsi@3/sd@6,0
Apr 24 08:11:15 server unix: root on /pseudo/md@0:0,30,blk fstype ufs
Apr 24 08:11:15 server unix: WARNING: forceload of misc/md_trans failed
Apr 24 08:11:15 server unix: WARNING: forceload of misc/md_raid failed
Apr 24 08:11:15 server unix: WARNING: forceload of misc/md_hotspares failed
Apr 24 08:11:15 server unix: pci1 at root: UPA 0x1f 0x2000
Apr 24 08:11:15 server unix: pci1 is /pci@1f,2000
Apr 24 08:11:16 server unix: PCI-device: ebus@1, ebus0
Apr 24 08:11:17 server unix: su0 at ebus0: offset 14,3083f8
Apr 24 08:11:17 server unix: su0 is /pci@1f,4000/ebus@1/su@14,3083f8
Apr 24 08:11:17 server unix: su1 at ebus0: offset 14,3062f8
Apr 24 08:11:17 server unix: su1 is /pci@1f,4000/ebus@1/su@14,3062f8
Apr 24 08:11:17 server unix: keyboard is </pci@1f,4000/ebus@1/su@14,3083f8> major <37> minor <0>
Apr 24 08:11:17 server unix: mouse is </pci@1f,4000/ebus@1/su@14,3062f8> major <37> minor <1>
Apr 24 08:11:17 server unix: stdin is </pci@1f,4000/ebus@1/su@14,3083f8> major <37> minor <0>
Apr 24 08:11:17 server unix: PCI-device: SUNW,m64B@1, m640
Apr 24 08:11:17 server unix: m640 is /pci@1f,2000/SUNW,m64B@1
Apr 24 08:11:17 server unix: m64#0: 1152x900, 2M mappable, rev 4752.27
Apr 24 08:11:17 server unix: stdout is </pci@1f,2000/SUNW,m64B@1> major <87> minor <0>
Apr 24 08:11:17 server unix: cpu0: SUNW,UltraSPARC-II (upaid 0 impl 0x11 ver 0xa0 clock 450 MHz)
Apr 24 08:11:17 server unix: cpu1: SUNW,UltraSPARC-II (upaid 1 impl 0x11 ver 0xa0 clock 450 MHz)
Apr 24 08:11:17 server unix: cpu 1 initialization complete - online
Apr 24 08:11:17 server unix: cpu2: SUNW,UltraSPARC-II (upaid 2 impl 0x11 ver 0xa0 clock 450 MHz)
Apr 24 08:11:17 server unix: cpu 2 initialization complete - online
Apr 24 08:11:17 server unix: cpu3: SUNW,UltraSPARC-II (upaid 3 impl 0x11 ver 0xa0 clock 450 MHz)
Apr 24 08:11:17 server unix: cpu 3 initialization complete - online
Apr 24 08:11:18 server unix: se0 at ebus0: offset 14,400000
Apr 24 08:11:18 server unix: se0 is /pci@1f,4000/ebus@1/se@14,400000
Apr 24 08:11:20 server unix: SUNW,hme0: CheerIO 2.0 (Rev Id = c1) Found
Apr 24 08:11:20 server unix: PCI-device: network@1,1, hme0
Apr 24 08:11:20 server unix: hme0 is /pci@1f,4000/network@1,1
Apr 24 08:11:20 server unix: SUNW,hme1: CheerIO 2.0 (Rev Id = c1) Found
Apr 24 08:11:20 server unix: SUNW,hme1: Local Ethernet address = 0:3:ba:1e:c8:b5
Apr 24 08:11:20 server unix: PCI-device: SUNW,hme@4,1, hme1
Apr 24 08:11:20 server unix: hme1 is /pci@1f,4000/SUNW,hme@4,1
Apr 24 08:11:23 server unix: SUNW,hme0: Using Internal Transceiver
Apr 24 08:11:23 server unix: SUNW,hme0: 100 Mbps full-duplex Link Up
Apr 24 08:11:24 server unix: dump on /dev/md/dsk/d31 size 2048 MB
Apr 24 08:11:39 server syslogd: line 24: WARNING: loghost could not be resolved
Apr 24 08:11:40 server unix: dump on /dev/dsk/c0t0d0s1 size 2048 MB
Apr 24 08:11:43 server unix: pseudo-device: pm0
Apr 24 08:11:43 server unix: pm0 is /pseudo/pm@0
Apr 24 08:11:43 server syslog: /usr/sbin/pmconfig: /etc/power.conf line (31) failed to convert mount point /dev/md/dsk/d30 to prom name
Apr 24 08:11:43 server unix: pseudo-device: tod0
Apr 24 08:11:43 server unix: tod0 is /pseudo/tod@0
Apr 24 08:11:43 server unix: power0 at ebus0: offset 14,724000
Apr 24 08:11:43 server unix: power0 is /pci@1f,4000/ebus@1/power@14,724000
Apr 24 08:11:44 server unix: pseudo-device: devinfo0
Apr 24 08:11:44 server unix: devinfo0 is /pseudo/devinfo@0
Apr 24 08:11:44 server unix: pseudo-device: vol0
Apr 24 08:11:44 server unix: vol0 is /pseudo/vol@0
Apr 24 10:19:39 server su: 'su root' failed for wuxl on /dev/pts/1
Apr 24 13:38:13 server su: 'su root' failed for wuxl on /dev/pts/3
Apr 24 13:38:21 server last message repeated 2 times
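
For a messages file as long as the one above, a small filter can pick out the lines that mention a fatal error or a forced reboot. The following is only a minimal sketch, not a Sun tool: the function name `find_fatal_events` and the regular expressions are assumptions based solely on the message formats visible in this log, and other kernel patch levels may word these messages differently.

```python
import re

# All patterns are assumptions derived from the message formats in the log above.
PATTERNS = [
    ("reboot", re.compile(r"initiating reboot due to above error in (pid \d+ \([^)]+\))")),
    ("boot-after-fatal", re.compile(r"System booting after fatal error")),
    ("error", re.compile(r"\[AFT1\] (.+? on CPU\d+)")),
    ("module", re.compile(r"Memory Module (.+)$")),
]


def find_fatal_events(lines):
    """Return (timestamp, kind, detail) tuples for lines that match a pattern."""
    events = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        stamp = line[:15]  # syslog timestamp, e.g. "Apr 24 08:08:22"
        for kind, pattern in PATTERNS:
            match = pattern.search(line)
            if match:
                # Use the captured detail if the pattern has a group,
                # otherwise the whole matched text.
                detail = match.group(1) if pattern.groups else match.group(0)
                events.append((stamp, kind, detail))
                break
    return events
```

It could be run over the live file with something like `find_fatal_events(open("/var/adm/messages"))`. Note that the 01:04 boot in the log above does contain the line "System booting after fatal error FATAL", which this filter would report.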

#2 | Posted 2003-04-25 09:58

It looks like cpu0 has a problem. Please do these two things:
1. Check /var/adm/messages and search for any errors.
2. Run /usr/platform/sun4u/sbin/prtdiag to list the hardware in your machine.
When you have finished both, tell me what you find.

#3 | Posted 2003-04-25 10:07

Apr 24 01:06:28 server syslog: /usr/sbin/pmconfig: /etc/power.conf line (31) failed to convert mount

This error appears before the CPU error...

#4 | Posted 2003-04-25 12:51
Notice: the author has been banned or deleted; the content is hidden automatically.

#5 | Posted 2003-04-25 12:55

1. Take a look at your power management settings.

2. Install VTS, run it for a while, and check everything carefully.

#6 | Posted 2003-04-25 14:07

/usr/platform/sun4u/sbin/prtdiag
shows the CPUs as normal.
How do I turn power management on? Here is the system's power.conf file:
# Auto-Shutdown         Idle(min)       Start/Finish(hh:mm)     Behavior
autoshutdown            30              9:00 9:00               noshutdown
statefile               //.CPR
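
To make the autoshutdown entry above easier to read, the line can be split into the fields named by the comment header in the file itself. This is a minimal sketch under stated assumptions: the function name `parse_autoshutdown` and the dictionary keys are hypothetical, and the sketch handles only the five-field autoshutdown entry, not other power.conf keywords.

```python
def parse_autoshutdown(conf_text):
    """Return the autoshutdown fields as a dict, or None if there is no such entry."""
    for raw in conf_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and surrounding whitespace
        fields = line.split()
        # Expected layout (from the file's own header comment):
        # autoshutdown  Idle(min)  Start(hh:mm)  Finish(hh:mm)  Behavior
        if len(fields) == 5 and fields[0] == "autoshutdown":
            _, idle, start, finish, behavior = fields
            return {
                "idle_min": int(idle),
                "start": start,
                "finish": finish,
                "behavior": behavior,
            }
    return None
```

Applied to the file quoted above, the behavior field comes out as `noshutdown`.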

#7 | Posted 2003-04-25 14:08

By the way, how do I turn power management off?

#8 | Posted 2003-04-25 14:32

http://docs.sun.com/db/doc/802-7298/6iau8fb41?q=power.conf&a=view

Have a look. Working it out yourself will do you more good...

#9 | Posted 2003-04-25 16:10
Notice: the author has been banned or deleted; the content is hidden automatically.

#10 | Posted 2003-04-25 23:26

But my power.conf is already set to noshutdown,
so the autoshutdown feature should already be disabled, right?