Chinaunix

标题: redhat as 5 64 重启日志,帮看看是哪里有问题,谢谢 [打印本页]

作者: liyong705    时间: 2010-10-27 21:32
标题: redhat as 5 64 重启日志,帮看看是哪里有问题,谢谢
本帖最后由 liyong705 于 2010-10-27 21:35 编辑

日志文件

messages.rar (7.77 KB, 下载次数: 20)

以下是一部分,发布完,只能用附件来
=====================================
Oct 24 04:02:04 localhost syslogd 1.4.1: restart.
Oct 24 10:53:56 localhost auditd[2231]: Audit daemon rotating log files
Oct 25 13:13:31 localhost restorecond: Will not restore a file with more than one hard link (/etc/resolv.conf) No such file or directory
Oct 25 14:16:25 localhost restorecond: Will not restore a file with more than one hard link (/etc/resolv.conf) No such file or directory
Oct 25 18:05:18 localhost restorecond: Will not restore a file with more than one hard link (/etc/resolv.conf) No such file or directory
Oct 26 17:58:17 localhost restorecond: Will not restore a file with more than one hard link (/etc/resolv.conf) No such file or directory
Oct 26 23:57:48 localhost auditd[2231]: Audit daemon rotating log files
Oct 27 09:06:48 localhost restorecond: Will not restore a file with more than one hard link (/etc/resolv.conf) No such file or directory
Oct 27 10:38:15 localhost restorecond: Will not restore a file with more than one hard link (/etc/resolv.conf) No such file or directory
Oct 27 15:19:02 localhost restorecond: Will not restore a file with more than one hard link (/etc/resolv.conf) No such file or directory
Oct 27 16:10:27 localhost restorecond: Will not restore a file with more than one hard link (/etc/resolv.conf) No such file or directory
Oct 27 18:53:22 localhost shutdown[20695]: shutting down for system halt
Oct 27 18:53:24 localhost smartd[3005]: smartd received signal 15: Terminated
Oct 27 18:53:24 localhost smartd[3005]: smartd is exiting (exit status 0)
Oct 27 18:53:24 localhost avahi-daemon[2965]: Got SIGTERM, quitting.
Oct 27 18:53:24 localhost avahi-daemon[2965]: Leaving mDNS multicast group on interface eth0.IPv6 with address fe80::215:17ff:feb6:4ebc.
Oct 27 18:53:24 localhost avahi-daemon[2965]: Leaving mDNS multicast group on interface eth0.IPv4 with address 118.122.178.45.
Oct 27 18:53:25 localhost restorecond: Will not restore a file with more than one hard link (/etc/resolv.conf) No such file or directory
Oct 27 18:53:29 localhost xinetd[2678]: Exiting...
Oct 27 18:53:33 localhost hcid[2420]: Got disconnected from the system message bus
Oct 27 18:53:33 localhost rpc.statd[2342]: Caught signal 15, un-registering and exiting.
Oct 27 18:53:33 localhost restorecond: terminated
Oct 27 18:53:34 localhost auditd[2231]: The audit daemon is exiting.
Oct 27 18:53:34 localhost kernel: audit(1288176814.031:40190): audit_pid=0 old=2231 by auid=4294967295 subj=system_u:system_r:auditd_t:s0
Oct 27 18:53:34 localhost pcscd: pcscdaemon.c:572:signal_trap() Preparing for suicide
Oct 27 18:53:34 localhost pcscd: hotplug_libusb.c:376:HPRescanUsbBus() Hotplug stopped
Oct 27 18:53:35 localhost pcscd: readerfactory.c:1379:RFCleanupReaders() entering cleaning function
Oct 27 18:53:35 localhost pcscd: pcscdaemon.c:532:at_exit() cleaning /var/run
Oct 27 18:53:35 localhost kernel: Kernel logging (proc) stopped.
作者: liyong705    时间: 2010-10-27 21:33
Oct 27 18:53:35 localhost kernel: Kernel log daemon terminating.
Oct 27 18:53:36 localhost exiting on signal 15
Oct 27 19:04:36 localhost syslogd 1.4.1: restart.
Oct 27 19:04:36 localhost kernel: klogd 1.4.1, log source = /proc/kmsg started.
Oct 27 19:04:36 localhost kernel: Linux version 2.6.18-164.el5 (mockbuild@x86-003.build.bos.redhat.com) (gcc version 4.1.2 20080704 (Red Hat 4.1.2-46)) #1 SMP Tue Aug 18 15:51:48 EDT 2009
Oct 27 19:04:36 localhost kernel: Command line: ro root=/dev/VolGroup00/LogVol00 rhgb quiet
Oct 27 19:04:36 localhost kernel: BIOS-provided physical RAM map:
Oct 27 19:04:36 localhost kernel:  BIOS-e820: 0000000000010000 - 000000000009d000 (usable)
Oct 27 19:04:36 localhost kernel:  BIOS-e820: 000000000009d000 - 0000000000100000 (reserved)
Oct 27 19:04:36 localhost kernel:  BIOS-e820: 0000000000100000 - 000000009e273000 (usable)
Oct 27 19:04:36 localhost kernel:  BIOS-e820: 000000009e273000 - 000000009e32a000 (ACPI NVS)
Oct 27 19:04:36 localhost kernel:  BIOS-e820: 000000009e32a000 - 000000009fa32000 (usable)
Oct 27 19:04:36 localhost kernel:  BIOS-e820: 000000009fa32000 - 000000009fa9a000 (reserved)
Oct 27 19:04:36 localhost kernel:  BIOS-e820: 000000009fa9a000 - 000000009fab5000 (usable)
Oct 27 19:04:36 localhost kernel:  BIOS-e820: 000000009fab5000 - 000000009fb1a000 (ACPI NVS)
Oct 27 19:04:36 localhost kernel:  BIOS-e820: 000000009fb1a000 - 000000009fb28000 (usable)
作者: Yuri.G.    时间: 2010-10-27 21:43
现在是起不来么?还是什么状态?
  1. echo "8.8.4.4" >/etc/resolv.conf
复制代码
创建一下这个文件,然后再看看。
作者: cst05001    时间: 2010-10-27 23:10
  1. Oct 27 18:53:22 localhost shutdown[20695]: shutting down for system halt
复制代码
只有两种可能:人为,程序故意所为。
作者: liyong705    时间: 2010-10-28 09:24
首先非常感谢2位的热心帮助
1。检查 /etc/resolv.conf 存在

2。有点怀疑是人为

突然之间服务器不通了,打电话给机房,处理后上去看就这样了,因为前几天这个机器因为故障换过一个硬盘,所以担心是不是那个问题又重新出现了

============================================================
另外请教1下上次故障的问题,帮看看是不是硬盘坏了(这个硬盘已经发给商家去换了)

运行过程中突然这样,然后所有命令失效








硬盘型号


作者: cst05001    时间: 2010-10-28 10:08
从lz补充的截图来看,存储通讯中断了。
作者: chenyx    时间: 2010-10-28 10:10
看楼主的图,是硬盘挂掉了
作者: liyong705    时间: 2010-10-28 10:18
是 硬盘硬件故障 还是 软故障 请教~
作者: chenyx    时间: 2010-10-28 10:22
应该是硬件故障。"dead"
作者: Yuri.G.    时间: 2010-10-28 10:24
检查一下是不是电源线或者硬盘数据线松动,开机转不转?
硬件问题很大嫌疑
作者: liyong705    时间: 2010-10-28 10:24
今天商家说检查没有问题,我都搞郁闷了,一般用什么工具测试比较可靠
作者: jerrywjl    时间: 2010-10-28 12:33
我见过的所谓硬件厂商的硬件检测,大都是一些检测工具跑一下获得一下设备的基本状况,能检查的也是设备的基本信息,除非严重错误,否则一般的例如一些兼容性问题、驱动程序bug等等都检查不出来。

但即便这样,这些工具往往也足够让硬件厂商将你搪塞过去了。

当然,我们也不怀疑操作系统或者驱动有bug,如果是这样,你完全可以将系统内核版本有多高拔多高,如果再出问题,除非这个bug是火星来的,否则硬件那里还有什么可以搪塞?

总之一句话,你认为对于一台运行中的服务器,报出这样的物理错误,硬件出错的可能性大还是软件出错的可能性大?

软件的代码都是死的,硬件的问题可就不好说了吧?即便设备是好的,那接口呢?线路呢?干扰呢?电气特性呢?这些东西他都能在硬件检测中提供数据吗?
作者: renxiao2003    时间: 2010-10-28 12:36
机房的管理员不承认啊。
作者: dn833    时间: 2010-10-28 14:59
也不一定就是硬盘问题,也许是SCSI控制器的问题。




欢迎光临 Chinaunix (http://bbs.chinaunix.net/) Powered by Discuz! X3.2