Chinaunix

标题: NBU中tape drive down的10大原因 [打印本页]

作者: maping    时间: 2005-08-19 15:31
标题: NBU中tape drive down的10大原因
在官方网站中找到这样的文档。
但不怎么明白。
现在与大家共享。
希望大家能针对自己遇到的情况,分析分析这个问题。   
tape drive down的10大原因


Symptom:  
Top Ten Reasons Tape Drives Go Down  
  
Solution:  

10. Infrequent, random failure during "normal" operation.
   Diagnosis: Two drive errors occurred within 12 hours.  (See:
   Technotes 234412, 235839)
   Remedies:
   A. Increase the number of allowable drive errors by creating a
      file named /usr/openv/netbackup/DRIVE_ERROR_THRESHOLD
      that contains an override value.
   B. Decrease the duration over which drive errors are accumulated
      by creating a file named /usr/openv/netbackup/TIME_WINDOW
      that contains an override value.
   C. Address the cause of the drive error:
      * Status 134: V3.4, Patch J080645, during manual backup:
        At end of tape, mount request precedes before dismount is
        complete -- create /usr/openv/volmgr/DISABLE_RESOURCES_BUSY
        on media server
      * Unreliable media (near maximum mount count) may need
        replacement.

9. drive goes Down following persistent media errors.
   Diagnosis: A robot inserts a (physically) damaged tape into the drive.
   Remedy: Manual intervention  (e.g., remove the adhesive label
           that's covering the media access and gumming everything up.

8. drive goes Down with intermittent media errors (Status 85 or 86).
   Diagnosis: Tape drive may need cleaning.  (See: Technotes 231451,
   236697, 201201)
   Remedies:
   A. Check whether drive hardware has TapeAlert and is configured
      for automatic cleaning.  (This feature allows a drive to notify
      the Media Manager when it needs cleaning. Using this feature
      is the Veritas-preferred configuration.)
   B. If TapeAlert is not supported (or automatic cleaning is not
      selected), check whether Cleaning Frequency is set in the drive
      configuration.
   C. Find out whether any cleaning tapes are configured, and where
      they are located (in the robotic library, in "that other media
      server," or in "locked up in Jeremy's office."

7. drive goes Down with Status 85 (media read error) when mounting a
   particular tape volume.
   Diagnosis: The tape is really a misconfigured cleaning tape.
   Remedy:
   A. Remove tape from drive (either manually or with robtest).
   B. Determine that tape is misconfigured either from its volume ID
      or by visual examination.
   C. Modify the tape properties (identify it as a cleaning tape.)

6. drive goes Down with Status 85 or 86 for other reasons.
   Diagnosis: Further details are available in /var/logs/syslog or
   /var/adm/messages.

5. drive remains Down from the time the robot was brought on-line.
   Diagnosis: The drive had a tape in it when powered on, and the
              robot doesn't know where it belongs.
   Remedy:    Move tape to an unused slot with robtest and
              re-inventory robot.

4. drive goes Down whenever robot loads a tape.
   Diagnosis: One or more tape drives are misconfigured. (See Technote #193280)
   Remedy:
   A. Run "sgscan" and examine the output.
   B. Run "tpconfig -d" and compare the results to sgscan.
   C. Check for consistency of preceding results, and compare with
      /kernel/drv/st.conf
   D. Try moving tapes from one drive to the next using "robtest".
   Remedy:
   * Review current drive definition/properties with tpconfig or
     in the bpadm/xbpadm/jnbSA device manager functions.
   * Modify as necessary.

3. Hardware problems:
   A. Symptom:   SCSI Log contains send errors
      Diagnosis: Cabling or termination problems  (See Technote #234715)
      Remedy:    Move cables and termination as necessary
   B. Symptom:   SCSI bus reset
      Diagnosis: Defective SCSI controller (See Technote #201880)
      Remedy:    Replace SCSI adapter
   C. Symptom:   System error messages from robtest and sgscan
      Diagnosis: drive hardware, firmware bugs  (See Technote #237227)
      Remedy:    Upgrade hardware as necessary
   Other trouble-shooting tools:
   * Check lights on drive
   * Other command lines fail
   * Errors that appear in various NetBackup logs
   * System error logs, such as /var/logs/syslog or /var/adm/messages
     (if syslogging is enabled in /etc/syslog.conf)
   * /usr/openv/netbackup/db/media/errors

2. Check bptm and daemon logs for clues.  Who knows what you'll find in there!?

And the no. 1 reason for tapes to go Down ...

1. drive goes Down at random times.
   Diagnosis: Tape drive is linked to a stock index.
   Remedy: "/usr/sbin/unlink -f /etc/index.funds" on Media Server.
作者: maping    时间: 2005-08-19 15:32
标题: NBU中tape drive down的10大原因
学习中
望大家多帮忙!
谢谢
作者: roychris    时间: 2005-08-19 16:14
标题: NBU中tape drive down的10大原因
这些都是好文章啊
作者: maping    时间: 2005-08-22 08:02
标题: NBU中tape drive down的10大原因
请大家一起讨论讨论具体遇到的问题!
作者: windsand    时间: 2005-08-31 12:38
标题: NBU中tape drive down的10大原因
如果操作系统内的相关配置格式,你写的不准确,drive显示的就是down,比如solaris 的st.conf,你不注意空格和tab的间距,那原因可就让你好找了
作者: fanyf    时间: 2005-09-09 11:47
标题: NBU中tape drive down的10大原因
好,顶一下!

谢谢楼主!
作者: kaka_wang    时间: 2006-03-11 13:35
谢谢楼主!
作者: henrypan    时间: 2007-04-29 21:25
标题: Cool Thanks
如果操作系统内的相关配置格式,你写的不准确,drive显示的就是down,比如solaris 的st.conf,你不注意空格和tab的间距,那原因可就让你好找了. Yes. That's almost kill me once:<(
作者: rand1985    时间: 2012-01-09 13:25
增加一个:
       贴错标签,把清洁带误贴错成介质带,当机械手把此磁带放入驱动器中后,会把驱动器down掉!
   

      这个原因应该是很少见的!
作者: ZoneJ    时间: 2012-06-26 17:20
学习中,关注NBU!!!
作者: QaSanil    时间: 2012-06-26 22:02
标记下,明天上班看,哈哈
作者: 100心    时间: 2012-06-26 22:48
挖坟贴?
作者: ulovko    时间: 2012-07-26 09:08
很实用的帖子 感谢分享 ^_^




欢迎光临 Chinaunix (http://bbs.chinaunix.net/) Powered by Discuz! X3.2