Veritas Status Codes and Messages
Status Code: 201
Message: handshaking failed with server backup restore manager
Explanation: A process on the master server encountered an error when communicating with the media server (either the master or a slave server). This error means that the master and media server processes were able to initiate communication, but encountered difficulties in completing it. This problem can occur during a backup, restore, or media list in a single or multiple server configuration.
Recommended Action:
1. Determine the activity that encountered the handshake failure by examining the NetBackup All Log Entries report for the appropriate time period. If there are slave servers, determine if:
• The handshake failure was encountered between the master and a slave server.
or
• Only the master server was involved.
2. If necessary, create activity log directories for the following:
• bpcd on the NetBackup media server (either master or slave).
• If the error was encountered during a backup operation, bpsched on the master server.
• If the error was encountered during a restore operation, bprd on the master server.
• If the error was encountered during a media list operation, admin in the NetBackup logs/admin directory on the master server.
3. Retry the operation and examine the resulting activity logs for information on why the error occurred.
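The directory-creation step above can be scripted. The following is a minimal sketch, assuming each process logs to a subdirectory named after it under a common logs root; the function name and the demo location are hypothetical, and the real logs root for your installation (often /usr/openv/netbackup/logs on UNIX, but verify this) must be substituted.

```python
import tempfile
from pathlib import Path


def create_activity_log_dirs(logs_root, processes):
    """Create one activity log directory per process name under logs_root.

    Existing directories are left untouched. Returns the directory paths.
    """
    created = []
    for name in processes:
        path = Path(logs_root) / name
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created


if __name__ == "__main__":
    # Demonstration against a throwaway directory, not a real NetBackup install.
    with tempfile.TemporaryDirectory() as root:
        for path in create_activity_log_dirs(root, ["bpcd", "bpsched", "bprd", "admin"]):
            print(path)
```

Pass only the process names that apply to the failed operation, for example bpcd plus bprd for a restore.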
Status Code: 202
Message: timed out connecting to server backup restore manager
Explanation: A process on the master server timed out while trying to initiate communications with the media server (either the master or a slave server). This problem can occur during a backup or restore in either a single or multiple server configuration.
Recommended Action: Determine which activity encountered the connection timeout failure by examining the All Log Entries report for the appropriate time period. If there are slave servers, determine if the timeout occurred between the master and a slave or if only the master was involved.
1. Verify that the schedule specifies the correct storage unit.
2. Execute the ping command from one host to another by using the following combinations:
• From the master server, ping the master and all slave servers by using the host names found in the storage unit configuration.
• From each of the slave servers, ping the master server by using the host name specified in the NetBackup server list. On a UNIX server, this is the first SERVER entry in the bp.conf file. On a Windows NT server, the master is designated as the Current server on the Servers tab in the NetBackup Configuration dialog box. To display this dialog box, start the Backup, Archive, and Restore interface on the server and click Configure on the Actions menu (also see "Using the Configure - NetBackup Window" on page 57).
3. Verify that the master server can communicate with bpcd on the host that has the storage unit.
After each backup, the scheduler checks the storage unit to see how many drives are available (in case the backup caused a drive to be automatically downed). If bpsched cannot communicate with bpcd, it sets the number of available drives in that storage unit to 0 and further backups to that storage unit fail.
The available drives remain at 0 until the scheduler is initialized again. Therefore, even if bpcd seems to be operating correctly now, check the bpsched and bpcd activity logs (see below) for records of an earlier failure.
4. See "Testing Slave Server and Clients" on page 18 and "Resolving Network Communication Problems" on page 21.
5. If necessary, create activity log directories for the following processes and retry the operation. Then, check the resulting activity logs on the master server:
• If the error occurred during a backup operation, check the bpsched activity logs. Also, check the bpcd activity logs.
• If the error occurred during a restore operation, check the bprd activity logs.
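To find the host name that a slave server should ping, the first SERVER entry in bp.conf can be extracted with a few lines of Python. This is a sketch only: it assumes entries are written as `KEY = value`, and the host names in the sample are made up.

```python
def first_server_entry(bp_conf_text):
    """Return the host named by the first SERVER line, or None if absent."""
    for raw in bp_conf_text.splitlines():
        key, sep, value = raw.partition("=")
        if sep and key.strip() == "SERVER":
            return value.strip()
    return None


# Hypothetical bp.conf contents, used only for illustration.
sample = """\
SERVER = master.example.com
SERVER = slave1.example.com
CLIENT_NAME = slave1.example.com
"""

if __name__ == "__main__":
    print(first_server_entry(sample))
```

The returned name is the one to use in the ping test from each slave server.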
Status Code: 203
Message: server backup restore manager's network is unreachable
Explanation: A process on the master server could not connect to a particular host on the network when trying to initiate communication with the media server for a particular operation. This problem can occur during a backup or restore in either a single or multiple server configuration.
Recommended Action: Determine which activity encountered the network unreachable failure by examining the All Log Entries report for the appropriate time frame. If there is more than one NetBackup server (that is, one or more slave servers) determine if the network unreachable failure was encountered between the master and a slave server or if only the master server was involved. Execute the ping command from one host to another by using the following combinations:
1. From the master server, ping the master and all slave servers by using the host names in the storage unit configuration.
2. From each of the slave servers, ping the master server host by using the host name specified in the NetBackup server list. On a UNIX server, this is the first SERVER entry in the bp.conf file. On a Windows NT server, the master is designated as the Current server on the Servers tab in the NetBackup Configuration dialog box. To display this dialog box, start the Backup, Archive, and Restore interface and click Configure on the Actions menu (also see "Using the Configure - NetBackup Window" on page 57).
3. See "Testing Slave Server and Clients" on page 18 and "Resolving Network Communication Problems" on page 21.
4. If necessary, create activity log directories for the following processes and retry the operation. Then, check the resulting activity logs on the master server:
• If the error occurred during a backup, check the bpsched activity logs.
• If the error occurred during a restore, check the bprd activity logs.
Status Code: 204
Message: connection refused by server backup restore manager
Explanation: The media server refused a connection on the port number for bpcd. This error can be encountered during a backup or restore.
Recommended Action: Execute the ping command from one host to another by using the following combinations:
Note: Also, see "Resolving Network Communication Problems" on page 21.
From the master server, ping the master and all slave servers by using the host names in the storage unit configuration.
From each of the slave servers, ping the master server by using the name specified in the NetBackup server list. On a UNIX server, this is the first SERVER entry in the bp.conf file. On a Windows NT server, the master is designated as the Current server on the Servers tab in the NetBackup Configuration dialog box. To display this dialog box, start the Backup, Archive, and Restore interface on the server and click Configure on the Actions menu (also see "Using the Configure - NetBackup Window" on page 57).
On UNIX servers, verify that the bpcd entries in /etc/services or NIS on all the servers are identical. Verify that the media server is listening on the correct port for connections to bpcd by running one of the following commands (depending on platform and operating system):
netstat -a | grep bpcd
netstat -a | grep 13782 (or the value specified during the install)
rpcinfo -p | grep 13782 (or the value specified during the install)
On UNIX servers, you may have to change the service number for bpcd in /etc/services and the NIS services map and send SIGHUP signals to the inetd processes on the clients.
/bin/ps -ef | grep inetd
kill -HUP the_inetd_pid
or
/bin/ps -aux | grep inetd
kill -HUP the_inetd_pid
Note: On a Hewlett-Packard UNIX platform, use inetd -c to send a SIGHUP to inetd.
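Comparing the bpcd entries across servers by eye is error-prone, so a small script can help. The sketch below assumes you have already collected the text of each server's services file (how you gather it is up to you) and that the entries use the usual `name port/protocol` layout; the function names and sample hosts are hypothetical.

```python
def service_port(services_text, name="bpcd", proto="tcp"):
    """Return the port registered for name/proto in services-file text, or None."""
    for raw in services_text.splitlines():
        line = raw.split("#", 1)[0]  # drop trailing comments
        fields = line.split()
        if len(fields) >= 2 and fields[0] == name:
            port, _, entry_proto = fields[1].partition("/")
            if entry_proto == proto and port.isdigit():
                return int(port)
    return None


def mismatched_hosts(services_by_host, expected_port=13782):
    """Return the hosts whose bpcd port differs from expected_port."""
    return sorted(
        host
        for host, text in services_by_host.items()
        if service_port(text) != expected_port
    )


# Hypothetical file contents keyed by host name.
samples = {
    "master": "bpcd 13782/tcp bpcd\n",
    "slave1": "bpcd\t13782/tcp\t# NetBackup client daemon\n",
    "slave2": "bpcd 13783/tcp\n",
}

if __name__ == "__main__":
    print(mismatched_hosts(samples))
```

If a different port was chosen during the install, pass it as expected_port instead of 13782.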
On Windows NT servers:
Verify that the bpcd entries are correct in:
%SystemRoot%\system32\drivers\etc\services
Verify that the NetBackup Client Service Port Number and NetBackup Request Service Port Number on the Network tab in the NetBackup Configuration dialog box match the settings in the services file. To display this dialog box, start the Backup, Archive, and Restore interface and click Configure on the Actions menu (also see "Using the Configure - NetBackup Window" on page 57). The values on the Network tab are written to the services file when the NetBackup Client service starts.
Stop and restart the NetBackup services.
See "Testing Slave Server and Clients" on page 18 and "Resolving Network Communication Problems" on page 21.
If necessary, create activity log directories for the following processes and retry the operation. Then, check the resulting activity logs on the master server:
If the error occurred during a backup operation, check the bpsched activity logs.
If the error occurred during a restore operation, check the bprd activity logs.
Status Code: 205
Message: cannot connect to server backup restore manager
Explanation: A process on the master server could not connect to a process on a host on the network while trying to initiate communication with the server that has the storage unit for a particular operation. This problem can occur during a backup or restore in either a single or multiple server configuration. This can also occur when the scheduler process (bpsched) is building its list of available storage units to use during backups.
Recommended Action: Execute the ping command from one host to another by using the following combinations:
Note: Also, see "Resolving Network Communication Problems" on page 21.
From the master server, ping the master and all slave servers by using the host names in the storage unit configuration.
From each of the slave servers, ping the master server by using the name specified in the NetBackup server list. On a UNIX server, this is the first SERVER entry in the bp.conf file. On a Windows NT server, the master is designated as the Current server on the Servers tab in the NetBackup Configuration dialog box. To display this dialog box, start the Backup, Archive, and Restore interface and click Configure on the Actions menu (also see "Using the Configure - NetBackup Window" on page 57).
On UNIX servers, verify that the bpcd entries in /etc/services or NIS on all the servers are identical. Verify that the media server is listening on the correct port for connections to bpcd by running one of the following commands (depending on platform and operating system):
netstat -a | grep bpcd
netstat -a | grep 13782 (or the value specified during the install)
rpcinfo -p | grep 13782 (or the value specified during the install)
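The commands above check the port from the media server itself. As a complementary check from the master server, a plain TCP connection attempt shows whether the port is reachable over the network. This is a rough sketch using Python's standard socket module; it only tests that something accepts connections on the port and does not speak the bpcd protocol. The function name and result labels are hypothetical.

```python
import socket


def probe_port(host, port=13782, timeout=5.0):
    """Try a TCP connection and classify the result as a short string."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except ConnectionRefusedError:
        return "refused"
    except socket.timeout:
        return "timeout"
    except OSError as exc:
        # Covers name resolution failures, unreachable networks, and similar.
        return f"error: {exc}"


if __name__ == "__main__":
    # Self-contained demonstration against a temporary local listener.
    listener = socket.socket()
    listener.bind(("127.0.0.1", 0))
    listener.listen(1)
    demo_port = listener.getsockname()[1]
    print(probe_port("127.0.0.1", demo_port, timeout=2.0))
    listener.close()
```

A "refused" or "timeout" result loosely mirrors the conditions behind status codes 204 and 202, but the activity logs remain the authoritative source for the cause.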
On Windows NT servers:
Verify that the bpcd entries are correct in the services file:
%SystemRoot%\system32\drivers\etc\services
Verify that the NetBackup Client Service Port Number and NetBackup Request Service Port Number on the Network tab in the NetBackup Configuration dialog box match the settings in the services file. To display this dialog box, start the Backup, Archive, and Restore interface and click Configure on the Actions menu (also see "Using the Configure - NetBackup Window" on page 57). The values on the Network tab are written to the services file when the NetBackup Client service starts.
Stop and restart the NetBackup services.
See "Testing Slave Server and Clients" on page 18 and "Resolving Network Communication Problems" on page 21.
Create a bpcd activity log directory on the server that has the storage unit and retry the operation. Then, check for additional information in the resulting activity log.
Status Code: 206
Message: access to server backup restore manager denied
Explanation: The master server is trying to start a process on another server (or itself) and the master server does not appear in the NetBackup server list on that server. On a UNIX server, the master is the first SERVER entry in the bp.conf file. On a Windows NT server, the master is designated as the Current server on the Servers tab in the NetBackup Configuration dialog box.
To display this dialog box, start the Backup, Archive, and Restore interface and click Configure on the Actions menu (also see "Using the Configure - NetBackup Window" on page 57).
Recommended Action:
Verify that the master server appears as a server in its own server list as well as being listed on all slaves.
If you change the server list on a master server, stop and restart the NetBackup database manager and request daemons (UNIX) or the NetBackup Database Manager and NetBackup Request Manager services (Windows NT).
If necessary, create activity log directories for the following processes and retry the operation. Then, check the resulting activity logs on the master server:
If the error occurred during a backup operation, check the bpsched activity logs.
If the error occurred during a restore operation, check the bprd activity logs.
Status Code: 207
Message: error obtaining date of last backup for client
Explanation: An error occurred when the backup scheduler (bpsched) tried to obtain the date of the last backup for a particular client, class, and schedule combination.
Recommended Action:
Verify that the NetBackup database manager (bpdbm) process (NetBackup Database Manager service on Windows NT) is running.
Examine the All Log Entries report for the appropriate time frame to gather more information about the failure.
For detailed troubleshooting information, create activity log directories for bpsched and bpdbm on the master server and retry the operation. Then, check the resulting activity logs.
Status Code: 208
Message: failed reading user directed file list
Explanation: An error occurred when the backup scheduler (bpsched) attempted to read the list of files requested for a user backup or archive. This error indicates either a client-server communication problem, or a system problem on the master server where the NetBackup scheduler process (bpsched) is running.
Recommended Action: For detailed troubleshooting information, create activity log directories for bpsched and bprd on the master server and retry the operation. Then, check the resulting activity logs.
Status Code: 209
Message: error creating or getting message queue
Explanation: An error occurred when the backup scheduler (bpsched) attempted to create an internal message queue construct for interprocess communication. This error indicates a problem on the master server and is most likely due to a lack of system resources for System V interprocess communication.
Recommended Action: Create a bpsched activity log directory on the master server and retry the operation. Then, determine the type of system failure by examining the error message in the bpsched activity log.
On UNIX servers, also gather the output of the ipcs -a command to see what system resources are currently in use.
Status Code: 210
Message: error receiving information on message queue
Explanation: An error occurred when one of the backup scheduler (bpsched) processes attempted to receive a message from another bpsched process on an internal message queue construct. This error indicates a problem on the master server and is likely due to problems with or a lack of system resources for System V interprocess communication.
Recommended Action: Create a bpsched activity log directory on the master server and retry the operation. Then, determine the type of system failure by examining the error message in the bpsched activity log on the master server. On UNIX servers, also gather the output of the ipcs -a command to see what system resources are currently in use.
Status Code: 211
Message: scheduler child killed by signal
Explanation: A backup scheduler (bpsched) child process, which interacts with the backup restore manager (bpbrm) on the media server, was terminated. This can occur because of system administrator action.
Recommended Action: Create an activity log directory for bpsched on the master server and retry the operation. Then, to determine the cause of the child termination, examine the messages in the bpsched activity log.
Status Code: 212
Message: error sending information on message queue
Explanation: The backup scheduler (bpsched) encountered an error when attempting to attach to an already existing internal message queue construct for interprocess communication. This error indicates a problem on the master server and is likely due to a lack of system resources for System V interprocess communication.
Recommended Action: Create a bpsched activity log directory on the master server and retry the operation. Then, determine the type of system failure by examining the error message in the bpsched activity log. On UNIX servers, also gather the output of the ipcs -a command to see what system resources are currently in use.
Status Code: 213
Message: no storage units available for use
Explanation: The NetBackup scheduler process (bpsched) did not find any of its storage units available for use. Either all storage units are unavailable, or all storage units are configured for "on demand only" and the class and schedule do not require a specific storage unit.
Recommended Action:
Examine the Backup Status and All Log Entries report for the appropriate time period to determine the class or schedule that received the error.
Verify that the storage unit's drives are not down or waiting for media from a previous operation that did not complete.
Examine the storage unit configuration to verify that all the storage units do not have their "Concurrent Jobs" attribute set to 0.
Verify that the robot number and host name in the storage unit configuration match the Media Manager device configuration.
Determine if all storage units are set to "On Demand Only" for a class and schedule combination that does not require a specific storage unit. If this is the case, either specify a storage unit for the class and schedule combination or turn off "On Demand Only" for a storage unit.
If the storage unit is on a UNIX NetBackup slave server, this error could indicate a problem with bpcd. Check /etc/inetd.conf on the slave server to verify that the bpcd entry is correct. If the storage unit is on a Windows NT NetBackup slave server, verify that the NetBackup Client service has been started on that server.
For detailed troubleshooting information, create a bpsched activity log directory on the master server and retry the operation. Then, check the resulting activity log.
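The selection rule described in the explanation for status code 213 can be modeled in a few lines. This is an illustrative sketch of the logic as stated above, not the scheduler's actual implementation; the StorageUnit type, its fields, and the unit names are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class StorageUnit:
    name: str
    available: bool
    on_demand_only: bool


def usable_units(units, required_unit=None):
    """Return the storage units a class and schedule combination could use.

    With a required unit, only that unit qualifies (if available).
    Without one, "on demand only" units are excluded.
    """
    if required_unit is not None:
        return [u for u in units if u.name == required_unit and u.available]
    return [u for u in units if u.available and not u.on_demand_only]


units = [
    StorageUnit("tape1", available=True, on_demand_only=True),
    StorageUnit("tape2", available=False, on_demand_only=False),
]

if __name__ == "__main__":
    # No unit is named, tape1 is on demand only, tape2 is down: nothing usable.
    print(usable_units(units))
    # Naming tape1 explicitly makes it eligible.
    print([u.name for u in usable_units(units, required_unit="tape1")])
```

An empty result with no required unit corresponds to the condition reported as status code 213; the two remedies in the steps above map to passing a required unit or clearing on_demand_only on at least one available unit.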
Status Code: 214
Message: regular bpsched is already running
Explanation: The NetBackup scheduler (bpsched) performs periodic checking of the class and schedule configuration to determine if there are new backups due. Error 214 indicates that when a new instance of NetBackup starts, it finds that a scheduler process is already checking the class and schedule configuration.
Recommended Action: Usually, no action is required for this condition. However, NEVER kill bpsched before doing some checking. For example, bpsched could be calling bpdbm (NetBackup Database Manager service on Windows NT) to clean up and compress the databases. To determine what the running bpsched is currently doing, examine the bpsched activity log on the master server. If necessary, enable bpsched activity logging by creating a bpsched activity log directory on the master server and retrying the operation. To check for backups do the following:
On a UNIX master server:
Check for active or queued backups by using the job monitor.
Check for active bp processes with bpps. This reveals if there are bpbrm or bptm processes running and a backup is active.
If there is no reason for bpsched to be running, then use kill -HUP to terminate it.
On a Windows NT NetBackup master server:
Status Code: 215
Message: failed reading global config database information
Explanation: During the periodic checking of the NetBackup configuration, the NetBackup scheduler process (bpsched) was unable to read the global configuration parameters.
Recommended Action:
On a UNIX master server, verify that the NetBackup database manager (bpdbm) process is running. On a Windows NT master server, verify that the NetBackup Database Manager service is running.
Attempt to view the global configuration settings by using the NetBackup administration interface.
For detailed troubleshooting information, create activity log directories for bpsched and bpdbm on the master server and retry the operation. Then, check the resulting activity logs.
Status Code: 216
Message: failed reading retention database information
Explanation: During its periodic checking of the NetBackup configuration, the NetBackup scheduler process (bpsched) could not read the list of retention levels and values.
Recommended Action:
On a UNIX master server, verify that the NetBackup database manager (bpdbm) process is running. On a Windows NT master server, verify that the NetBackup Database Manager service is running.
For detailed troubleshooting information, create activity log directories for bpsched and bpdbm on the master server and retry the operation. Then, check the resulting activity logs.
Status Code: 217
Message: failed reading storage unit database information
Explanation: During its periodic checking of the NetBackup configuration, the NetBackup scheduler process (bpsched) could not read the storage unit configuration.
Recommended Action:
On a UNIX server, verify that the NetBackup database manager (bpdbm) process is running. On a Windows NT server, verify that the NetBackup Database Manager service is running.
Attempt to view the storage unit configuration by using the NetBackup administration interface.
For detailed troubleshooting information, create activity log directories for bpsched and bpdbm on the master server and retry the operation. Then, check the resulting activity logs. Ensure that the correct master server is being specified for the connection.
Status Code: 218
Message: failed reading class database information
Explanation: During the periodic checking of the NetBackup configuration, the NetBackup scheduler process (bpsched) could not read the class (backup policy) configuration.
Recommended Action:
On a UNIX server, verify that the NetBackup database manager (bpdbm) process is running. On a Windows NT server, verify that the NetBackup Database Manager service is running.
Attempt to view the class configuration by using the NetBackup administration interface.
For detailed troubleshooting information, create activity log directories for bpsched and bpdbm on the master server and retry the operation. Then, check the resulting activity logs. Ensure that the correct master server is being specified for the connection.
Status Code: 219
Message: the required storage unit is unavailable
Explanation: The class or schedule for the backup requires a specific storage unit, which is currently unavailable. This error also occurs for other attempts to use the storage unit within the current backup session.
Recommended Action: Examine the Backup Status and All Log Entries report for the appropriate time period to determine the class or schedule that received the error. Then, examine the specific class and schedule configuration to determine the required storage unit.
Verify that the schedule specifies the correct storage unit and the storage unit exists.
Verify that the Media Manager device daemon (ltid) is running (if the server is UNIX) or the NetBackup Device Manager service is running (if the server is a Windows NT system). Use bpps on UNIX and the Activity Monitor on Windows NT.
Verify that the Number of Drives attribute for the storage unit is not set to 0.
If the storage unit is a tape or optical disk, verify that at least one of the drives is in the UP state. Use the Device Monitor (on UNIX, xdevadm can also be used).
Verify that the robot number and host in the storage unit configuration match what is specified in the Media Manager device configuration.
Verify that the master server can communicate with the bpcd process on the server that has the storage unit.
Verify that bpcd is listening on the port for connections.
On a UNIX server, executing
netstat -a | grep bpcd
should return something similar to the following:
*.bpcd *.* 0 0 0 0 LISTEN
Do this on the server where the storage unit is connected.
On a Windows NT NetBackup server, executing
netstat -a
prints out several lines of output. If bpcd is listening, one of those lines is similar to the following:
TCP myhost:bpcd 0.0.0.0:0 LISTENING
Do this on the server where the storage unit is connected.
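When the netstat output is long, the listening check can be automated. The sketch below scans captured `netstat -a` text for a listening bpcd socket, using the two sample lines shown above as its model. Output formats vary by platform, so treat the matching rules (and the hypothetical function name) as assumptions to adjust.

```python
def bpcd_is_listening(netstat_output, service="bpcd", port=13782):
    """Return True if any LISTEN/LISTENING line mentions the bpcd service or port."""
    suffixes = (f".{service}", f":{service}", f".{port}", f":{port}")
    for line in netstat_output.splitlines():
        fields = line.split()
        # The state is the last column in both sample formats.
        if not fields or not fields[-1].startswith("LISTEN"):
            continue
        if any(field.endswith(suffixes) for field in fields[:-1]):
            return True
    return False


unix_sample = "*.bpcd *.* 0 0 0 0 LISTEN"
windows_sample = "TCP myhost:bpcd 0.0.0.0:0 LISTENING"

if __name__ == "__main__":
    print(bpcd_is_listening(unix_sample))
    print(bpcd_is_listening(windows_sample))
```

Feed it the captured output from the server where the storage unit is connected, and pass the port chosen at install time if it is not 13782.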
If bpcd seems to be operating correctly, create bpsched and bpcd activity log directories and retry the operation. Check the resulting activity logs for records of an earlier failure.
After each backup, the scheduler checks the storage unit to see how many drives are available (in case the backup caused a drive to be automatically downed). If bpsched cannot communicate with bpcd, it sets the number of available drives in that storage unit to 0 and further backups to that storage unit during this backup session will fail. The number of available drives remains at 0 until the scheduler is initialized again.
If the cause of the problem is not obvious, perform some of the steps in "Resolving Network Communication Problems" on page 21.
Status Code: 220
Message: database system error
Explanation: The bpdbm process (NetBackup Database Manager service on Windows NT) could not create a directory path for its configuration databases due to the failure of a system call. This is usually due to a permission problem or an "out of space" condition.
Recommended Action: Create an activity log directory for bpdbm and retry the operation. Check the resulting activity log for information.