Professional Documents
Culture Documents
Page 1 of 20
Details:
Manual: Media Manager System Administrator's Guide, Page: Appendix
Modification Type: Addition
Modification:
Please review all sections before calling VERITAS support to see if this document can help you
resolve your issue. If not, please place a support call and a NetBackup (tm) Technical Support
Engineer will respond within a timely manner.
Quick overview of acsd and acsssi processes: Unlike the tldcd design for TLD robots that requires
one robotic control host, acsd is a robotic daemon that runs on each media server with Automated
Cartridge System
(ACS) drives attached. Robotic mount and dismount requests are sent from each media server to
the Automated Cartridge System Library Software (ACSLS) server via the acsssi daemon which is
an API call. Tape drive requests to read, write, position, and rewind are done through the device
paths on each media server, not through the Automated Cartridge System Library Software
(ACSLS) server.
For a complete overview of Automated Cartridge System (ACS) and NetBackup (tm), please refer
the Automated Cartridge System (ACS) section in the Media Manager System Administrator's
Guide.
TABLE OF CONTENTS
1. LOGGING
1.1 LOG LOCATIONS
1.1.1 Event Logs
1.2 ACS PROCESS TRACING
1.2.1 ACSSSI Tracing on the VERITAS NetBackup ™ media server
1.2.2 ACSSS Tracing on the Library Server
2 ACS LIBRARY SERVER (ACSLS) FUNCTIONS AND COMMANDS
2.1 ACCESS CONTROL ON THE ACS LIBRARY SERVER (ACSLS)
2.2 ACSSA COMMANDS ON THE ACS LIBRARY SERVER
2.2.1 Log on to the ACSLS Server
2.2.2 Query the Library Management Unit
2.2.3 Query the Cartridge Access Ports
2.2.4 Query silos (Library Storage Modules)
2.2.5 Query Drives
2.2.6 Query Volumes
2.2.7 Command to Start Request Processing
2.2.8 Vary on LSM
2.2.9 Logoff from ACSSA (ACSLS Server interface)
2.3 ACSLS TAPE CLEANING
5 MEDIA
5.1 AVAILABLE MEDIA SCRIPT
5.2 HOW TO SPECIFY WHICH MEDIA ACCESS PORT (MAP) TO USE FOR TAPE
EJECTION
6 COMMUNICATION
6.1 REMOTE PROCEDURE CALL
6.1.1 How to Start RPC on Different Operating Systems
6.1.2 How to Verify that RPC is Running
6.1.3 How to Verify the ACSSS Program Registration
6.1.4 Basic snoop Output
7 COMMON ACS ERROR MESSAGES
7.1 ACS (2) UNAVAILABLE: INITIALIZATION FAILED: UNABLE TO INITIALIZE
ROBOT
7.2 ACS STATUS = 54, STATUS_IPC_FAILURE
7.3 ACS STATUS = 72, STATUS_PENDING
7.4 STATUS_NI_FAILURE
7.4.1 ACS status = 104, STATUS_NI_FAILURE
7.4.2 ACS status = 105, STATUS_NI_TIMEDOUT
1. LOGGING
This section covers ACSLS log locations and ACS process tracing.
Typical event log entries are cap operations, remote procedure call (RPC) and client initialization,
robot errors, NI failures, and drive status changes.
Tracing can be turned on or off multiple times using the same kill command.
NOTE: To read the trace.log, it is necessary to use the StorageTek trace_decode. Please contact
StorageTek for a copy and instructions for use.
b. To turn on CSI tracing, simply send a SIGUSR1 signal to the CSI process. This can be
accomplished by using the kill command as follows:
# kill -USR1 [CSI pid]
NOTE: It is necessary to first get the PID of the CSI using the ps command.
To turn off tracing, use the same method used to turn on tracing. Tracing can be toggled on/off
multiple times using the same command.
For Windows:
Windows event logging is turned on using the mini_el and the ACSSEL function shipped with the
ACS product. The packet trace is controlled using the toggle_trace script. Both tools are in the
NOTE: Do not leave tracing on indefinitely because it may fill disk space over time.
Volume control can be done through the set owner command in the cmd_proc utility or in the file
ownership.assignment. A 'STATUS_INVALID_OPTION' status will return to commands which
are rejected due to access control violations.
To disable TapeAlert checking and eliminate "TapeAlert is not supported" messages in the syslog,
add the NO_TAPEALERT touch file.
For UNIX:
/usr/openv/volmgr/database/NO_TAPEALERT
For Windows:
<install path>\volmgr\database\NO_TAPEALERT
The StorageTek library transport control unit tracks how much tape passes through each transport
and sends a message to ACSLS when a transport requires cleaning. If auto-cleaning is enabled,
ACSLS automatically mounts a cleaning cartridge on the transport. If all the cleaning cartridges
have expired (MAX_USAGE), ACSLS will post an error message 376N into the acsss_event log.
If auto-cleaning is disabled, ACSLS logs a message in the event log and displays a message at the
cmd_proc when cleaning is required.
This option is enabled or disabled using the acsss_config configuration utility. This utility will
allow you to specify how the cartridges are ordered for selection and queries.
NOTE: You cannot use the acsss_config configuration program to enable auto-cleaning for drives
attached to a SCSI connected library storage module (LSM).
For more information regarding ACSLS tape cleaning, please contact StorageTek.
NetBackup does not yet obtain drive serial numbers from the ACS robotic library control interface,
so manual configuration is required. The manual configuration cannot be avoided in a non-Shared
Storage Option (non-SSO) environment, where drives are not being shared. Using NetBackup 4.5
FP6, the user can significantly reduce the amount of manual configuration required by following
these steps in an SSO environment.
1. Run the device configuration wizard on just one of the hosts where drives in an ACS-controlled
library are attached. Let the drives be added as standalone drives.
2. Add the ACS robot definition, and update each drive to indicate its appropriate position in the
ACS robot. (Make the drive robotic, and add the ACS, LSM, Panel, and Drive information.) See
the VERITAS Media Manager System Administration Guide, Configuring Storage Devices
chapter, in the section "Co-relating Device Files to Physical Drives When Adding Drives."
3. Verify the drive paths, if this hasn't already been done in the previous step, based on the
documentation referenced above
4. Once the drive paths have been verified on one host, re-run the device configuration wizard, and
specify all hosts with ACS drives in the library to be scanned. The device configuration wizard will
add the ACS robot definition and the drives to the remaining servers automatically, with correct
device paths, assuming that the devices were successfully discovered, along with their serial
numbers.
By following the above steps, the time savings can be significant. For example, if there are 20
drives shared on 30 hosts, the above configuration steps require just 20 paths to be manually
configured, instead of 600 paths.
3.1.3 Initial configuration of StorageTek T9940A and T9940B tape drives in an ACSLS
environment
It is advised to separate the two drive types within the NetBackup Media Management device
configuration to alleviate density conflicts.
This issue surfaces because ACS treats the T9940A and T9940B drive media as identical, however,
the T9940B version writes at a higher density therefore the T9940A drive cannot read a tape
written by a T9940B drive. So, when trying to use both drives within a single library, different
densities must be used for each drive. The same issue will occur with SDLT220 and SDLT320
drives in the same ACS-based library.
Workaround:
Add the ACS robot to the NetBackup device configuration according to the steps described within
the NetBackup Media Manager Device Configuration Guide.
Configure the STK 9940A drives as type hcart and configure the STK 9940B drives as type hcart2.
Then define a NetBackup storage unit for each density, hcart and hcart2.
When an inventory of ACS robotics is done, NetBackup will receive a vendor media type back as
well as the barcode. That vendor media type is mapped to one of the NetBackup media types. The
tag "IGNORE_WRONG_MEDIA_TYPE" allows NetBackup to map a single ACS vendor media
type to multiple NetBackup media types.
Disadvantages:
1. If for any reason, media is ejected from the library, verification is required when re-injecting
media that it goes to the correct media type (hcart for 9940A, hcart2 for 9940B).
2. T9940B drives cannot be used to read the 9940A media; they have to be segregated.
Robot Selection
---------------
1) ACS 0
2) none/quit
Enter choice: 1
SCSI commands:
where:
<acs>=0-126, <lsm>=0-23, <cap>=0-2, <drive>=0-15,
<priority>=0-127
<drive_id> = [<acs>,<lsm>,<panel>,<drive>]
<drive> = d1 if drive 1, d2 if drive 2, ..., d15 if drive
15
<lwm> = scratch pool low water mark
<hwm> = scratch pool high water mark
<cap_id> = <acs>,<lsm>,<cap>
<vol_list> = <vol1>[:<vol2>:...:<vol42>]
Drive 1 information:
ID (acs,lsm,panel,drv): 0,0,0,0
drive type: DLT7000
volume ID: <none>
state: STATE_ONLINE
status: STATUS_DRIVE_AVAILABLE
Drive 2 information:
ID (acs,lsm,panel,drv): 0,0,0,1
drive type: DLT7000
volume ID: <none>
state: STATE_ONLINE
status: STATUS_DRIVE_AVAILABLE
Drive 3 information:
ID (acs,lsm,panel,drv): 0,0,0,2
drive type: 9840
volume ID: <none>
state: STATE_ONLINE
status: STATUS_DRIVE_AVAILABLE
Drive 4 information:
ID (acs,lsm,panel,drv): 0,0,0,3
drive type: 9840
volume ID: <none>
state: STATE_ONLINE
status: STATUS_DRIVE_AVAILABLE
DRIVE STATUS complete
MOUNT complete
DISMOUNT complete
4.3 HOW TO DEFINE ACSLS SCRATCH POOLS AND ADD VOLUMES USING
ROBTEST
Start the robtest utility:
On UNIX:
# /usr/openv/volmgr/bin/robtest
On Windows:
<install_path>veritas\volmgr\bin\robtest.exe
Select the ACS robot. Enter the 'define pool' command as follows:
defpool 4 0 500 1
Scratch pool 4 has been defined
qpool 4
Pool ID 4 has 0 volumes
QUERY POOL complete
000043 STATUS_SUCCESS
000044 STATUS_SUCCESS
SET SCRATCH complete
qpool 4
Pool ID 4 has 5 volumes
QUERY POOL complete
5 MEDIA
This section covers "available_media" script output and tape ejection.
Below is a sample output from the available_media report, which can be generated by running the
following command:
Windows: <install_path>\netbackup\bin\goodies\available_media
or
UNIX: /usr/openv/netbackup/bin/goodies/available_media
5.2 HOW TO SPECIFY WHICH MEDIA ACCESS PORT (MAP) TO USE FOR TAPE
EJECTION
In the media server's /usr/openv/volmgr/vm.conf file, it is possible to specify the media access
port (MAP) to use when ejecting media to a particular ACS robot. If this entry is present,
NetBackup (including the Vault extension) will eject to the specified MAP instead of the default
0,0,0 MAP.
Example: If a user wants the ACS(0) robot to eject via its 0,0,1 MAP and the ACS(1) robot to eject
via its 0,1,0 MAP, the following vm.conf entries would be necessary on the media servers that use
these robots:
MAP_ID = 0 0,0,1
MAP_ID = 1 0,1,0
6. COMMUNICATION
This section covers RPC and communication.
# rpcinfo
program version netid address service
owner
100000 4 ticots hotdog.rpc rpcbind
superuser
100000 3 ticots hotdog.rpc rpcbind
superuser
100000 4 ticotsord hotdog.rpc rpcbind
superuser
If the service is not running, rpcinfo will report: "can't contact rpcbind: RPC: rpcbind failure -
RPC: Failed ( unspecified error )"
Examine the /export/home/ACSSS/log/acsss_event.log for "RPC: Rpcbind failure." The error
message should include an IP that it is trying to communicate with. Verify it is the correct IP
address.
You should get a response from both programs, but you only need a response from the version that
you are using
(2 = UDP or 3 = TCP). The NetBackup default for this communications service is UDP.
This trace shows that the portmapper and program registration was successful. For additional
assistance with NetBackup options as they pertain to ACSLS, reference the Media Manager System
Administrator's Guide.
acs_query_server() failed
Unable to query server taco, ACS status = 54,
STATUS_IPC_FAILURE Robotic test
utility /usr/openv/volmgr/bin/acstest
returned abnormal exit status (1).
STATUS_PENDING
Cause: local network interface is down.
Example 1:
Example 2:
Media server system log :
Nov 15 11:18:06 hoehpt07 acsd[8807]: ACS(0) Response has
not been returned by Mount command sequence 4434, ACS
status = 72, STATUS_PENDING
Nov 15 11:28:06 hoehpt07 acsd[8807]: ACS(0) Response has
not been returned by Mount command sequence 4434, ACS
status = 72, STATUS_PENDING
Cause:
The errors above indicate that NetBackup is able to reach the ACSLS server with a status request,
but the ACSLS server is unable to respond.
By default, the IP address sent to the ACSLS as part of the packet STATUS request is that of the
primary hostname of the media server, i.e. the hostname given by uname -a. This error occurs
when the ACSLS cannot resolve reverse name or is configured (via routing) to only reach a
secondary interface on the media server.
If the issue is failure to do reserve name lookup, add the media server's IP to the domain name
If the ACSLS server cannot route to the media server's hostname, override the default behavior by
using "ACS_SSI_HOSTNAME = <hostname>" in the /usr/openv/volmgr/vm.conf file where the
value for <hostname> is a hostname associated with an IP address the ACSLS can reach on the
media server.
7.4 STATUS_NI_FAILURE
Explanation of message: STATUS_NI_TIMEDOUT:
The CSI (media server) has timed out waiting for a response from a client (ACSLS). The actual
"Waiting to obtain XXXXX ACS sequence YYY acknowledge response" indicates that the daemon
has not received an acknowledgment from the LibraryStation module for 30 seconds (which is a
hard-coded limit) following a sent command. "Unable to obtain" means it has given up.
The timeouts seen occur after a 30 second delay, when no acknowledgment has been received.
These timeout periods are determined by two tunable environment variables:
CSI_RETRY_TIMEOUT - The default for which is 3 seconds, not 2 as described in the Media
Management manual.
CSI_RETRY_TIMEOUT=30
CSI_RETRY_TRIES=10
or
#CSI_RETRY_TIMEOUT=30;export CSI_RETRY_TIMEOUT
#CSI_RETRY_TRIES=10;export CSI_RETRY_TRIES
a. Ensure that the media server can successfully ping the ACS server and vice-versa
b. Check that ltid has started the acsd, acsssi and assel service daemons on the VERITAS media
server by performing a bpps -a from /usr/openv/netbackup/bin or vmps
from /usr/openv/volmgr/bin/volmgr. If acsssi is not running, ensure that RPC is running
c. Run snoop between the media server and ACS. Initiate robtest and check that the query server,
sent upon robtest initialization, is responded to by the ACSLS server (reference basic snoop
below)
d. Verify the RPC communications between the media server and ACSLS host using the
rpcinfo command. The rpcinfo -t <acs_host> 300031 1 h command
checks RPC connectivity, portmapper registration and that the ACSLS program (service) is
available (reference 'How to start RPC' above).
e. Check the event logs from the media server and library server for errors (reference Logging
above)
h. This error can occur if the users.ALL.allow file on the ACSLS Library Server does not contain
an entry for the requesting media server. This file is found on the ACSLS server in the
$ACS_HOME/data/external/access_control directory and is used for granting/denying library
access. For example, entries of the users.ALL.allow consult the ACSLS administrators guide.
acs_response() failed
Unable to obtain Query Server acknowledge response, ACS
status = 104, STATUS_NI_FAILURE
Robotic test utility /usr/openv/volmgr/bin/acstest
returned abnormal exit status (1).
acsssi event.log:
02-05-04 13:31:54 SSI[0]:
ONC RPC: csi_rpccall(): status:STATUS_NI_FAILURE; failed:
clntudp_create()
RPC UDP client connection failed, RPC: Program not
registered
Remote Internet address:10.82.56.67, Port: 0;
Cause: ACSLS is down or not responding
Related Documents:
246747: NetBackup DataCenter 4.5 Media Manager System Administrator's Guide for UNIX
http://support.veritas.com/docs/246747
264249: VERITAS NetBackup (tm) 5.0 Media Manager System Administrator's Guide For UNIX
http://support.veritas.com/docs/264249
268103: VERITAS NetBackup (tm) 5.1 Media Manager System Administrators Guide for UNIX
http://support.veritas.com/docs/268103
Products Applied:
NetBackup DataCenter 4.5, 4.5 (FP3), 4.5 (FP4),
4.5 (FP5), 4.5 (FP6), 4.5 (MP1), 4.5 (MP2), 4.5
(MP3), 4.5 (MP4), 4.5 (MP5), 4.5 (MP6)
NetBackup Enterprise Server 5.0, 5.0 MP1
Languages:
English (US)
Operating Systems:
HP-UX
11.0, 11.11
Solaris
7.0 (32-bit), 8.0 (32-bit), 9.0 (32-bit)
THE INFORMATION PROVIDED IN THE SYMANTEC SOFTWARE KNOWLEDGE BASE IS PROVIDED "AS IS" WITHOUT WARRANTY
OF ANY KIND. SYMANTEC SOFTWARE DISCLAIMS ALL WARRANTIES, EITHER EXPRESS OR IMPLIED, INCLUDING THE
WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. IN NO EVENT SHALL SYMANTEC
SOFTWARE OR ITS SUPPLIERS BE LIABLE FOR ANY DAMAGES WHATSOEVER INCLUDING DIRECT, INDIRECT, INCIDENTAL,
CONSEQUENTIAL, LOSS OF BUSINESS PROFITS OR SPECIAL DAMAGES,EVEN IF SYMANTEC SOFTWARE OR ITS SUPPLIERS
HAVE BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. SOME STATES DO NOT ALLOW THE EXCLUSION OR
LIMITATION OF LIABILITY FOR CONSEQUENTIAL OR INCIDENTAL DAMAGES SO THE FOREGOING LIMITATION MAY NOT
APPLY.