Saturday, April 27, 2019

Commvault frequently asked interview questions - With best described answers

If you're looking for CommVault Interview Questions & Answers for Experienced or Freshers, you are at right place. There are lot of opportunities from many reputed companies in the world. According to research CommVault has a market share of about 4.3%. So, You still have opportunity to move ahead in your career in CommVault Administration. Mindmajix offers Advanced CommVault Interview Questions 2019 that helps you in cracking your interview & acquire dream career as CommVault Administrator.

Q: What is Commvault?
Commvault is a data management platform where your data can be protected using its software products. The software is capable of taking up backups and also has a recovery management process
Q: What are the features of Commvault?
1. The commvault software is entitled to deliver access, protection and sharing of the data in a reliable and in a secure connection.
2. It provides enhanced productivity which has secure file sync system
3. It reduces the risk of data loss with the help of backup software
4. Using this software, it can reduce the cost of protecting remote and mobile data
5. One can access the data from anywhere
Q: Let us know what are the other different ways of doing High availability?
We can do the following:
1. Clustering SQL level
2. Log shipping
3. SQL mirroring
Q: Explain what is a storage policy?
The storage policy usually depends upon the client backup and storage plans. It defines what amount of data needs to be stored and what type of information can be stored. Further, the data stored is in a secured environment or not.  The following are the three different types of copies that are available within the storage policy for commvault:
1. Primary snap copy
2. Primary disk copy
3. Offsite disk/tape copy
Q: What are the types of copies that are available and explain them in detail?
We have only on primary copy and to that, we will be having different types of auxiliary/ secondary types of copies. The list of copies that are available follows:
1. Selective Copy
2. Inline Copy
3. Synchronous Copy
1. Selective Copy:
A selective copy is a process which will allow an individual to copy a specific set backups from a source copy. The source can be one of the following, asynchronous copy or a primary copy.  A selective copy works better for tape rotation.
Inline Copy:
Within this copy phase, both the primary and secondary copies are written to a storage so that it will be available during the backup phase. If the secondary storage doesn’t have the required space then the backup job will fail.
Synchronous copy:
Within synchronous copy process, all the data of the primary is copied over.
Q: Explain the process of moving the DDB to the new media agent?
The following is the process of moving the DDB to the new media agent:
1. Right-click on the mount path option
2. Click on share mount path option
3. Point it towards the new Media agent
4. Set the default mount path
5. Tape library configures and change the data path.
Q: What is Intei snap? Explain the process?
The inteli snap is a process where it takes a snapshot and then it resumes all the live operations.
Intellisnap backup works in line with the storage arrays so that it can provide a snapshot functionality for the backups.
Q: Explain the main use of proxy server?
The use of the proxy server is mainly to reduce the load on the production server. The backup operations generally use the proxy servers to move the snapshot of the information to backup media.
1. The storage array is generally accessed to create a snapshot of the information
2. The snapshot that we can capture is used to mount of the ESX servers which are designated for the snap mount.
3. The snapshot is then unmounted for further utilization.
Q: What happened in Backup copy operations in the proxy server?
During the backup copy operations the following activities are executed:
1. The snapshot of the information is mounted to the ESX server which is solely designated for the snap mount.
2. Once the snap is mounted then it is treated as a file system and the required contents are processed and read
3. The backup of the file system is performed to the Primary copy of the storage policy.
4. Once the backup copy job is finished, then the snapshot will be unmounted.
Q: What are the common/regular backup failures that one experiences most of the time?
Some of the common or more frequent backup issues that are reported as follows:
1.  Network communication issues
2. Client services might be shut down for maintenance
3. Media agent offline
4. Destination disk library full
5. Data path offline
6. VSS snapshot backup failure
Q: How can one use only a specific set of tapes for a server backup or for a list of server backups?
You can create a new scratch pool and all the required media can be moved to the scratch pool. A sub-client is attached to a scratch pool which will be used. This process can also be done at two levels they are:
1. Storage policy level
2. Copy policy level
Q: What is Silo ( de-duplication to Tape)?
First of all the media is de-duplicated onto the disk which is configured on global-dedup storage policy. Once this process is completed when the data is de-duplicated to the tape. Thus the name came up as de-duplication.
Q: Explain what is a hash algorithm?
The hash algorithm is an algorithm which is applied to a specific set of data where the data is read from the client machine and a unique signature is generated which is used to execute the deduplication process.
Q: What is the default block size set for de-duplication which are enabled for storage policies?
The default block size for deduplication which is enabled for storage policies is set to 128KB
Q: What percentage of the deduplication ratio that you were able to maintain so far?
This can be a generalized answer and the ratio that you were able to maintain is as follows:
1. About 90% or more when it comes to file systems backups
2. Between 70% to 95% when it comes to the database backups
Q: Please explain how does a VMware backup works?
The VMware backups are processed in the following manner:
1. Usually, VMware snapshot backups are performed using the proxy servers.
2. Within the proxy server, we will be installing a software called Virtual server agent, i.e. (VSA)
3. So commvault actually communicated via Vcenter.
4. Where a snapshot of VM is performed and it is mounted in proxy server only.
5. From the proxy server, the snapshot image is backed up with the backup process implementation and it will be fed to the media agent.
Q: Explain the process of configuring a cluster backup?
The following process will help an individual to configure a cluster backup:
>> First of all create a cluster group
>> Add all the client machines into it so that a synch action will be performed
>> It prompts for installation on the passive node for windows system.
>> Complete the installation
>> Then schedule a backup process for the same.
Q: What are the different types of transport modes are available in Vmware backup?
The following are different types of transport modes that are available in Vmware backup:
1. Transport Mode SAN
2. HotAdd
3. Local area network
Q: Explain what is Transport mode SAN in detail?
Within this mode:
** SAN model is actually directly connected to a storage system and it is supported.
** The virtual server agent will come into the picture which will have access to the storage system
** Once the data is read from the storage it will remain in Vmware's.
** The ESX host is only connected or coordinates the access to the LUN.
Q: Explain what is the Hotadd mode in detail?
Within this mode:
>> The Virtual server agent is installed on the VM which is actually residing on the actual ESX server.
>> The data is automatically mounted onto a proxy
>> The data that is mounted on the proxy will be accessed as a local disk later on in the process.

Q: Explain what is Local area network mode in detail?
1. NBD stands for Network block device
2. NBDSSL stands for Encrypted Network block device.
The data is transmitted or passed via a secure TCP/IP connection between the ESX server and the proxy computer. This is the process where the data is passed on within NBD mode.
The above process is same and the data is transmitted via a secure TCP/IP connection between the ESX server and the proxy computer but the data will be encrypted for NBDSSL.
Q: List out all the day-to-day activities of a backup admin?
First thing first, the backup admin will have to monitor whether all the scheduled jobs are running as per they are designated. If not then they have to go through the jobs and understand the fault and rectify the same.
Further, they have to involve themselves with some of the health checkups:
** Have to check for Media agent so that they are not in offline mode
** Have to make sure and regularly check the free space or the remaining space that is available in the disk libraries
** Extract a backup report which is failed and analyze the same so that the rectification process can be continued.
** Check regularly for deployment, decommissions, restores requests within the reporting tool.
Q: Please explain about the infrastructure that you have worked on previously?
This should be an honest reply back to the interviewer because this is a real-time question which can be answered based on the experience available.
I worked in an infrastructure where we had
1 production comm server, 10-20 pair of media agents and have backed up around 200 servers which includes a regular file system.
Q: What is the main product of Commvault?
They are into data protection and information management service businesses. Using their products most of the organizations are helped out with their data and backup and recovery processes with the help of cloud infrastructure and management.
Q: Is commvault software is an enterprise level software?
Yes, it is an enterprise level software.
Using Commvault software is simply because it is built up from scratch and has different modules which can be used independently based on the need.
Q: What does IRM stand for in commvault?
IRM stands for IntelliSnap recovery manager.
It describes all the recovery management concepts within the best practices.
Q: What is the main difference between a clone and a snapshot?
The clone is like a copy of everything that is available within the database. When it comes to Snapshot it is more sort of versioning. So the backups that we take has a certain timestamp. It is similar to that of our restore points in our windows systems. So when the system crashes we have a restore point to start with. The same way snapshots work and this is more catering towards the data availability from a certain point-in-time.
Q: Brief few bullet points on Intellisnap technology?
Intellisnap technology:

1. This technology is developed as lightweight
2. It is a standalone solution which is used for automating snapshot management
3. It is an application-aware where multiple storage arrays can be recovered.
4. It speeds up the recovery management system
5. It helps in maintaining and archiving database snapshots
Q: Where can we access reports in commvault?
All the reports can be accessed within web console or on the clouds server sites and also available in the commcell console.
Q: Explain what is a Commvault appliance?
The commvault appliance is nothing but an end to end solution which helps in extending the data protection aspects and also helps in the storage capabilities using simpana platform.  The commvault appliance provides following benefits, they are:
1. Provides a simple configuration system
2. Provides a simple management system
3. It helps you to meet your enterprise-level data protection requirements

Please add this page in your bookmarks as we keep update this page with more frequently asked questions.

Thursday, March 28, 2019

Enabling Automatic Backups of the Oracle Control File at the Subclient Level


This process applies to both: Oracle and Oracle RAC

Use this additional setting to override the automatic backups of control files on the instance level.

When you enable this additional setting and perform a backup on an Oracle subclient, the Commvault software ignores the control file automatic backup option that is set for the instance. If the subclient that you back up contains the control file, then  AUTO BACKUP CONTROLFILE ON is added to the backup script.

Procedure

  1. From the CommCell Browser, expand Client Computers.
  2. Right-click the client, and click Properties.
  3. On the Properties dialog box, click the Advanced button.
  4. On the Additional Settings tab of the Advanced Properties dialog box click Add.
  5. In the Add Additional Setting dialog box:
    1. In the Name box, type sCTRLFILE_AUTOBACKUP_ATSUBCLIENT.
    2. In the Category box, type OracleAgent.
    3. In the Type box, select String.
    4. In the Value box, enter Y.
    5. Click OK to close the Add Additional Setting dialog box.
  6. Click OK to close the Advanced Properties dialog box.

Create a Subclient - CommVault Step by Step tutorial

Use this procedure to create a subclient named "Test_Subclient" to perform your first backup.  
Procedure
  1. From the CommCell Browser, expand Client Computers > client> Agent.
  2. Right-click the defaultBackupSet, point to All Tasks and then click New Subclient.
    The Create New Subclient dialog box appears.
  3. The default subclient is automatically created when you install the software.
    The content of the default subclient includes all of the data in the computer.
    When you create a user-defined subclient, the contents of the user-defined subclient are automatically removed from the default subclient content.
  4. On the General tab, in the Subclientname box, type a name for the subclient.
    Note: In this example, we will use Test_Subclient.
  5. On the Content tab, click Browse.
  6. Navigate through the list of folders to select the files and folders that you want to include in the backup.
    In this example, we will include the My Documents folder as the subclient content.
    Select My Documents and then click Add.
    Click Close to close the Browsing content for dialog box.
  7. To create a storage policy, click the Storage Device tab and then click Create Storage Policy.
  8. In the What will this storage policy be used for? dialog box, click Data Protection and Archiving and then click Next.
  9. In the Enter the storage policy name dialog box, type the name of the Storage Policy in the Storage Policy Name box.
    In this example, we will use Test_Storage_Policy.
    Click Next.
  10. In the Select a default library for this primary copy dialog box, select a disk or tape library from the list.
    Note: The disk or tape library was configured previously. See Commvault Software Quick Start Guide - Configure a Storage Device.
    Click Next.
  11. In the Select a MediaAgent for this copy dialog box, select a MediaAgent from the list.
    Click Next.
  12. In the Enter the streams and retention criteria for this policy dialog box, click Next.
  13. Select the Software Encryption option to enable encryption on copies including the primary copy.
    Click Next.
  14. In the Do you want to enable Deduplication for the primary copy dialog box, select Yes and click Next.
  15. In the Specify the location to store the Deduplication Store dialog box, select a MediaAgent from the MediaAgent Name list.
    In the Location box, type the path name where the deduplication database must be located or click Browse button to select the folder.
    Click Next.
  16. In the Review your selections dialog box, confirm the details and then click Finish.
  17. In the Subclient Properties dialog box, click OK.
  18. In the Backup Schedule dialog box, select Do Not Schedule and then click OK.
  19. The new subclient appears under the defaultBackupSet in the CommCell Browser.

Hope this helps to create a subclient for a server backup in detailed, please visit other Commvault tutorial from the top menu bar,

Source: me

Wednesday, February 27, 2019

NetBackup status code: 54 - Error Code details and Solution (Recommended Actions to resolve)



NetBackup status code: 54

Message: timed out connecting to client

Explanation: The server did not complete the connection to the client. The accept system or winsock call timed out after 60 seconds.

Some third-party software packages (for example, a personal firewall product) can affect the TCP/IP stack in Windows. This action can cause a loss of connection between the NetBackup server and the bpcd process on the client. NetBackup tries to set SO_REUSEADDR (allow local address reuse) on the inbound socket connection so that the port can be handed off from bpinetd.exe (the NetBackup Client Service) to bpcd.exe. Some products may not allow this functionality due to the various methods that can be used to violate system security.

Recommended Action: Do the following, as appropriate:

  • For a Macintosh or NetWare target client: Verify that the server does not try to connect when a backup or restore is already in progress on the client. These clients can handle only one NetBackup job at a time.

    On a Macintosh, check for activity by examining the NetBackupListen file in the following folder on the startup disk of the Macintosh client:

    :System Folder:Preferences:NetBackup:logs:inetd:log.mmddyy  
  • Perform the following procedure:

    See "Resolving network communication problems" in the Troubleshooting Guide.

  • On UNIX and Linux clients, verify that the /usr/openv/netbackup/bin/bpcd binary exists and that it is the correct size.

  • Check the /etc/inetd.conf file to make sure the bpcd path is correct in the following entry:

    bpcd stream tcp nowait root /usr/openv/netbackup/bin/bpcd bpcd  
  • On the systems that include the following, make sure that the client name is in the master's /etc/hosts file: NetBackup master, media, and clients (with NetBackup database extension products installed on one or more clients).

  • Completely uninstall the third-party software package on the client that causes the failure. Or, contact the software manufacturer to investigate if other configuration options or workarounds are possible.


    Netbackup (NBU) Important Error Codes with solution - NBU Status Codes

    Following are the most required and must known NBU status code which need to know by every NetBackup administrator. 



    1. Status code 2
    Reason: None of the file backed up
    Action taken: no files in target path

    2. Status code 13
    Reason: File read failed
    Action taken: network connectivity

    3. Status code 25
    Reason: Cannot connect to socket
    Action taken: bpcd daemon want to check .

    4. Status code 50
    Reason: Client process aborted
    Action taken: restart the backup manually wants to check any errors

    5. Status Code – 59
    Reason: Access to the client was not allowed
    Action Taken: want to check the bp.conf / master-client access connectivity

    6. Status code 71
    Reason: Backup taking path changed
    Action taken: path should be correct

    7. Status Code – 84
    Reason: Reduce the backup failure due to I/O error.
    Action Taken: Clean the media mounts and to change the tape
    default parameters to reduce backup failures due to I/O.

    8. Status code – 96
    Reason: Backup failure due to unavailable at scratch pool
    Action taken: Volume pool has been allocated to scratch pool

    9. Status code – 129
    Reason: Disk storage unit is full
    Action taken: remove old images

    10. Status code – 196
    Reason: Client backup was not attempted becoz backup window
    close/elapsed time.
    Action taken: manually restart the backup, if exceeds changes
    the timing frequency and backup window frequency.

    11. Status code – 2001
    Reason: Tape library down error/Robotic path changed
    Action taken: Manually bring up the robot.



    Tuesday, February 26, 2019

    NetBackup 7.x Backup Process Flow: Step 1-17 process explained in detail

    NetBackup 7.x Backup Process Flow





    1. When a PolicyClient task has its timer expire (indicating that it is due to run) an internal job task is created within nbpem that sends a Job Start to nbjm for the job which is due. nbpem provides to nbjm the parameters indicated in the backup policy and schedule that is generating the job.


    2. nbjm adds the job to its job list or queue. It then communicates with bpjobd to inform it of the job, at which time the job becomes visible in the Activity Monitor in the queued state until resources are allocated for it.






    3. nbjm sends a resource allocation request to the Resource Broker, nbrb, indicating the resources which are required for the backup operation and any resource consumption constraints for the job, including max jobs per policy, max jobs per client, and max jobs this client. These resource consumption constraints were provided to nbjm by nbpem when the job was initiated.


    4. nbrb requests resources from the EMM service, nbemm, including storage unit, storage unit group, media, and devices or drives.


    5. When physical resources are available, nbemm will allocate them and respond to nbrb, which in turn responds to nbjm. With resources allocated for the job, nbjm will notify bpjobd and the job moves to the active state.


    6. nbjm is responsible for creating the files in the Images database that will house the backup information, the Header file and the Files (.f) file. nbjm initiates this activity by communicating with bpdbm (via nbproxy).


    7. nbjm uses bpcompatd to communicate via PBX to start bpbrm on the media server that will write the backup image. The media server is selected based upon the destination storage unit that is selected.






    8. bpbrm on the media server starts bpbkar (the client's backup and archive process) on the client system. bpbrm also starts bptm on the media server.


    9. bptm initiates a connection with nbjm in order to get media and drive information for the backup job, which nbjm returns through a separate connection it initiates.


    10. bptm will then initiate the mount of the media (tape) specified on the drive specified, or the mount of the disk specified. It will also spawns a bptm child process to receive the image from the client. The details of the Media Manager daemons (ltid, txxd, txxcd, and avrd) involved in the mounting of the media on the drive are not shown in this illustration, to reduce the complexity of the illustration.


    11. bpbkar sends information about the backup image to bpbrm which forwards it to bpdbm on the master server. This stream of metadata is sent throughout the backup and stored in the master server's Image database.


    12. When mounting and positioning of the media in the drive, or of the disk to be used, have been accomplished, the client backup process, bpbkar, will begin sending backup data to the bptm child process on the media server system. The bptm child process receives the image and stores it block by block into a shared memory segment on the media server. The parent bptm process retrieves the image from shared memory and directs it block by block to the allocated storage media.







    13. When the backup has been completed bptm will notify bpbrm, which in turn will notify the Job Manager nbjm that the job has finished bptm will also notify nbjm that it is done with the media.

    14. While the client and media server processes invoked to perform the backup operation (bpbrm, bptm, and bpbkar) are terminated, nbjm will update the status for the job by communicating with bpjobd. The job will be changed to Completed status, and the ending status of the job will be recorded.

    15. nbjm communicates with bpdbm (using nbproxy) to complete the writing and verification of the files for the backup image in the Images database.

    16. With the backup job completed nbjm will de-allocate the resources used for the backup by communicating with nbrb.

    17. nbjm will notify nbpem that the job has been completed The completion status will be included in this notification. The PolicyClient task that created the job is responsible for requesting a re-try operation for this job on failure, or for computing the new 'due time' for this job on success.

    Available Backup Types in Netbackup (Types of Backup)

    1. Full Backup
    Is the starting point of all the backups, contains all the data in the folders and files that are selected to be backed up. 

    Advantages: Restore is fast
    Disadvantages: Backup is slow and consumes more space.

    2. Differential Incremental
    Takes the backup of the files changed since the latest backup.  Archive bit is reset. Backup is taken based on the time stamp.

    Monday, February 25, 2019

    Netbackup Important Error Codes & its Solutions

    Based on my experience in the daily issues level I have shorted few must know backup failure issues status codes which I have mentioned here.

    The following are the Veritas Netbackup important error codes and its solutions.

    1. NBU Status code: 2
    Reason: None of the file backed up

    Error bpsched(pid=XXXXX) backup of client SQLHOST exited with status 2 (none of the requested files were backed up)

    Action taken: no files in target path. 

    Enable the dbclient logfile on the SQL server. 

    Veritas NetBackup (tm) SQL Agent is notconfigured to use Windows NT Authentication. 

    Incase of SQL agent issue: Configure VeritasNetBackup (tm) SQL Agent to use Windows NT Authentication.

    2. NBU Status code 13
    Reason: File read failed

    Corresponding example from the UNIX /usr/openv/netbackup/logs/bpbrm/log.<date>file:
    <16> bpbrm readline: socket read failed, An existing connection was forcibly closed by the remote host. (10054)
    <2> inform_client_of_status: INF - Server status = 13
    <2> put_long: (11) network write() error: An existing connection was forcibly closed by the remote host. ; socket = 496
    <16> inform_client_of_status: could not send server status message

    A Status 13 will occur due to network issues on the master or client.  This error indicates a read operation of a file or socket failed.  This can also occur for Flash Backup or Advanced Client backups.

    Backups fail with Status Code 13 "file read failed", indicating that a read of a file or socket has failed.  Winsock errors 10054 and 10053 may also be seen in the bpbkar log on the client. 
    Action taken: network connectivity, check the below basic troubleshooting steps.

    • Ensure that the latest service packs for all products and components (SQL, Exchange, Notes, etc.) have been installed.
    • Ensure that all the network settings throughout the environment (NICs, hubs, switches, routers, etc.) are set to full duplex, not half duplex
    • Increase the timeout settings on the NIC, if available.
    • Try a different NIC, if available.
    • If NIC teaming is implemented, deactivate for testing purposes.


    3. Status code 25
    Reason: Cannot connect to socket

    The master server is getting a status code 25 (cannot connect on socket) error when attempting to bring up the client host properties using the GUI or remote admin console. 

    Action taken: bpcd daemon want to check.

    When troubleshooting status 25 errors on a NetBackup client, verify that the client was working prior to the issue. If it had been working try to determine what changes may have been made to the client server's OS or the network links

    If there is no major changes, do the following basic troubleshooting: 

    1. To test the master/media server resolution of the client server hostname run the following command:
      • <install path>/netbackup/bin/bpclntcmd -hn <client hostname>
    2. Since reverse lookups is part of the NBU server to client connections make sure the client can also be resolved by its IP address:
      • <install path>/netbackup/bin/bpclntcmd -ip <client IP address>
    3. On the client test the resolution of the NBU servers by issuing the same commands. These commands should be run against the master and all of the media servers that may be trying to backup the client server:
      • <install path>/netbackup/bin/bpclntcmd -hn <NBU server hostname>
      • <install path>/netbackup/bin/bpclntcmd -ip <NBU server IP address>
    4. Verify you are able to "ping" the client's IP address from the NBU server. If this fails consult with your Network Administrator and client server System Administrator to resolve the layer 3 or IP network connectivity.
    5. Double check the server's NIC's IP address and netmask to ensure they are configured correctly.

    4. Status code 50
    Reason: Client process aborted

    The NetBackup Policy Execution Manager (NBPEM) ran out of memory and crashed causing all active jobs to fail with Status Code 50 - client process aborted  
    Action taken: restart the backup manually wants to check any errors

    1. Recycling the NetBackup services will temporarily restore functionality to the master server until the NBPEM process reaches the imposed 4GB memory limit. 

    2. The solution is to set to maxdsiz_64bit 8GB in bp.conf .

    5. Status Code – 59
    Reason: Access to the client was not allowed


    Action Taken: want to check the bp.conf / master-client access connectivity

    A status code of 59 commonly occurs when the client does not have the NetBackup master or media server properly defined in the / usr/openv/netbackup/bp.conf file. There are a number of well-documented and effective ways to troubleshoot this problem, (e.g., creating a bpcd log on the client and then re-attempting the backup) but the UNIX last command is a quick and simple way to clearly establish the cause of this problem. 

    Steps to follow to fix the issue:

    1. If this is not a name resolution problem, add a "SERVER = BUServer" entry to bp.conf on NBUClient 

    OR 

    2. If this is a name resolution problem, correct the name resolution configuration ( /etc/hosts file, DNS or NIS maps) on NBUClient so that the above scenario would show NBUServer being returned in the output of the last command 

     3. In addition If the NetBackup Client being backed up is virtual (vmware).  The policy must be set as "Vmware policy type" if NetBackup is accessing the vsphere server to backup the server.  The other option is the NetBackup client needs to be installed on the virtual server being backed up.

    6. Status code 71
    Reason: Backup taking path changed

    A Status 71 error "none of the filesin the file list exist" occurs.  However, it is known for certain that thebackup selections specified in the policy exist on the clients inquestion. 

    Action taken: path should be correct

    Then proceed with the following: 

    1. Expand Policies in the left pane 

    2. Double click the name of the policy that has failed with Status 71 

    3. In the Change Policy window, click the Files or Backup Selections tab (the tab name varies depending upon version) 

    4. Highlight an entry in the file list and click Rename. Then highlight the entire entry with the mouse.  See if there is a space at the end of the listing.   shows a file list entry that has a trailing space. 

    Remove the trailing space if one is present. Check all other file listentries as needed.  Click OK in the Change Policy window and run thebackup again 

    7. Status Code – 84
    Reason: Reduce the backup failure due to I/O error.

    Backup jobs fail with a NetBackup Status Code 84 (media write error) and the system's device driver returns an I/O error while NetBackup is writing to removable media or a disk file.


    Action Taken: Clean the media mounts and to change the tape

    default parameters to reduce backup failures due to I/O.

    Turn logging up to Verbose = 5 for the bptm process on the problematic media server.  Capture the problemat the higher logging level and examine the resulting log file inside the <install_path>\netbackup\logs\bptm log folder. 

    Additionally,examine the Application Event Log for NetBackupErrors. 

    Some of errors can be caused by a faulty SCSI card.  Replace thefaulty SCSI card. 

    8. Status code – 96
    Reason: Backup failure due to unavailable at scratch pool

    Similar Error log: invalid volume pool (90)unable to allocate new media for backup, storage unit has none available (96)

    Action taken: Volume pool has been allocated to scratch pool

    When duplicating tapes, verify the destination volume pool name is not defined as the same volume pool name which is configured as the scratch volume pool. The -dp option for the bpduplicate command defines the destination volume pool name.

    9. Status code – 129
    Reason: Disk storage unit is full

    Backups to disk storage units fail with a VERITAS NetBackup (tm) Status 129 because the storage unit is full.

    Action taken: remove old images

    There are several methods that can be used to reclaim disk space on a disk storage unit.  Options such as, but not limited to, expiring older images, using an alternate storage unit, changing the retention level used and adding more disk space to the disk storage unit. 

    Expire older images from the disk storage unit to reclaim disk space on the file system. 


    10. Status code – 196
    Reason: Client backup was not attempted becoz backup window
    close/elapsed time.

    Action taken: manually restart the backup, if exceeds changes
    the timing frequency and backup window frequency.

    client backup was not attempted because backup window closed 

    This inability to allocate drive can cause the backup window to be closed. This can lead to an EMM server going down and getting various EMM errors including: 

    Unable to obtain the server list from the Enterprise Media Manager server. Database Server is down (93) 

    Verify there are no ACTIVE jobs. Deactivate and cancel all jobs. 

    Steps: 1. Stop all NetBackup services on the master server: 

    Windows: <install path>\Netbackup\bin\bpdown -v -f 

    UNIX: /usr/openv/netbackup/bin/goodies/netbackup stop 

    2. Check for active processes using command - <install path>\netbackup\bin\bpps –a 

    3. Clear / kill them if you find any 

    4. Restart all NetBackup services on the master server: 

    Windows: <install path>\netbackup\bin\bpup -v -f 

    UNIX: /usr/openv/netbackup/bin/goodies/netbackup start 

    5. To see allocations (likely due to the EMM connection error): 

    <install path>\NetBackup\bin\admincmd\nbrbutil -dump 

    6. To clear the allocations run the following: 

    <install path>\NetBackup\bin\admincmd\nbrbutil -resetall 

    7. Run a regular backup and confirm that it is writing to tape. 

    8.  Reactivate the jobs that were previously deactivated and confirm that they also now are running. 

    11. Status code – 2001
    Reason: Tape library down error/Robotic path changed
    Action taken: Manually bring up the robot.

    Only command vmoprcmd without parameters tells the actual status of drives. Any attempt of bringing of downed drives up with vmoprcmd ended with message: The drive is not ready or inoperable.

    The solution is to deleting of downed drives with tpconfig, then tpautoconf -a and restarting of NetBackup.