Status code 50 – nbrb status: NBRB deallocated orphaned resources

Symptom:
Backup jobs spanning media exit with status code 50 “client process aborted” – Error nbrn status: RB deallocated orphaned resources. Master and media servers are separated by firewall.

Cause:
When a application opens a connection (socket) it uses the OS keepalive option. However the default keepalive value may be larger than the firewall’s tcp_close_interval. If this happen, neither a master or media server will be able to reuse an exiting connection as i have been dropped by the firewall. This will result in various COBRA/COMMS failures.

Cisco firewall’s are by default configured to close inactive TCP sessions after 1 hour.

Resolution:
Set the OS to send “keep alive” at a value lower than the firewalls TCP close interval on both master and media servers.

Temporary change (Linux):
echo 1800 > /proc/sys/net/ipv4/tcp_keepalive_time

Permanent (Linux):
Add net.ipv4.tcp_keepalive_time=1800 to /etc/sysctl.conf and issue command “sysctl -p”

Netbackup version:
Netbackup 6.5.x 7.x

How to set keep alive on other OS:
DOCUMENTATION: COMM_FAILURE as a consequence of reusing a transport that has been inactive across a firewall

(Visited 3,434 times, 1 visits today)