Backup jobs spanning media exit with status code 50 “client process aborted” – Error nbrn status: RB deallocated orphaned resources. Master and media servers are separated by firewall.
When a application opens a connection (socket) it uses the OS keepalive option. However the default keepalive value may be larger than the firewall’s tcp_close_interval. If this happen, neither a master or media server will be able to reuse an exiting connection as i have been dropped by the firewall. This will result in various COBRA/COMMS failures.
Cisco firewall’s are by default configured to close inactive TCP sessions after 1 hour.
Set the OS to send “keep alive” at a value lower than the firewalls TCP close interval on both master and media servers.
Temporary change (Linux):
echo 1800 > /proc/sys/net/ipv4/tcp_keepalive_time
Add net.ipv4.tcp_keepalive_time=1800 to /etc/sysctl.conf and issue command “sysctl -p”
Netbackup 6.5.x 7.x
How to set keep alive on other OS:
DOCUMENTATION: COMM_FAILURE as a consequence of reusing a transport that has been inactive across a firewall