If VEMKD, SD, and SSM are running on the master host when failover happens, the master candidate will take over within two minutes and start VEMKD, SD and SSM within another two minutes to complete the SSM recovery. If the SSM is running on another management host when failover occurs, the SSM recovery should be complete within two minutes.If the client is unable to reconnect to the SSM after failover, it may be caused by the setting of the client reconnection environment variables. Each of the following environment variables plays a role in the client reconnection process:
Verify that the environment variables are properly configured, as follows:
Check that PLATCOMMDRV_TCP_KEEPALIVE_TIME + (SOAM_RELOCATED_RECONNECTION_RETRY_LIMIT * SOAM_RELOCATED_RECONNECTION_RETRY_INTERVAL) + (SOAM_RECONNECTION_RETRY_LIMIT * SOAM_RECONNECTION_RETRY_INTERVAL) > failover time (for VEMKD, SD, and SSM).
Note that the failover time will be less than four minutes for an SSM running on a master host and less than two minutes for an SSM running on a management host.
If the calculated time of the environment variables is less than the failover time, the client cannot reconnect to the SSM and the client will exit after the calculated time. The client detects the connection is broken (notified by OS) after the PLATCOMMDRV_TCP_KEEPALIVE_TIME interval, which then triggers the reconnection process.