You might need to troubleshoot VMware Fault Tolerance by determining the reason for unexpected virtual machine failovers. This type of failover is when your Primary or Secondary VM has failed over and redundancy is reestablished, even though its ESX/ESXi host has not crashed. In such cases, virtual machine execution is not interrupted, but redundancy is temporarily lost.

This problem can arise when access to storage is slow or completely down for one of the hosts. When this occurs there are many storage errors listed in the VMkernel log. To resolve this problem you must address your storage-related issues.

If the logging NIC is not functioning or connections to other hosts through that NIC are down, this can trigger a fault tolerant virtual machine to be failed over so that redundancy can be reestablished. To avoid this problem, dedicate a separate NIC each for VMotion and FT logging traffic and perform VMotion migrations only when the virtual machines are less active.

This can happen because of too many fault tolerant virtual machines being on a host. To resolve this problem, more broadly distribute pairs of fault tolerant virtual machines across different hosts.

If the VMotion migration of a fault tolerant virtual machine fails, the virtual machine might need to be failed over. Usually, this occurs when the virtual machine is too active for the migration to be completed with only minimal disruption to the activity. To avoid this problem, perform VMotion migrations only when the virtual machines are less active.

When a number of file system locking operations, virtual machine power ons, power offs, or VMotion migrations occur on a single VMFS volume, this can trigger fault tolerant virtual machines to be failed over. A symptom that this might be occurring is receiving many warnings about SCSI reservations in the VMkernel log. To resolve this problem, reduce the number of file system operations or ensure that the fault tolerant virtual machine is on a VMFS volume that does not have an abundance of other virtual machines that are regularly being powered on, powered off, or migrated using VMotion.

Check whether or not your /(root) or /vmfs/<datasource> file systems have available space. These file systems can become full for many reasons, and a lack of space might prevent you from being able to start a new Secondary VM.