Identifying issues with and setting up name resolution on ESX/ESXi Server

Symptoms

  • Unable to configure VMware High Availability (HA).
  • Configuring VMware HA fails at about 97% completion.
  • Configuring VMware HA fails between 85 to 91%.
  • Reconfiguring VMware HA fails.
  • Adding an ESX host to a cluster fails.
  • Enabling VMware HA in a cluster fails.
  • These errors are generated when attempting to configure VMware HA:

    • An error occurred during configuration of the HA agent on the host
    • HA agent on <ESX> in cluster <cluster> in <Datacenter> has an error
    • Error: Cannot complete the configuration of the HA agent on the host. Other HA configuration error
    • cmd addnode failed for primary node:/opt/vmware/aam/bin/ft_startup failed

  • Host fails to remediate or exit Maintenance Mode.
  • You see the error:

    Operation timed out

  • In the Tasks view of VirtualCenter/vCenter Server, you see the error:

    There are errors during the remediation operation
    Failed to find host


  • Update Manager is unable to scan ESX/ESXi hosts.

Purpose

The errors listed in the Symptoms section are generated as a result of name resolution issues.
 
This article guides you through identifying issues with name resolution which can seriously impact the normal operation of ESX/ESXi, particularly in HA clustered environments. The article also details correctly configuring host files when there is no DNS server in the environment, or if the DNS server is incorrectly configured.

Resolution

Identifying issues


There is a problem with name resolution if any of these tests fail. ESX/ESXi hosts must be able to find each other by:
  • IP address
  • Short Name
  • Fully Qualified Domain Name (FQDN)

If an issue with name resolution has been identified it must be resolved either on the DNS server or by using hosts files.

Note: After making any changes to DNS or hosts files, ensure that you delete the file /etc/FT_HOSTS (or/etc/opt/vmware/aam/FT_HOSTS and /var/run/vmware/aam/FT_HOSTS) on all affected ESX/ESXi hosts.

  1. Verify that all ESX/ESXi hosts can ping each other by short name. All ESX/ESXi hosts in the environment must be able to ping each other by using short name only. For more information, see Testing network connectivity with the ping command (1003486).

  2. Verify that all ESX/ESXi hosts can nslookup each other.

    Use nslookup (reverse IP address lookup) to verify that the appropriate name is associated to a particular IP address.

    For example:

    [root@esx-server-1 /]# nslookup 192.168.0.5
    Server: 192.168.0.7
    Address: 192.168.0.7#53

    5.0.168.192.in-addr.arpa name = esx-server-2.domain.com


  3. Verify that all ESX/ESXi hosts can reverse nslookup each other.

    Use nslookup (forward IP address lookup) to verify that the appropriate IP address is associated to a particular name.

    For example:

    [root@esx-server-1 /]# nslookup esx-server-2
    Server: 192.168.0.7
    Address: 192.168.0.7#53

    Name: esx-server-2.domain.com
    Address: 192.168.0.5

Additional checks for VMware High Availability (HA) environments - ESX/ESXi 3.x and 4.x Only


Perform these additional checks for VMware High Availability environments with ESX/ESXi versions 3.x and 4.x (this does not apply to ESXi 5.0):

  1. Verify that the reported host name is in lowercase when you run:

    [root@esx-server-1 /]# hostname
    [root@esx-server-1 /]# hostname -s


  2. Verify that all host names in /etc/hosts are in lowercase.
  3. Verify that the search domain in /etc/resolv.conf is in lowercase.
  4. Verify that the host name in /etc/sysconfig/network is a fully qualified domain name, and is lowercase.
  5. Verify that the host name in /etc/vmware/esx.conf is a fully qualified domain name, and is lowercase.
  6. If your ESX/ESXi hosts are registered in DNS, verify that your system host name is lowercase. Run this command to ensure the FQDN is resolvable and all lowercase:

    nslookup short_hostname

  7. Verify that all primary Service Consoles in the VMware HA cluster have the same name.
  8. Verify that all primary Service Consoles are in the same IP subnet.

Note: If a VMotion VMkernel port is on the same vSwitch as the primary Service Console, or if a host has multiple Service Consoles, see Incompatible HA Networks appearing when attempting to configure HA (High Availability) (1006541).

Configuring hosts files on ESX/ESXi


The hosts file on the ESX host is located at /etc/hosts.

Open the file for editing using a text editor such as nano or vi.

Example hosts file:

# Do not remove the following line, or various programs
# that require network functionality will fail.
127.0.0.1 localhost.localdomain localhost

# Any line beginning with a pound sign is a comment and will not be read.
192.168.0.5 esx-server-1.domain.com esx-server-1
192.168.0.6 esx-server-2.domain.com esx-server-2

# The VMware Virtual Center Server must also have an entry here
192.168.0.20 virtualcenter.domain.com virtualcenter

Notes:
  • localhost must always be present in the hosts file. Do not modify or remove the entry for localhost.
  • The hosts file must be identical on all ESX/ESXi hosts in the cluster.
  • There must be an entry for every ESX/ESXi host in the cluster.
  • Every host must have an IP address, Fully Qualified Domain Name (FQDN), and short name.
  • The hosts file is case sensitive. Be sure to use lowercase throughout the environment.

    Note: This issue has been resolved in vCenter 2.5 U3 and above. For more information, see VMware HA Can Be Configured on a Cluster Regardless of the Letter Case of the ESX Server Host Names in the Resolved Issues section of the VirtualCenter 2.5 Update 3 Release Notes.

Based on VMware KB 1003735
  • 1 Users Found This Useful
Was this answer helpful?

Related Articles

Hardware and firmware requirements for 64-bit guest operating systems

PurposeThis article explains the host machine hardware and firmware requirements for installing...

Logging in to the vCenter Server 5.0 Web Client fails with the error: unable to connect to vCenter Inventory Service

DetailsAfter upgrading from vCenter Server 4.1 to 5.0, you experience these symptoms:Cannot log...

Multiple network entries in vCenter Server 5.0.x after migrating virtual machines from a virtual switch to a virtual distributed switch

SymptomsAfter migrating virtual machines from a virtual switch to a virtual Distributed...

Minimum requirements for the VMware vCenter Server 5.x Appliance

PurposeIf you are using the VMware vCenter Server Appliance, beginning with vSphere 5.0 you can...