Troubleshoot MKE node states
Nodes enter a variety of states in the course of their lifecycle, including
transitional states such as when a node joins a cluster and when a node is
promoted or demoted. MKE reports the steps of the transition process as they
occur in both the ucp-controller logs and the MKE web UI.
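
For the log side, one option is to read the controller container logs with the Docker CLI. The following is a minimal sketch, assuming shell access to a manager node and that the controller container is named ucp-controller as referenced above. The helper name `controller_logs_for_node`, the 10-minute window, and the example node name are illustrative assumptions, not part of MKE.

```shell
# Hypothetical helper: print recent ucp-controller log lines that mention a node.
# Assumes it runs on an MKE manager node where the Docker CLI is available.
controller_logs_for_node() {
  node_name="$1"
  # --since limits output to recent entries; 2>&1 merges stderr into stdout
  # so that log lines written to either stream are searched.
  docker logs --since 10m ucp-controller 2>&1 | grep -F -- "$node_name"
}

# Usage (node name is an example):
#   controller_logs_for_node worker-01
```

The function returns a non-zero status when no log line mentions the node, which can be useful in scripts.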
To view transitional node states in the MKE web UI:
1. Log in to the MKE web UI.
2. In the left-side navigation panel, navigate to Shared Resources > Nodes. The transitional node state displays in the DETAILS column for each node.
3. Optional. Click the required node. The transitional node state displays in the Overview tab under Cluster Message.
The following table lists all the node states as MKE reports them, along with a description and the expected duration of each:
Message | Description | Expected duration |
---|---|---|
Completing node registration | The node is undergoing the registration process and does not yet appear in the KV node inventory. This is expected to occur when a node first joins the MKE swarm. | 5 - 30 seconds |
heartbeat failure | The node has not contacted any swarm managers in the last 10 seconds. Verify the swarm state using `docker info` on the node. | Until resolved |
Node is being reconfigured | The | 1 - 60 seconds |
Reconfiguration pending | The node is expected to be a manager but the | 1 - 10 seconds |
The | The | 1 - 10 seconds |
Unable to determine node state | The | 1 - 10 seconds |
Unhealthy MKE Controller: node is unreachable | Other manager nodes in the cluster have not received a heartbeat message from the affected node within a predetermined timeout period. This usually indicates that there is either a temporary or permanent interruption in the network link to that manager node. Ensure that the underlying networking infrastructure is operational, and contact support if the symptom persists. | Until resolved |
Unhealthy MKE Controller: unable to reach controller | The controller that the node is currently communicating with is not reachable within a predetermined timeout. Refresh the node listing to determine whether the symptom persists. The symptom appearing intermittently can indicate latency spikes between manager nodes, which can lead to temporary loss in the availability of MKE. Ensure the underlying networking infrastructure is operational and contact support if the symptom persists. | Until resolved |
Unhealthy MKE Controller: Docker Swarm Cluster: Local node <ip> has status | The MCR Engine ID is not unique in the swarm. When a node first joins the cluster, it is added to the node inventory and discovered as | Until resolved |
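
The heartbeat failure entry recommends verifying the swarm state with docker info on the node. A minimal sketch of that check follows, assuming the Docker CLI is available on the affected node. The helper names `swarm_local_state` and `check_swarm_membership` are hypothetical. The `--format` template reads the `Swarm.LocalNodeState` field that docker info reports.

```shell
# Hypothetical helper: print the local swarm state reported by docker info
# (for example: active, inactive, pending, error, locked).
swarm_local_state() {
  docker info --format '{{.Swarm.LocalNodeState}}'
}

# Hypothetical helper: succeed only when the node reports an active swarm state.
check_swarm_membership() {
  state="$(swarm_local_state)"
  if [ "$state" = "active" ]; then
    echo "swarm state: active"
  else
    # Any other value, or an empty result, warrants a closer look at the node.
    echo "swarm state: ${state:-unknown} (investigate)"
    return 1
  fi
}

# Usage, run on the affected node:
#   check_swarm_membership
```

A non-zero exit status only shows that the node does not consider itself an active swarm member. It does not identify the cause, so network connectivity to the manager nodes still needs to be checked separately.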