Mirantis Kubernetes Engine
This section describes the alerts for the Mirantis Kubernetes Engine (MKE) cluster, including the Docker Swarm service.
For troubleshooting guidelines, see Troubleshoot Mirantis Kubernetes Engine alerts.
DockerSwarmNetworkUnhealthy
Severity | Warning |
---|---|
Summary | Docker Swarm network unhealthy. |
Description | The Note For the |
DockerSwarmNodeFlapping
Severity | Major |
---|---|
Summary | Docker Swarm node is flapping. |
Description | The |
DockerSwarmServiceReplicasDown
Severity | Major |
---|---|
Summary | Docker Swarm replica is down. |
Description | The |
DockerSwarmServiceReplicasFlapping
Severity | Major |
---|---|
Summary | Docker Swarm service is flapping. |
Description | The |
DockerSwarmServiceReplicasOutage
Severity | Critical |
---|---|
Summary | Docker Swarm service outage. |
Description | All |
MKEAPICertExpirationHigh
Severity | Critical |
---|---|
Summary | MKE API certificate expires on |
Description | The SSL certificate for MKE API expires on |
MKEAPICertExpirationMedium
Severity | Major |
---|---|
Summary | MKE API certificate expires on |
Description | The SSL certificate for MKE API expires on |
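
Both certificate alerts above depend on how much time remains before the MKE API certificate expires. The following is a minimal sketch of that calculation in Python, not the rule the alerts actually use. The function name `days_until_expiry` is hypothetical, and the input is assumed to be a `notAfter` string in the format that Python's `ssl` module reports for peer certificates. The alert thresholds are not stated in this section, so none are encoded here.

```python
import ssl
from datetime import datetime, timezone


def days_until_expiry(not_after: str, now: datetime) -> float:
    """Return the number of days between `now` and a certificate notAfter time.

    `not_after` is assumed to look like "Jan  5 09:34:43 2018 GMT",
    the format used by the ssl module for certificate dates.
    """
    # cert_time_to_seconds converts the notAfter string to seconds since the epoch (UTC).
    expires_at = ssl.cert_time_to_seconds(not_after)
    return (expires_at - now.timestamp()) / 86400


# Illustrative values only: a fixed "now" keeps the example deterministic.
reference_time = datetime(2018, 1, 1, tzinfo=timezone.utc)
remaining = days_until_expiry("Jan  5 09:34:43 2018 GMT", reference_time)
print(f"Certificate expires in {remaining:.1f} days")
```

A negative result means that the certificate has already expired at the reference time.
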
MKEAPIDown
Severity | Critical |
---|---|
Summary | MKE API endpoint is down. |
Description | The MKE API endpoint on the |
MKEAPIOutage
Severity | Critical |
---|---|
Summary | MKE API is down. |
Description | The MKE API (port 443) has not been accessible for the last 1 minute. |
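
When this alert fires, a quick manual check is whether port 443 accepts TCP connections at all. The sketch below shows one way to test that from Python. The helper name `is_port_open` and the default timeout are assumptions made for illustration, and the check covers only TCP reachability, not whether the API responds correctly.

```python
import socket


def is_port_open(host: str, port: int = 443, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False
```

For example, `is_port_open("mke.example.com")` would probe port 443 on a placeholder host name. Substitute the address of your own MKE endpoint.
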
MKEContainersUnhealthy
Severity | Major |
---|---|
Summary | MKE containers are |
Description | |
MKEManagerAPITargetsOutage
Severity | Critical |
---|---|
Summary | MKE manager API cluster Prometheus targets outage. |
Description | Prometheus fails to scrape metrics from 2/3 of MKE manager API nodes. |
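
The 2/3 condition can be illustrated with a short Python sketch. It assumes that each target is represented by the value of the Prometheus `up` metric (1 when the last scrape succeeded, 0 when it failed) and that the alert fires when at least two thirds of targets are down. The function name `manager_targets_outage` and the inclusive comparison are assumptions, not the actual alert expression.

```python
from fractions import Fraction


def manager_targets_outage(up_values: list[int],
                           threshold: Fraction = Fraction(2, 3)) -> bool:
    """Return True if the share of failed scrape targets reaches `threshold`.

    `up_values` holds one entry per target: 1 for a successful scrape, 0 for a failure.
    """
    if not up_values:
        # No targets known: nothing to evaluate in this sketch.
        return False
    down = sum(1 for value in up_values if value == 0)
    # Fraction avoids floating-point rounding when comparing against 2/3.
    return Fraction(down, len(up_values)) >= threshold


print(manager_targets_outage([0, 0, 1]))  # two of three manager targets down
print(manager_targets_outage([0, 1, 1]))  # one of three manager targets down
```

Exact fractions are used so that a cluster with exactly two of three targets down compares equal to the threshold.
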
MKEMetricsControllerTargetsOutage
Severity | Critical |
---|---|
Summary | MKE metrics controller Prometheus targets outage. |
Description | Prometheus fails to scrape metrics from all MKE metrics controller endpoints. |
MKEMetricsEngineTargetDown
Severity | Major |
---|---|
Summary | MKE metrics engine Prometheus target is down. |
Description | Prometheus fails to scrape metrics from the MKE metrics engine on the |
MKEMetricsEngineTargetsOutage
Severity | Critical |
---|---|
Summary | MKE metrics engine Prometheus targets outage. |
Description | Prometheus fails to scrape metrics from the MKE metrics engine on all nodes. |
MKENodeDiskFullCritical
Severity | Critical |
---|---|
Summary | MKE node disk is 95% full. |
Description | The |
MKENodeDiskFullWarning
Severity | Warning |
---|---|
Summary | MKE node disk is 85% full. |
Description | The |
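
The two disk alerts above differ only in their threshold: 85% for the warning and 95% for the critical alert. The Python sketch below maps a usage percentage to those two levels. The function names, the inclusive comparisons, and the use of `shutil.disk_usage` to measure a local path are assumptions for illustration. This section does not state which file system the alerts monitor.

```python
import shutil

WARNING_PERCENT = 85   # MKENodeDiskFullWarning threshold from the summary above
CRITICAL_PERCENT = 95  # MKENodeDiskFullCritical threshold from the summary above


def used_percent(path: str) -> float:
    """Return the percentage of used space on the file system that contains `path`."""
    usage = shutil.disk_usage(path)
    return usage.used / usage.total * 100


def classify_disk_usage(percent: float) -> str:
    """Map a usage percentage to "critical", "warning", or "ok"."""
    if percent >= CRITICAL_PERCENT:
        return "critical"
    if percent >= WARNING_PERCENT:
        return "warning"
    return "ok"


print(classify_disk_usage(used_percent("/")))
```

The critical check runs first so that a disk at 96% is reported once, at the higher severity.
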
MKENodeDown
Severity | Critical |
---|---|
Summary | MKE node is down. |
Description | The |