Mirantis Kubernetes Engine

This section describes the alerts for the Mirantis Kubernetes Engine (MKE) cluster.


MKEAPICertExpirationMajor

Severity

Major

Summary

MKE API certificate expires in less than 10 days.

Description

The SSL certificate for MKE API expires in less than 10 days.


MKEAPICertExpirationWarning

Severity

Warning

Summary

MKE API certificate expires in less than 30 days.

Description

The SSL certificate for MKE API expires in less than 30 days.
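
The two certificate alerts above differ only in their threshold, so the tiering can be sketched in a few lines. This is an illustrative sketch, not the actual alert rule: the function name `cert_expiration_alert` is hypothetical, and it reports only the most severe matching tier, whereas the source does not state whether both alerts fire at once.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

# Thresholds taken from the alert summaries above.
MAJOR_THRESHOLD = timedelta(days=10)
WARNING_THRESHOLD = timedelta(days=30)


def cert_expiration_alert(not_after: datetime,
                          now: Optional[datetime] = None) -> Optional[str]:
    """Return the most severe alert name matching the remaining lifetime.

    Hypothetical helper for illustration; `not_after` is the expiry
    timestamp of the certificate (timezone-aware).
    """
    if now is None:
        now = datetime.now(timezone.utc)
    remaining = not_after - now
    if remaining < MAJOR_THRESHOLD:
        return "MKEAPICertExpirationMajor"
    if remaining < WARNING_THRESHOLD:
        return "MKEAPICertExpirationWarning"
    return None
```

A certificate with 20 days left would map to MKEAPICertExpirationWarning, and one with 5 days left to MKEAPICertExpirationMajor.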


MKEAPIDown

Severity

Critical

Summary

MKE API endpoint is down.

Description

The MKE API endpoint on the {{ $labels.node }} node has not been accessible for the last 3 minutes.


MKEAPIOutage

Severity

Critical

Summary

MKE API is down.

Description

The MKE API (port 443) has not been accessible for the last 1 minute.


MKEContainersUnhealthy

Severity

Major

Summary

MKE containers are Unhealthy.

Description

{{ $value }} MKE {{ $labels.name }} containers are Unhealthy.


MKEManagerAPITargetsOutage

Severity

Critical

Summary

MKE manager API cluster Prometheus targets outage.

Description

Prometheus fails to scrape metrics from 2/3 of the MKE manager API nodes (more than 1/10 failed scrapes).


MKEMetricsControllerTargetsOutage

Severity

Critical

Summary

MKE metrics controller cluster Prometheus targets outage.

Description

Prometheus fails to scrape metrics from 2/3 of the MKE metrics controller nodes (more than 1/10 failed scrapes).
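
The two targets outage conditions above combine a per-node ratio (more than 1/10 failed scrapes) with a cluster-wide fraction (2/3 of nodes). The sketch below shows one way to read that arithmetic. It is not the real alert expression: the function names and the input shape are hypothetical, and treating "2/3 of nodes" as "at least two thirds" is an assumption.

```python
from typing import Dict, List


def failing_nodes(scrapes: Dict[str, List[bool]]) -> List[str]:
    """Return nodes where more than 1/10 of the scrapes failed.

    `scrapes` maps a node name to its recent scrape results
    (True = scrape succeeded). Hypothetical input shape.
    """
    failing = []
    for node, results in scrapes.items():
        if not results:
            continue  # no samples, nothing to judge
        failed = results.count(False)
        # Integer form of: failed / len(results) > 1/10
        if 10 * failed > len(results):
            failing.append(node)
    return failing


def is_targets_outage(scrapes: Dict[str, List[bool]]) -> bool:
    """True when at least 2/3 of the nodes are failing (assumed reading)."""
    if not scrapes:
        return False
    # Integer form of: failing / total >= 2/3
    return 3 * len(failing_nodes(scrapes)) >= 2 * len(scrapes)
```

With three manager nodes, the condition in this sketch holds once two of them each exceed the 1/10 failure ratio.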


MKEMetricsEngineTargetDown

Severity

Major

Summary

MKE metrics engine Prometheus target is down.

Description

Prometheus fails to scrape metrics from the MKE metrics engine on the {{ $labels.node }} node (more than 1/10 failed scrapes).


MKEMetricsEngineTargetsOutage

Severity

Critical

Summary

MKE metrics engine Prometheus targets outage.

Description

Prometheus fails to scrape metrics from the MKE metrics engine on all nodes (more than 1/10 failed scrapes).


MKENodeDiskFullCritical

Severity

Critical

Summary

MKE node disk is 95% full.

Description

The {{ $labels.node }} MKE node disk is 95% full.


MKENodeDiskFullWarning

Severity

Warning

Summary

MKE node disk is 85% full.

Description

The {{ $labels.node }} MKE node disk is 85% full.
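
For a quick local check against the two disk thresholds above, the following sketch may help. It is an approximation under stated assumptions: the helper names are hypothetical, "N% full" is read as "at least N%", and the standard library's `shutil.disk_usage` may not compute usage the same way as the metric behind the real alerts.

```python
import shutil
from typing import Optional

# Thresholds taken from the alert summaries above.
CRITICAL_PERCENT = 95.0
WARNING_PERCENT = 85.0


def used_percent(path: str) -> float:
    """Return the used share of the filesystem holding `path`, in percent."""
    usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
    return 100.0 * usage.used / usage.total


def disk_alert_for(percent: float) -> Optional[str]:
    """Map a usage percentage to the most severe matching alert name."""
    if percent >= CRITICAL_PERCENT:
        return "MKENodeDiskFullCritical"
    if percent >= WARNING_PERCENT:
        return "MKENodeDiskFullWarning"
    return None
```

Calling `disk_alert_for(used_percent("/"))` on a node gives a rough idea of which alert, if any, its root filesystem would be close to triggering.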


MKENodeDown

Severity

Critical

Summary

MKE node is down.

Description

The {{ $labels.node }} MKE node is down.