Alertmanager

This section describes the alerts for the Alertmanager service.


AlertmanagerTargetDown

Available since 17.0.0, 16.0.0, and 14.1.0

Severity

Major

Summary

Prometheus Alertmanager target down.

Description

Prometheus fails to scrape metrics from the {{ $labels.pod }} Pod on the {{ $labels.node }} node.

AlertmanagerClusterTargetsOutage

Replaced with AlertmanagerTargetDown in 17.0.0, 16.0.0, and 14.1.0

Severity

Major

Summary

Prometheus Alertmanager targets outage.

Description

Prometheus fails to scrape metrics from all Alertmanager endpoints (more than 1/10 failed scrapes).

AlertmanagerFailedReload

Severity

Warning

Summary

Failure to reload Alertmanager configuration.

Description

Reloading the Alertmanager configuration has failed.

AlertmanagerMembersInconsistent

Severity

Major

Summary

Alertmanager cluster members cannot be found.

Description

Alertmanager has not found all other members of the cluster.

AlertmanagerNotificationFailureWarning

Severity

Warning

Summary

Alertmanager notifications fail.

Description

An average of {{ $value }} Alertmanager {{ $labels.integration }} notifications fail for 2 minutes.

AlertmanagerAlertsInvalidWarning

Severity

Warning

Summary

Alertmanager alerts are invalid.

Description

An average of {{ $value }} Alertmanager {{ $labels.integration }} alerts are invalid for 2 minutes.