General alerts

This section describes the available general alerts.


TargetDown

Severity: Critical

Summary: {{ $labels.job }} targets are down.

Description: {{ $value | printf "%.2f" }}% of {{ $labels.job }} targets are down.
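To illustrate how the TargetDown description is rendered, the following minimal Python sketch computes the percentage of down targets per job and formats it with two decimal places, as `printf "%.2f"` does in the template. The `percent_down` and `render_description` helpers and the sample data are hypothetical. The sketch does not reproduce the actual alert expression, which this section does not specify.

```python
from collections import defaultdict


def percent_down(targets):
    """Return {job: percentage of targets that are down}.

    `targets` is an iterable of (job, is_up) pairs, a hypothetical
    stand-in for the per-target `up` metric recorded by Prometheus.
    """
    total = defaultdict(int)
    down = defaultdict(int)
    for job, is_up in targets:
        total[job] += 1
        if not is_up:
            down[job] += 1
    return {job: 100.0 * down[job] / total[job] for job in total}


def render_description(job, value):
    # Mirrors the annotation template:
    # {{ $value | printf "%.2f" }}% of {{ $labels.job }} targets are down.
    return "%.2f%% of %s targets are down." % (value, job)


# Hypothetical sample: one of three kubelet targets is down.
sample = [("kubelet", True), ("kubelet", False), ("kubelet", True)]
for job, value in percent_down(sample).items():
    print(render_description(job, value))
# prints "33.33% of kubelet targets are down."
```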


TargetFlapping

Severity: Critical

Summary: The {{ $labels.job }} target is flapping.

Description: The {{ $labels.job }}/{{ $labels.instance }} target has been changing its state between UP and DOWN for 30 minutes, at least once within the 15-minute time range.
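The flapping condition can be sketched in Python as follows. This is an illustration under stated assumptions, not the actual rule: it assumes that the alert fires when every evaluation during the 30-minute period sees at least one state change in its 15-minute range. The function names and the sample data are hypothetical.

```python
def count_state_changes(samples):
    """Count how many times consecutive samples differ.

    `samples` is a chronological list of 1 (UP) and 0 (DOWN) values.
    """
    return sum(1 for prev, cur in zip(samples, samples[1:]) if prev != cur)


def is_flapping(windows):
    """Return True if every window contains at least one state change.

    `windows` is a hypothetical list of sample lists, one per evaluation
    of the 15-minute range during the 30-minute period (an assumption).
    """
    return bool(windows) and all(count_state_changes(w) >= 1 for w in windows)


# Hypothetical samples for a single target.
stable = [1, 1, 1, 1]
flappy = [1, 0, 1, 1]

print(is_flapping([flappy, [0, 1, 1, 0]]))  # every window has a change
print(is_flapping([flappy, stable]))        # the target stabilized
```

The first call reports flapping because both windows contain a state change. The second does not, because the target stayed UP throughout the last window.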


NodeDown

Severity: Critical

Summary: The {{ $labels.node }} node is down.

Description: The {{ $labels.node }} node is down. Kubernetes treats the node as Not Ready, and kubelet is not accessible from Prometheus.


Watchdog

Severity: None

Summary: Watchdog alert that is always firing.

Description: This alert verifies that the entire alerting pipeline is functional. It should always be firing in Alertmanager against a receiver. Some integrations with notification mechanisms, for example, the DeadMansSnitch integration in PagerDuty, can send a notification when this alert is not firing.
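The receiving side of such an integration works as a dead man's switch: it expects a regular heartbeat and raises an alarm only when the heartbeat stops. The following minimal Python sketch shows the idea. The `DeadMansSwitch` class, the 300-second timeout, and the fake clock are hypothetical and do not represent the API of any particular notification service.

```python
import time


class DeadMansSwitch:
    """Report a problem when an expected heartbeat stops arriving."""

    def __init__(self, timeout_seconds, now=time.monotonic):
        self.timeout_seconds = timeout_seconds
        self._now = now  # injectable clock, which keeps the demo deterministic
        self._last_heartbeat = now()

    def heartbeat(self):
        # Called each time a Watchdog notification is received.
        self._last_heartbeat = self._now()

    def is_triggered(self):
        # True when no heartbeat has arrived within the timeout.
        return self._now() - self._last_heartbeat > self.timeout_seconds


# Demo with a fake clock instead of real waiting.
clock = [0.0]
switch = DeadMansSwitch(timeout_seconds=300, now=lambda: clock[0])

clock[0] = 200.0
switch.heartbeat()            # Watchdog notification arrives
clock[0] = 450.0
print(switch.is_triggered())  # 250 seconds since the last heartbeat

clock[0] = 501.0
print(switch.is_triggered())  # 301 seconds since the last heartbeat
```

In this sketch, the first check stays quiet because the last heartbeat is 250 seconds old, and the second check triggers because the heartbeat is 301 seconds old, which exceeds the assumed timeout.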