Calico

This section describes the alerts for Calico.


CalicoDataplaneFailuresHigh

Severity

Warning

Summary

Data plane updates fail.

Description

The Felix daemon on the {{ $labels.node }} node has detected {{ $value }} data plane update failures within the last hour.


CalicoDataplaneAddressMsgBatchSizeHigh

Severity

Warning

Summary

Interface address messages in a batch exceed 5.

Description

The Felix daemon on the {{ $labels.node }} node has detected a high average value of {{ $value }} data plane interface address messages in batches.


CalicoDataplaneIfaceMsgBatchSizeHigh

Severity

Warning

Summary

Interface state messages in a batch exceed 5.

Description

The Felix daemon on the {{ $labels.node }} node has detected a high average value of {{ $value }} data plane interface state messages in batches.
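
The two batch-size alerts above compare an average against the threshold of 5 given in their summaries. The following is a minimal sketch of that arithmetic, assuming the average is the total number of messages divided by the number of batches, as it would be for a Prometheus summary metric with `_sum` and `_count` series. The function names are hypothetical, and the actual alert expression is not shown in this section.

```python
from typing import Optional

# Threshold taken from the alert summaries ("exceed 5").
BATCH_SIZE_THRESHOLD = 5


def average_batch_size(total_messages: float, batch_count: float) -> Optional[float]:
    """Return the mean number of messages per batch, or None if no batches were seen."""
    if batch_count <= 0:
        return None
    return total_messages / batch_count


def batch_size_alert_fires(total_messages: float, batch_count: float) -> bool:
    """Assumed firing condition: the average batch size is strictly above the threshold."""
    avg = average_batch_size(total_messages, batch_count)
    return avg is not None and avg > BATCH_SIZE_THRESHOLD
```

For example, 120 messages delivered in 20 batches give an average of 6, which is above the threshold, while 100 messages in 20 batches give exactly 5, which is not.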


CalicoIPsetErrorsHigh

Severity

Warning

Summary

ipset commands fail.

Description

The Felix daemon on the {{ $labels.node }} node has detected {{ $value }} ipset command failures within the last hour.


CalicoIptablesSaveErrorsHigh

Severity

Warning

Summary

iptables-save fails.

Description

The Felix daemon on the {{ $labels.node }} node has detected {{ $value }} iptables-save errors within the last hour.


CalicoIptablesRestoreErrorsHigh

Severity

Warning

Summary

iptables-restore fails.

Description

The Felix daemon on the {{ $labels.node }} node has detected {{ $value }} iptables-restore errors within the last hour.
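
The data plane, ipset, iptables-save, and iptables-restore alerts above all report a number of failures within the last hour. The sketch below illustrates how such a value can be derived from raw counter samples, assuming the underlying Felix metrics are monotonically increasing counters that restart from zero when the daemon restarts. It is a simplified illustration and omits the extrapolation that the Prometheus `increase()` function applies. The function name is hypothetical.

```python
from typing import Sequence


def counter_increase(samples: Sequence[float]) -> float:
    """Sum the growth of a counter across consecutive samples in a time window.

    A drop between two samples is treated as a counter reset, so the new
    value is counted in full (assumption: the counter restarted from zero).
    """
    increase = 0.0
    for prev, cur in zip(samples, samples[1:]):
        if cur >= prev:
            increase += cur - prev
        else:
            increase += cur
    return increase
```

With the samples `[10, 12, 3, 4]` taken over one hour, the sketch counts 2 failures before the reset and 4 after it, for a total of 6.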


CalicoTargetDown

Severity

Major

Summary

Calico Prometheus target is down.

Description

Prometheus fails to scrape metrics from the Calico pod on the {{ $labels.node }} node (more than 1 in 10 scrapes have failed).


CalicoTargetsOutage

Severity

Critical

Summary

Calico Prometheus targets outage.

Description

Prometheus fails to scrape metrics from all Calico pods (more than 1 in 10 scrapes have failed).
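
The 1/10 failed-scrape threshold used by the two target alerts above can be illustrated as follows. This is a sketch under stated assumptions: each scrape result is represented as a Prometheus `up` sample (1 for success, 0 for failure), a target counts as down when strictly more than one tenth of its recent scrapes failed, and the outage alert requires every Calico target to be down. The function names and the input layout are hypothetical.

```python
from typing import Mapping, Sequence

# Threshold taken from the alert descriptions (more than 1/10 failed scrapes).
FAILED_SCRAPE_RATIO = 0.1


def failed_ratio(up_samples: Sequence[int]) -> float:
    """Return the fraction of scrapes that failed (up == 0); 0.0 if no samples."""
    if not up_samples:
        return 0.0
    failed = sum(1 for sample in up_samples if sample == 0)
    return failed / len(up_samples)


def target_down(up_samples: Sequence[int]) -> bool:
    """Assumed CalicoTargetDown condition for a single target."""
    return failed_ratio(up_samples) > FAILED_SCRAPE_RATIO


def targets_outage(samples_by_target: Mapping[str, Sequence[int]]) -> bool:
    """Assumed CalicoTargetsOutage condition: every known target is down."""
    return bool(samples_by_target) and all(
        target_down(samples) for samples in samples_by_target.values()
    )
```

Under these assumptions, a single failed scrape out of ten does not mark a target as down, while two failed scrapes out of ten do.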