ZooKeeper

This section lists the alerts for ZooKeeper.


ZooKeeperClusterTargetDown

Since 23.3 to replace ZooKeeperClusterTargetsOutage

Severity

Major

Summary

ZooKeeper cluster Prometheus targets outage.

Description

Prometheus fails to scrape metrics from the {{ $labels.pod }} Pod of the {{ $labels.cluster }} cluster on the {{ $labels.node }} node.

ZooKeeperClusterTargetsOutage

Replaced with ZooKeeperClusterTargetDown in 23.3

Severity

Major

Summary

ZooKeeper cluster Prometheus targets outage.

Description

Prometheus fails to scrape metrics from 2/3 of the {{ $labels.cluster }} cluster endpoints (more than 1/10 failed scrapes).

ZooKeeperMissingFollowers

Severity

Warning

Summary

ZooKeeper cluster has missing followers.

Description

The {{ $labels.cluster }} ZooKeeper cluster in the {{ $labels.namespace }} namespace has missing follower servers.

ZooKeeperRequestOverload

Severity

Warning

Summary

ZooKeeper server request overload.

Description

The {{ $labels.namespace }}/ {{ $labels.pod }} ZooKeeper Pod in the {{ $labels.cluster }} cluster is not keeping up with request handling.

ZooKeeperRunningOutOfFileDescriptors

Severity

Warning

Summary

ZooKeeper server is running out of file descriptors.

Description

The {{ $labels.namespace }}/{{ $labels.pod }} ZooKeeper Pod in the {{ $labels.cluster }} cluster is using at least 85% of available file descriptors.

ZooKeeperSyncOverload

Severity

Warning

Summary

ZooKeeper leader synchronization overload.

Description

The ZooKeeper leader in the {{ $labels.cluster }} cluster in the {{ $labels.namespace }} namespace is not keeping up with synchronization.