Cassandra

This section lists the alerts for Cassandra.


CassandraAuthFailures

Severity

Warning

Summary

Cassandra authentication failures.

Description

The {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the {{ $labels.cassandra_cluster }} cluster reports an increased number of authentication failures.
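
The `{{ $labels.<name> }}` and `{{ $value }}` placeholders used throughout this section are filled in from the labels and value of the firing alert. The following is a minimal Python sketch of that substitution, not the actual templating engine. The `render` helper and the label values are hypothetical and serve only as an illustration.

```python
import re

# Matches "{{ $labels.<name> }}" and "{{ $value }}" placeholders.
PLACEHOLDER = re.compile(r"\{\{\s*\$(labels\.(\w+)|value)\s*\}\}")


def render(template: str, labels: dict, value: object = "") -> str:
    """Substitute label and value placeholders in an alert annotation."""

    def substitute(match: re.Match) -> str:
        if match.group(1) == "value":
            return str(value)
        # Assumption of this sketch: a missing label renders as an empty string.
        return labels.get(match.group(2), "")

    return PLACEHOLDER.sub(substitute, template)


description = (
    "The {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the "
    "{{ $labels.cassandra_cluster }} cluster reports an increased number "
    "of authentication failures."
)

# Hypothetical label values, for illustration only.
labels = {
    "namespace": "example-ns",
    "pod": "cassandra-0",
    "cassandra_cluster": "example-cluster",
}

print(render(description, labels))
```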

CassandraCacheHitRateTooLow

Severity

Major

Summary

Cassandra cache hit rate is too low.

Description

The average hit rate for the {{ $labels.cache }} cache in the {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the {{ $labels.cassandra_cluster }} cluster is below 85%.
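
The 85% condition can be sketched as follows. This is an illustration under stated assumptions, not the actual alert rule: the function name, the input shape (a list of sampled hit rates between 0 and 1), and the behavior on missing data are all hypothetical.

```python
HIT_RATE_THRESHOLD = 0.85  # 85%, taken from the alert description


def cache_hit_rate_too_low(hit_rates: list) -> bool:
    """Return True when the mean of the sampled hit rates is below 85%."""
    if not hit_rates:
        # Assumption of this sketch: no data means the alert does not fire.
        return False
    return sum(hit_rates) / len(hit_rates) < HIT_RATE_THRESHOLD


print(cache_hit_rate_too_low([0.9, 0.7]))    # mean is 0.8
print(cache_hit_rate_too_low([0.9, 0.95]))   # mean is 0.925
```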

CassandraClientRequestFailure

Severity

Major

Summary

Cassandra client {{ $labels.operation }} request failure.

Description

The {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the {{ $labels.cassandra_cluster }} cluster reports an increased number of {{ $labels.operation }} operation failures. A failure is a non-timeout exception.

CassandraClientRequestUnavailable

Severity

Critical

Summary

Cassandra client {{ $labels.operation }} request is unavailable.

Description

The {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the {{ $labels.cassandra_cluster }} cluster reports an increased number of {{ $labels.operation }} operations ending with UnavailableException. There are not enough replicas alive to perform the {{ $labels.operation }} query with the requested consistency level.

CassandraClusterTargetDown

Available since 23.3 to replace CassandraClusterTargetsOutage

Severity

Critical

Summary

Cassandra cluster target down.

Description

Prometheus fails to scrape metrics from the {{ $labels.pod }} Pod of the {{ $labels.cluster }} cluster on the {{ $labels.node }} node.

CassandraClusterTargetsOutage

Replaced by CassandraClusterTargetDown in 23.3

Severity

Critical

Summary

Cassandra cluster Prometheus targets outage.

Description

Prometheus fails to scrape metrics from 2/3 of the {{ $labels.cluster }} cluster endpoints (more than 1/10 failed scrapes).
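
One possible reading of this condition is sketched below. It assumes that an endpoint counts as failing when more than 1/10 of its scrapes failed, and that the alert fires when at least 2/3 of the endpoints are failing. Both the interpretation and the `targets_outage` helper are assumptions made for illustration, not the actual rule.

```python
def targets_outage(failed_scrape_ratio: dict) -> bool:
    """Return True when at least 2/3 of the endpoints are failing.

    failed_scrape_ratio maps an endpoint name to the fraction of its
    scrapes that failed (0.0 to 1.0).
    """
    total = len(failed_scrape_ratio)
    if total == 0:
        # Assumption of this sketch: no endpoints means no outage.
        return False
    # An endpoint is failing when more than 1/10 of its scrapes failed.
    failing = sum(1 for ratio in failed_scrape_ratio.values() if ratio > 0.10)
    # Integer arithmetic avoids floating-point rounding at exactly 2/3.
    return failing * 3 >= total * 2


# Hypothetical endpoint names, for illustration only.
print(targets_outage({"node-a": 0.5, "node-b": 0.2, "node-c": 0.0}))
print(targets_outage({"node-a": 0.5, "node-b": 0.05, "node-c": 0.0}))
```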

CassandraCommitlogTasksPending

Severity

Warning

Summary

Cassandra commitlog has too many pending tasks.

Description

The commitlog in the {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the {{ $labels.cassandra_cluster }} cluster reached 15 pending tasks.

CassandraCompactionExecutorTasksBlocked

Severity

Warning

Summary

Cassandra compaction executor tasks are blocked.

Description

The {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the {{ $labels.cassandra_cluster }} cluster reports that {{ $value }} compaction executor tasks are blocked.

CassandraCompactionTasksPending

Severity

Warning

Summary

Cassandra has too many pending compactions.

Description

The pending compaction tasks in the {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the {{ $labels.cassandra_cluster }} cluster reached the threshold of 100 on average as measured over 30 minutes. This may occur due to insufficient cluster I/O capacity.
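
The 30-minute average can be sketched as follows. This is a simplified illustration, not the actual rule: the function name and the sample format are hypothetical, and treating "reached" as "greater than or equal to" is an assumption.

```python
from datetime import datetime, timedelta

WINDOW = timedelta(minutes=30)  # averaging window from the description
THRESHOLD = 100                 # pending compaction tasks, from the description


def pending_compactions_too_high(samples: list, now: datetime) -> bool:
    """Return True when the mean of the samples in the last 30 minutes is >= 100.

    samples is a list of (timestamp, pending_tasks) tuples.
    """
    recent = [value for ts, value in samples if now - WINDOW <= ts <= now]
    if not recent:
        # Assumption of this sketch: no recent data means no alert.
        return False
    return sum(recent) / len(recent) >= THRESHOLD


now = datetime(2024, 1, 1, 12, 0)
samples = [
    (now - timedelta(minutes=45), 500),  # outside the window, ignored
    (now - timedelta(minutes=20), 90),
    (now - timedelta(minutes=10), 110),
    (now, 120),
]
print(pending_compactions_too_high(samples, now))
```

A short spike does not trigger the condition on its own; only a sustained backlog raises the windowed average to the threshold.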

CassandraConnectionTimeouts

Severity

Critical

Summary

Cassandra connection timeouts.

Description

The {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the {{ $labels.cassandra_cluster }} cluster reports an increased number of connection timeouts between nodes.

CassandraFlushWriterTasksBlocked

Severity

Warning

Summary

Cassandra flush writer tasks are blocked.

Description

The {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the {{ $labels.cassandra_cluster }} cluster reports that {{ $value }} flush writer tasks are blocked.

CassandraHintsTooMany

Severity

Major

Summary

Cassandra has too many hints.

Description

The {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the {{ $labels.cassandra_cluster }} cluster reports an increased number of hints. Replica nodes are unavailable to accept mutations because of a failure or maintenance.

CassandraRepairTasksBlocked

Severity

Warning

Summary

Cassandra repair tasks are blocked.

Description

The {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the {{ $labels.cassandra_cluster }} cluster reports that {{ $value }} repair tasks are blocked.

CassandraStorageExceptions

Severity

Critical

Summary

Cassandra storage exceptions.

Description

The {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the {{ $labels.cassandra_cluster }} cluster reports an increased number of storage exceptions.

CassandraTombstonesTooManyCritical

Available since MOSK 22.2

Severity

Critical

Summary

Cassandra scanned 50000 tombstones.

Description

The {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the {{ $labels.cassandra_cluster }} cluster scanned {{ $value }} tombstones in 99% of read queries.

CassandraTombstonesTooManyMajor

Severity

Major

Summary

Cassandra scanned 25000 tombstones.

Description

The {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the {{ $labels.cassandra_cluster }} cluster scanned {{ $value }} tombstones in 99% of read queries.

CassandraTombstonesTooManyWarning

Severity

Warning

Summary

Cassandra scanned 10000 tombstones.

Description

The {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the {{ $labels.cassandra_cluster }} cluster scanned {{ $value }} tombstones in 99% of read queries.
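
The three tombstone alerts form a tiered set of thresholds on the same value. The sketch below maps a 99th-percentile tombstone count to the highest matching severity. The `tombstone_severity` helper is hypothetical, the use of "greater than or equal to" is an assumption, and the actual rules may fire more than one tier at the same time.

```python
from typing import Optional

# Thresholds taken from the alert summaries, highest tier first.
TOMBSTONE_TIERS = [
    (50000, "Critical"),
    (25000, "Major"),
    (10000, "Warning"),
]


def tombstone_severity(p99_tombstones_scanned: float) -> Optional[str]:
    """Return the highest severity whose threshold is met, or None."""
    for threshold, severity in TOMBSTONE_TIERS:
        if p99_tombstones_scanned >= threshold:
            return severity
    return None


for scanned in (5000, 12000, 30000, 75000):
    print(scanned, tombstone_severity(scanned))
```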

CassandraViewWriteLatencyTooHigh

Severity

Warning

Summary

Cassandra high view/write latency.

Description

The {{ $labels.namespace }}/{{ $labels.pod }} Cassandra Pod in the {{ $labels.cassandra_cluster }} cluster reports over 1-second view/write latency for 99% of requests.