Cassandra

Cassandra

This section describes the alerts for the Cassandra service.


CassandraServiceDown

Severity Minor
Summary The Cassandra service on the {{ $labels.host }} node is down.
Raise condition procstat_running{process_name="cassandra-server"} == 0
Description Raises when Telegraf cannot find running cassandra-server processes on the ntw or nal hosts. The host label in the raised alert contains the host name of the affected node.
Troubleshooting Inspect the Cassandra logs in the /var/log/cassandra/ directory on the affected node.
Tuning Not required

CassandraServiceDownMinor

Severity Minor
Summary More than 30% of Cassandra services are down.
Raise condition count(procstat_running{process_name="cassandra-server"} == 0) >= count(procstat_running{process_name="cassandra-server"}) *{{ monitoring.services_failed_warning_threshold_percent }}
Description Raises when Telegraf cannot find running cassandra-server processes on more than 30% of ntw and nal hosts.
Troubleshooting
  • Inspect the CassandraServiceDown alert for the host names of the affected nodes.
  • Inspect the Cassandra logs in /var/log/cassandra/.
Tuning Not required

CassandraServiceDownMajor

Severity Major
Summary More than 60% of Cassandra services are down.
Raise condition count(procstat_running{process_name="cassandra-server"} == 0) >= count(procstat_running{process_name="cassandra-server"}) *{{ monitoring.services_failed_critical_threshold_percent }}
Description Raises when Telegraf cannot find running cassandra-server processes on more than 60% of ntw and nal hosts.
Troubleshooting
  • Inspect the CassandraServiceDown alert for the host names of the affected nodes.
  • Inspect the Cassandra logs in /var/log/cassandra/.
Tuning Not required

CassandraServiceOutage

Severity Critical
Summary All Cassandra services are down.
Raise condition count(procstat_running{process_name="cassandra-server"} == 0) == count(procstat_running{process_name="cassandra-server"})
Description Raises when Telegraf cannot find running cassandra-server processes on all ntw and nal hosts.
Troubleshooting
  • Inspect the CassandraServiceDown alert for the host names of the affected nodes.
  • Inspect the Cassandra logs in /var/log/cassandra/.
Tuning Not required