Cassandra
This section describes the alerts for the Cassandra service.
CassandraServiceDown
| Severity |
Minor |
| Summary |
The Cassandra service on the {{ $labels.host }} node is down. |
| Raise condition |
procstat_running{process_name="cassandra-server"} == 0 |
| Description |
Raises when Telegraf cannot find a running cassandra-server process
on an ntw or nal host. The host label of the raised alert
contains the host name of the affected node. |
| Troubleshooting |
Inspect the Cassandra logs in the /var/log/cassandra/ directory on
the affected node. |
| Tuning |
Not required |
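
As a rough illustration of the raise condition above, the following Python sketch mimics how the `== 0` comparison keeps only the series whose value is 0, so that each remaining series corresponds to one raised alert. The host names and sample values are hypothetical, and the dictionary stands in for the `procstat_running{process_name="cassandra-server"}` series keyed by the host label.

```python
from typing import Dict, List


def hosts_with_cassandra_down(procstat_running: Dict[str, float]) -> List[str]:
    """Return hosts whose cassandra-server sample equals 0.

    Mirrors `procstat_running{process_name="cassandra-server"} == 0`:
    the comparison filters out every series whose value is not 0, and
    each remaining series corresponds to one CassandraServiceDown alert.
    """
    return sorted(host for host, value in procstat_running.items() if value == 0)


# Hypothetical samples: host label -> number of running cassandra-server processes.
samples = {"ntw01": 1, "ntw02": 0, "nal01": 1}
down_hosts = hosts_with_cassandra_down(samples)
print(down_hosts)
```

In this sketch only the host reporting 0 running processes is returned, which is the host name that the alert's host label would carry.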
CassandraServiceDownMinor
| Severity |
Minor |
| Summary |
More than 30% of Cassandra services are down. |
| Raise condition |
count(procstat_running{process_name="cassandra-server"} == 0) >=
count(procstat_running{process_name="cassandra-server"}) *
{{ monitoring.services_failed_warning_threshold_percent }} |
| Description |
Raises when Telegraf cannot find running cassandra-server processes
on at least 30% of the ntw and nal hosts. |
| Troubleshooting |
- Inspect the CassandraServiceDown alert for the host names of the affected nodes.
- Inspect the Cassandra logs in /var/log/cassandra/.
|
| Tuning |
Not required |
CassandraServiceDownMajor
| Severity |
Major |
| Summary |
More than 60% of Cassandra services are down. |
| Raise condition |
count(procstat_running{process_name="cassandra-server"} == 0) >=
count(procstat_running{process_name="cassandra-server"}) *
{{ monitoring.services_failed_critical_threshold_percent }} |
| Description |
Raises when Telegraf cannot find running cassandra-server processes
on at least 60% of the ntw and nal hosts. |
| Troubleshooting |
- Inspect the CassandraServiceDown alert for the host names of the affected nodes.
- Inspect the Cassandra logs in /var/log/cassandra/.
|
| Tuning |
Not required |
CassandraServiceOutage
| Severity |
Critical |
| Summary |
All Cassandra services are down. |
| Raise condition |
count(procstat_running{process_name="cassandra-server"} == 0) ==
count(procstat_running{process_name="cassandra-server"}) |
| Description |
Raises when Telegraf cannot find running cassandra-server processes
on all ntw and nal hosts. |
| Troubleshooting |
- Inspect the CassandraServiceDown alert for the host names of the affected nodes.
- Inspect the Cassandra logs in /var/log/cassandra/.
|
| Tuning |
Not required |
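
To show how the three aggregate raise conditions relate to each other, the following Python sketch evaluates them against one set of samples. It is an approximation under stated assumptions: the thresholds are taken as the fractions 0.3 and 0.6 to match the 30% and 60% figures in the alert summaries (the real values come from the `monitoring.services_failed_warning_threshold_percent` and `monitoring.services_failed_critical_threshold_percent` template parameters), the host names are hypothetical, and any alert inhibition or routing configured elsewhere is not modeled.

```python
from typing import Dict, List

# Assumed threshold values, expressed as fractions to match the 30% and 60%
# figures in the alert summaries. The deployed values are rendered from the
# monitoring.services_failed_*_threshold_percent template parameters.
WARNING_THRESHOLD = 0.3
CRITICAL_THRESHOLD = 0.6


def aggregate_alerts(procstat_running: Dict[str, float]) -> List[str]:
    """Return the aggregate Cassandra alerts whose raise condition holds."""
    total = len(procstat_running)
    down = sum(1 for value in procstat_running.values() if value == 0)
    # In PromQL, count() over an empty vector returns no result, so none of
    # the comparisons can match when no series exist or no process is down.
    if total == 0 or down == 0:
        return []
    raised = []
    # Each condition is evaluated independently, so several can hold at once.
    if down >= total * WARNING_THRESHOLD:
        raised.append("CassandraServiceDownMinor")
    if down >= total * CRITICAL_THRESHOLD:
        raised.append("CassandraServiceDownMajor")
    if down == total:
        raised.append("CassandraServiceOutage")
    return raised


# Hypothetical samples: host label -> number of running cassandra-server processes.
samples = {"ntw01": 0, "ntw02": 0, "ntw03": 1, "nal01": 1, "nal02": 1}
print(aggregate_alerts(samples))
```

With two of the five hypothetical hosts down, only the Minor condition holds. Because the conditions are independent expressions, a full outage in this sketch satisfies all three at once.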