Redis
This section describes the alerts for the Redis service.
RedisServiceDown
Severity |
Minor |
Summary |
The Redis service on the {{$labels.host}} node is down. |
Raise condition |
procstat_running{process_name="redis-server"} == 0 |
Description |
Raises when Telegraf cannot find a running redis-server process on
a node. Typical causes are excessive memory consumption on the node,
the Redis port being used by another process, or wrong permissions on
the Redis configuration or log files. The host label in the raised
alert contains the host name of the affected node. |
Troubleshooting |
- Verify the redis-server service status using systemctl status redis-server.
- Inspect the redis-server service logs in /var/log/redis/redis-server.log.
|
Tuning |
Not required |
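
The description above names Redis port usage by another process and wrong file permissions among the typical causes. A minimal Python sketch of checking both might look as follows. It assumes the default Redis port 6379 and a configuration file at /etc/redis/redis.conf, neither of which is stated in this section, and the helper names are hypothetical. The permission check reflects the user running the script, so it is most meaningful when run as the user that Redis runs as.

```python
import os
import socket


def port_in_use(port, host="127.0.0.1", timeout=1.0):
    """Return True if something accepts TCP connections on host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(timeout)
        # connect_ex returns 0 on success instead of raising an exception.
        return sock.connect_ex((host, port)) == 0


def inaccessible_paths(paths, mode=os.R_OK):
    """Return the paths that are missing or not accessible with the given mode."""
    return [path for path in paths if not os.access(path, mode)]


if __name__ == "__main__":
    # Assumed defaults: port 6379 and /etc/redis/redis.conf are not given in this section.
    if port_in_use(6379):
        print("Port 6379 accepts connections; if redis-server is down, another process holds it.")
    for path in inaccessible_paths(["/etc/redis/redis.conf", "/var/log/redis/redis-server.log"]):
        print(f"Missing or not readable by the current user: {path}")
```

If Redis is bound to an address other than 127.0.0.1, pass that address as the host argument.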
RedisServiceDownMinor
Severity |
Minor |
Summary |
More than 30% of Redis services are down. |
Raise condition |
count(procstat_running{process_name="redis-server"} == 0) >= count(procstat_running{process_name="redis-server"}) * 0.3 |
Description |
Raises when Telegraf cannot find running redis-server processes on
more than 30% (the default threshold) of the ntw and nal hosts. |
Troubleshooting |
- Inspect the RedisServiceDown alerts for the host names of the affected nodes.
- Verify the redis-server service status using systemctl status redis-server.
- Inspect the redis-server service logs in /var/log/redis/redis-server.log.
|
Tuning |
Not required |
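
Because the raise condition compares a host count against a fraction of the total, the number of failed hosts that triggers the alert depends on cluster size. The following Python sketch, with a hypothetical helper name, computes the smallest number of down hosts that satisfies a condition of the form `down >= total * ratio`. It is an illustration of the arithmetic only, not part of the alert definition.

```python
import math


def min_down_to_fire(total_hosts, ratio):
    """Smallest number of down hosts for which down >= total_hosts * ratio holds.

    At least one host must be down: in PromQL, count() over an empty
    result returns no sample, so the expression cannot fire when no
    process reports procstat_running == 0.
    """
    if total_hosts <= 0:
        return None  # no procstat_running series at all, nothing to compare
    return max(1, math.ceil(total_hosts * ratio))


# Example cluster sizes, evaluated for the 0.3 and 0.6 thresholds.
for total in (3, 5, 10):
    print(total, min_down_to_fire(total, 0.3), min_down_to_fire(total, 0.6))
```

In a three-node cluster, a single stopped redis-server already represents about 33% of the hosts and satisfies the 0.3 threshold, while two stopped processes are needed to reach 0.6.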
RedisServiceDownMajor
Severity |
Major |
Summary |
More than 60% of Redis services are down. |
Raise condition |
count(procstat_running{process_name="redis-server"} == 0) >= count(procstat_running{process_name="redis-server"}) * 0.6 |
Description |
Raises when Telegraf cannot find running redis-server processes on
more than 60% (the default threshold) of the ntw and nal hosts. |
Troubleshooting |
- Inspect the RedisServiceDown alerts for the host names of the affected nodes.
- Verify the redis-server service status using systemctl status redis-server.
- Inspect the redis-server service logs in /var/log/redis/redis-server.log.
|
Tuning |
Not required |
RedisServiceOutage
Severity |
Critical |
Summary |
All Redis services are down. |
Raise condition |
count(procstat_running{process_name="redis-server"} == 0) == count(procstat_running{process_name="redis-server"}) |
Description |
Raises when Telegraf cannot find running redis-server processes on
all ntw and nal hosts. |
Troubleshooting |
- Inspect the RedisServiceDown alerts for the host names of the affected nodes.
- Verify the redis-server service status using systemctl status redis-server.
- Inspect the redis-server service logs in /var/log/redis/redis-server.log.
|
Tuning |
Not required |
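
The raise conditions of RedisServiceDownMinor, RedisServiceDownMajor, and RedisServiceOutage overlap, so a full outage satisfies all three at once. The Python sketch below mirrors the three expressions for a given set of per-host procstat_running values. The function name and the host names in the usage example are hypothetical, and the per-host RedisServiceDown alert is left out.

```python
def firing_aggregate_alerts(running_by_host):
    """Return the aggregate Redis alerts whose raise condition holds.

    running_by_host maps a host name to its procstat_running value for
    redis-server (0 means no running process was found).
    """
    total = len(running_by_host)
    down = sum(1 for value in running_by_host.values() if value == 0)
    if down == 0:
        # count() over an empty result yields no sample, so nothing fires.
        return []
    alerts = []
    if down >= total * 0.3:
        alerts.append("RedisServiceDownMinor")
    if down >= total * 0.6:
        alerts.append("RedisServiceDownMajor")
    if down == total:
        alerts.append("RedisServiceOutage")
    return alerts


# Hypothetical host names, used only for illustration.
print(firing_aggregate_alerts({"ntw01": 0, "ntw02": 1, "nal01": 1}))
print(firing_aggregate_alerts({"ntw01": 0, "ntw02": 0, "nal01": 0}))
```

When several of these alerts are raised together, the one with the highest severity describes the actual state of the cluster.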