Redis

This section describes the alerts for the Redis service.


RedisServiceDown

Severity Minor
Summary The Redis service on the {{$labels.host}} node is down.
Raise condition procstat_running{process_name="redis-server"} == 0
Description Raises when Telegraf cannot find a running redis-server process on a node. Typical causes are high memory consumption on the node, another process occupying the Redis port, or wrong permissions set on the Redis configuration or log files. The host label in the raised alert contains the host name of the affected node.
Troubleshooting
  • Verify the redis-server service status using systemctl status redis-server.
  • Inspect the redis-server service logs in /var/log/redis/redis-server.log.
Tuning Not required
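
One of the causes named in the description, another process occupying the Redis port, can be checked with a short script. The following is a minimal sketch, not part of the alert definition. It assumes the default Redis port 6379 and uses a hypothetical helper name, port_in_use. It only reports whether something accepts TCP connections on the port, not which process owns it.

```python
import socket


def port_in_use(host: str = "127.0.0.1", port: int = 6379, timeout: float = 1.0) -> bool:
    """Return True if something accepts TCP connections on host:port.

    The default port 6379 is an assumption (the Redis default); adjust it
    if the deployment configures a different port.
    """
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(timeout)
        # connect_ex returns 0 on success and an errno value otherwise.
        return sock.connect_ex((host, port)) == 0


if __name__ == "__main__":
    print("port 6379 in use:", port_in_use())
```

If the port is in use while systemctl status redis-server reports the service as stopped, another process is probably holding the port.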

RedisServiceDownMinor

Severity Minor
Summary At least 30% of Redis services are down.
Raise condition count(procstat_running{process_name="redis-server"} == 0) >= count(procstat_running{process_name="redis-server"}) * 0.3
Description Raises when Telegraf cannot find running redis-server processes on at least 30% (the default threshold) of the ntw and nal hosts.
Troubleshooting
  • Inspect the RedisServiceDown alerts for the host names of the affected nodes.
  • Verify the redis-server service status using systemctl status redis-server.
  • Inspect the redis-server service logs in /var/log/redis/redis-server.log.
Tuning Not required

RedisServiceDownMajor

Severity Major
Summary At least 60% of Redis services are down.
Raise condition count(procstat_running{process_name="redis-server"} == 0) >= count(procstat_running{process_name="redis-server"}) * 0.6
Description Raises when Telegraf cannot find running redis-server processes on at least 60% (the default threshold) of the ntw and nal hosts.
Troubleshooting
  • Inspect the RedisServiceDown alerts for the host names of the affected nodes.
  • Verify the redis-server service status using systemctl status redis-server.
  • Inspect the redis-server service logs in /var/log/redis/redis-server.log.
Tuning Not required

RedisServiceOutage

Severity Critical
Summary All Redis services are down.
Raise condition count(procstat_running{process_name="redis-server"} == 0) == count(procstat_running{process_name="redis-server"})
Description Raises when Telegraf cannot find running redis-server processes on all ntw and nal hosts.
Troubleshooting
  • Inspect the RedisServiceDown alerts for the host names of the affected nodes.
  • Verify the redis-server service status using systemctl status redis-server.
  • Inspect the redis-server service logs in /var/log/redis/redis-server.log.
Tuning Not required
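
The three cluster-level raise conditions compare the number of hosts with no running redis-server process against the total number of monitored hosts. The sketch below reproduces that arithmetic in Python to show which alerts fire for a given set of procstat_running values. The function name cluster_alerts and the list input are illustrative assumptions, and the sketch ignores any alert inhibition that the deployment may configure.

```python
from typing import List


def cluster_alerts(running: List[int]) -> List[str]:
    """Return the cluster-level Redis alerts whose raise conditions hold.

    `running` holds one procstat_running value per monitored host;
    0 means Telegraf found no redis-server process on that host.
    """
    total = len(running)
    down = sum(1 for value in running if value == 0)
    if down == 0:
        # In PromQL, count() over an empty vector returns no result,
        # so none of the comparisons can match.
        return []
    alerts = []
    if down >= total * 0.3:
        alerts.append("RedisServiceDownMinor")
    if down >= total * 0.6:
        alerts.append("RedisServiceDownMajor")
    if down == total:
        alerts.append("RedisServiceOutage")
    return alerts


if __name__ == "__main__":
    # Five hosts, two without a running redis-server process.
    print(cluster_alerts([1, 1, 1, 0, 0]))
```

Because the conditions are cumulative, a full outage satisfies all three expressions at once, so the lower-severity alerts are also raised alongside RedisServiceOutage.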