Redis

This section describes the alerts for the Redis service.


RedisServiceDown

Severity

Minor

Summary

The Redis service on the {{$labels.host}} node is down.

Raise condition

procstat_running{process_name="redis-server"} == 0

Description

Raises when Telegraf cannot find a running redis-server process on a node. Typical causes are high memory consumption on the node, another process occupying the Redis port, or incorrect permissions on the Redis configuration or log files. The host label in the raised alert contains the host name of the affected node.

Troubleshooting

  • Verify the redis-server service status using systemctl status redis-server.

  • Inspect the redis-server service logs in /var/log/redis/redis-server.log.
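The checks above, together with the typical causes named in the description, can be sketched as a small read-only triage script. This is an illustrative sketch, not part of the product: the function names are hypothetical, and the port (6379) and the configuration path (/etc/redis/redis.conf) are assumptions based on common Redis defaults that may differ in a given deployment.

```shell
#!/usr/bin/env bash
# Sketch only. REDIS_PORT and REDIS_CONF are assumed defaults, not values from this guide.
REDIS_PORT="${REDIS_PORT:-6379}"
REDIS_CONF="${REDIS_CONF:-/etc/redis/redis.conf}"
REDIS_LOG="${REDIS_LOG:-/var/log/redis/redis-server.log}"

# Print the last N lines of the Redis log (default: 50).
redis_log_tail() {
  local lines="${1:-50}"
  tail -n "$lines" "$REDIS_LOG"
}

# Run read-only checks for the typical causes listed in the alert description.
redis_triage() {
  systemctl status redis-server --no-pager   # service state
  free -m                                    # memory consumption on the node
  ss -ltnp | grep ":${REDIS_PORT} "          # process currently bound to the Redis port
  ls -l "$REDIS_CONF" "$REDIS_LOG"           # ownership and permissions
  redis_log_tail 50                          # most recent log entries
}
```

To use it, source the script on the affected node and run redis_triage with sufficient privileges. Some of the commands, such as ss -p, show full process details only when run as root.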

Tuning

Not required

RedisServiceDownMinor

Severity

Minor

Summary

More than 30% of Redis services are down.

Raise condition

count(procstat_running{process_name="redis-server"} == 0) >= count (procstat_running{process_name="redis-server"}) * 0.3

Description

Raises when Telegraf cannot find running redis-server processes on at least 30% (the default threshold) of the ntw and nal hosts.
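As a worked illustration of the raise condition, the comparison can be reproduced with integer arithmetic. This is a minimal sketch: the function name and its arguments are hypothetical, and the host counts are made-up example values.

```shell
#!/usr/bin/env bash
# Succeeds when down/total meets or exceeds the given percentage, mirroring
#   count(... == 0) >= count(...) * <fraction>
# redis_threshold_met is a hypothetical helper, not part of any product.
redis_threshold_met() {
  local down="$1" total="$2" percent="$3"
  # Integer form of: down >= total * percent / 100 (avoids floating point).
  [ "$total" -gt 0 ] && [ $(( down * 100 )) -ge $(( total * percent )) ]
}

# Example with 5 monitored hosts: 30% of 5 is 1.5, so 2 stopped hosts are needed.
redis_threshold_met 1 5 30 && echo "fires" || echo "does not fire"   # prints "does not fire"
redis_threshold_met 2 5 30 && echo "fires" || echo "does not fire"   # prints "fires"
```

The same check with 60 or 100 as the percentage corresponds to the conditions of the RedisServiceDownMajor and RedisServiceOutage alerts.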

Troubleshooting

  • Inspect the RedisServiceDown alerts for the host names of the affected nodes.

  • Verify the redis-server service status using systemctl status redis-server.

  • Inspect the redis-server service logs in /var/log/redis/redis-server.log.

Tuning

Not required

RedisServiceDownMajor

Severity

Major

Summary

More than 60% of Redis services are down.

Raise condition

count(procstat_running{process_name="redis-server"} == 0) >= count (procstat_running{process_name="redis-server"}) * 0.6

Description

Raises when Telegraf cannot find running redis-server processes on at least 60% (the default threshold) of the ntw and nal hosts.

Troubleshooting

  • Inspect the RedisServiceDown alerts for the host names of the affected nodes.

  • Verify the redis-server service status using systemctl status redis-server.

  • Inspect the redis-server service logs in /var/log/redis/redis-server.log.

Tuning

Not required

RedisServiceOutage

Severity

Critical

Summary

All Redis services are down.

Raise condition

count(procstat_running{process_name="redis-server"} == 0) == count (procstat_running{process_name="redis-server"})

Description

Raises when Telegraf cannot find running redis-server processes on all ntw and nal hosts.

Troubleshooting

  • Inspect the RedisServiceDown alerts for the host names of the affected nodes.

  • Verify the redis-server service status using systemctl status redis-server.

  • Inspect the redis-server service logs in /var/log/redis/redis-server.log.

Tuning

Not required