This section describes the alerts for the Memcached service.
Severity | Minor |
---|---|
Summary | The Memcached service on the {{ $labels.host }} node is down. |
Raise condition | memcached_up == 0 |
Description | Raises when Telegraf cannot gather metrics from the Memcached service,
typically indicating that Memcached is down on one node and caching does
not work on that node. The host label in the raised alert contains
the host name of the affected node. |
Troubleshooting |
|
Tuning | Not required |
Removed since the 2019.2.4 maintenance update.
Severity | Warning |
---|---|
Summary | The Memcached service on the {{ $labels.host }} node was respawned. |
Raise condition | memcached_uptime < 180 |
Description | Raises when the Memcached service uptime is below 180 seconds,
indicating that it was recently respawned (restarted). If Memcached
respawning happened during maintenance, the alert is expected.
Otherwise, this alert indicates an issue with the service. Warning: The alert is a partial duplicate of
|
Troubleshooting |
|
Tuning | Disable the alert as described in Manage alerts. |
Severity | Warning |
---|---|
Summary | More than 5 client connections to the Memcached database on the
{{ $labels.host }} node throttle for 2 minutes. |
Raise condition | increase(memcached_conn_yields[1m]) > 5 |
Description | Raises when the number of times the Memcached connection was throttled
exceeds 5 over the last minute.
This warning appears with the "Too many open connections" error message
in Memcached. Too many connections may cause write errors because
of process starvation (blocking). To avoid this, Memcached throttles
the connection. The host label in the raised alert contains the host
name of the affected node. |
Troubleshooting |
|
Tuning | To change the throttling threshold to
|
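
The raise condition `increase(memcached_conn_yields[1m]) > 5` compares the growth of a counter over a time window with a threshold. The following is a minimal Python sketch of that idea, not the Prometheus implementation: it sums the growth between consecutive samples and treats a drop as a counter reset, but omits the extrapolation to the window boundaries that Prometheus also applies. The function names and the sample readings are hypothetical.

```python
from typing import Sequence


def counter_increase(samples: Sequence[float]) -> float:
    """Sum the growth between consecutive counter samples, tolerating resets."""
    total = 0.0
    for prev, curr in zip(samples, samples[1:]):
        # A drop means the counter was reset (for example, Memcached restarted),
        # so the new value counts in full as growth since the reset.
        total += curr - prev if curr >= prev else curr
    return total


def throttling_alert_fires(samples: Sequence[float], threshold: float = 5) -> bool:
    """Mirror the strict '>' comparison used in the raise condition."""
    return counter_increase(samples) > threshold


# Hypothetical memcached_conn_yields readings scraped within one minute.
window = [100, 102, 103, 107]
print(counter_increase(window))        # 7.0
print(throttling_alert_fires(window))  # True
```
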
Severity | Minor |
---|---|
Summary | The Memcached database on the {{ $labels.host }} node has no open
connections. |
Raise condition | memcached_curr_connections == 0 |
Description | Raises when no connections to Memcached exist on one node, typically
indicating that the connections were dropped. This state may affect
performance. The host label in the raised alert contains the host
name of the affected node. |
Troubleshooting |
|
Tuning | Not required |
Severity | Major |
---|---|
Summary | The Memcached database has no open connections on all nodes. |
Raise condition | count(memcached_curr_connections == 0) == count(memcached_up) |
Description | Raises when no connections to Memcached exist on all nodes, indicating that Memcached has no client connected to it and does not receive data. |
Troubleshooting |
|
Tuning | Not required |
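
The raise condition above fires only when the number of nodes reporting zero connections equals the number of nodes reporting `memcached_up` at all. The following minimal Python sketch mirrors that comparison on plain dictionaries keyed by host. The function name and the host names are hypothetical. The sketch also reflects that PromQL `count()` over an empty vector returns no result rather than 0, so the condition cannot match when no node is idle.

```python
from typing import Dict


def all_nodes_idle(curr_connections: Dict[str, float], up: Dict[str, float]) -> bool:
    """Return True when every node that reports memcached_up has zero connections."""
    idle = [host for host, value in curr_connections.items() if value == 0]
    # count() over an empty vector yields nothing in PromQL, so an empty side
    # of the comparison never satisfies the equality.
    if not idle or not up:
        return False
    # count(memcached_up) counts every reported series, whatever its value.
    return len(idle) == len(up)


# Hypothetical controller nodes.
up = {"ctl01": 1, "ctl02": 1, "ctl03": 1}
print(all_nodes_idle({"ctl01": 0, "ctl02": 0, "ctl03": 0}, up))  # True
print(all_nodes_idle({"ctl01": 0, "ctl02": 4, "ctl03": 0}, up))  # False
```
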
Removed since the 2019.2.4 maintenance update.
Severity | Minor |
---|---|
Summary | The Memcached database on the {{ $labels.host }} node is empty. |
Raise condition | memcached_curr_items == 0 |
Description | Raises when a Memcached database has no items on one node. As Memcached
is an in-memory database, this may be the result of Memcached respawn.
Otherwise, investigate the reason.
Warning: The alert has been removed starting from the 2019.2.4 maintenance update. For existing MCP deployments, disable this alert. |
Troubleshooting |
|
Tuning | Disable the alert as described in Manage alerts. |
Severity | Warning |
---|---|
Summary | More than 10 evictions in the Memcached database occurred on the
{{ $labels.host }} node during the last minute. |
Raise condition | increase(memcached_evictions[1m]) > 10 |
Description | Raises when the number of Memcached items that were removed before
their TTL expired has increased by more than 10 (default threshold) over the last
minute. Memcached is used on the OpenStack controller nodes to cache the
service authentication tokens. A high number of evictions indicates a
heavy token rotation since old items must be removed to free space
for the new ones, based on pseudo-LRU. The host label in the raised
alert contains the host name of the affected node. |
Troubleshooting |
|
Tuning | To change the evictions limit to
|
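
To illustrate why heavy token rotation drives the evictions counter up, the following minimal Python sketch models a cache with a fixed capacity that drops the least recently used item whenever a new one does not fit. It is a plain LRU written for illustration, not the segmented pseudo-LRU that Memcached implements. The class name, the capacity, and the token keys are hypothetical.

```python
from collections import OrderedDict
from typing import Optional


class TinyLRUCache:
    """Fixed-capacity cache that counts evictions, as memcached_evictions does."""

    def __init__(self, capacity: int) -> None:
        self.capacity = capacity
        self.items: "OrderedDict[str, str]" = OrderedDict()
        self.evictions = 0

    def get(self, key: str) -> Optional[str]:
        if key not in self.items:
            return None
        self.items.move_to_end(key)  # mark as most recently used
        return self.items[key]

    def set(self, key: str, value: str) -> None:
        if key in self.items:
            self.items.move_to_end(key)
        elif len(self.items) >= self.capacity:
            self.items.popitem(last=False)  # drop the least recently used item
            self.evictions += 1
        self.items[key] = value


# Five new tokens written to a cache that holds only two items.
cache = TinyLRUCache(capacity=2)
for n in range(5):
    cache.set(f"token-{n}", "payload")
print(cache.evictions)    # 3
print(list(cache.items))  # ['token-3', 'token-4']
```

In this model, every write beyond the capacity evicts one older item, so a steady stream of new tokens produces a steady growth of the evictions counter.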