Docker Swarm

This section describes the alerts for the Docker Swarm service.


DockerSwarmNetworkUnhealthy

Severity

Warning

Summary

Docker Swarm network unhealthy.

Description

The qLen size and NetMsg showed unexpected output for the last 10 minutes. Verify the NetworkDb Stats output for the qLen size and NetMsg using journalctl -d docker.

Note

For the DockerNetworkUnhealthy alert, StackLight collects metrics from logs. Therefore, this alert is available only if logging is enabled.

DockerSwarmNodeFlapping

Severity

Major

Summary

Docker Swarm node is flapping.

Description

The {{ $labels.node_name }} Docker Swarm node (ID: {{ $labels.node_id }}) state flapped more than 3 times for the last 10 minutes.

DockerSwarmServiceReplicasDown

Severity

Major

Summary

Docker Swarm replica is down.

Description

The {{ $labels.service_name }} Docker Swarm {{ $labels.service_mode }} service replica is down for 5 minutes.

DockerSwarmServiceReplicasFlapping

Severity

Major

Summary

Docker Swarm service is flapping.

Description

The {{ $labels.service_name }} Docker Swarm {{ $labels.service_mode }} service replica is flapping for 10 minutes.

DockerSwarmServiceReplicasOutage

Severity

Critical

Summary

Docker Swarm service outage.

Description

All {{ $labels.service_name }} Docker Swarm {{ $labels.service_mode }} service replicas are down for 2 minutes.