Alert dependencies

Using alert inhibition rules, Alertmanager decreases alert noise by suppressing dependent alerts notifications to provide a clearer view on the cloud status and simplify troubleshooting. Alert inhibition rules are enabled by default.

The following table describes the dependency between alerts. Once an alert from the Alert column raises, the alert from the Silences column will be suppressed with the Inhibited status in the Alertmanager web UI.

Alert

Silences

CephClusterFullCritical

CephClusterFullWarning

CephClusterHealthCritical

CephClusterHealthMinor

CephNodeDown

CephOSDDiskUnavailable

CephOSDDiskNotResponding

CephOSDDown

CephOSDDiskUnavailable

CephOSDDown

CephOSDPgNumTooHighCritical

CephOSDPgNumTooHighWarning

DockerSwarmServiceReplicasFlapping

DockerSwarmServiceReplicasDown

DockerSwarmServiceReplicasOutage

DockerSwarmServiceReplicasDown

ElasticClusterStatusCritical

ElasticClusterStatusWarning

KubeJobCompletion

KubeJobFailed

ElasticHeapUsageCritical

ElasticHeapUsageWarning

FileDescriptorUsageMajor

FileDescriptorUsageWarning

IronicBmApiOutage

IronicBmMetricsMissing

SystemDiskFullMajor

SystemDiskFullWarning

SystemDiskInodesFullMajor

SystemDiskInodesFullWarning

SystemLoadTooHighCritical

SystemLoadTooHighWarning

SystemMemoryFullMajor

SystemMemoryFullWarning

KubePersistentVolumeUsageCritical

KubePersistentVolumeFullInFourDays

KubeAPICertExpirationMajor

KubeAPICertExpirationWarning

KubeAPIErrorsHighMajor

KubeAPIErrorsHighWarning

KubeAPIOutage

KubeAPIDown

KubeAPIResourceErrorsHighMajor

KubeAPIResourceErrorsHighWarning

KubeClientCertificateExpirationInOneDay

KubeClientCertificateExpirationInSevenDays

MCCSSLCertExpirationMajor

MCCSSLCertExpirationWarning

MKEAPICertExpirationMajor

MKEAPICertExpirationWarning

MKEAPIOutage

MKEAPIDown

MKENodeDiskFullCritical

MKENodeDiskFullWarning

NodeDown

KubeDaemonSetMisScheduled

KubeDaemonSetRolloutStuck

KubeAPIResourceErrorsHighMajor

KubeAPIResourceErrorsHighWarning

KubeletDown

KubeNodeNotReady

MKENodeDown

PostgresqlPatroniClusterUnlocked

PostgresqlReplicationNonStreamingReplicas

PostgresqlReplicationPaused

PostgresqlReplicaDown

PostgresqlReplicationNonStreamingReplicas

PostgresqlReplicationPaused

PostgresqlReplicationSlowWalApplication

PostgresqlReplicationSlowWalDownload

PostgresqlReplicationWalArchiveWriteFailing

PrometheusErrorSendingAlertsMajor

PrometheusErrorSendingAlertsWarning

PrometheusMsTeamsDown

KubeDeploymentReplicasMismatch

ServiceNowWebhookReceiverDown

KubeDeploymentReplicasMismatchdeployment

SSLCertExpirationMajor

SSLCertExpirationWarning

TargetDown

KubeDeploymentReplicasMismatch

KubeDaemonSetMisScheduled

KubeDaemonSetRolloutStuck

TargetFlapping

TargetDown