Ceph¶
This section describes the alerts for the Ceph cluster.
CephClusterHealthWarning¶
| Severity | Warning |
|---|---|
| Summary | Ceph cluster health is |
| Description | The Ceph cluster is in the |
CephClusterHealthCritical¶
| Severity | Critical |
|---|---|
| Summary | Ceph cluster health is |
| Description | The Ceph cluster is in the |
CephClusterTargetDown¶
| Severity | Critical |
|---|---|
| Summary | Ceph cluster Prometheus target is down. |
| Description | Prometheus fails to scrape metrics from the |
CephDaemonSlowOps¶
Available since 15.0.0 and 14.0.0
| Severity | Warning |
|---|---|
| Summary | |
| Description | |
CephMonClockSkew¶
Available since 15.0.0 and 14.0.0
| Severity | Warning |
|---|---|
| Summary | Ceph Monitor clock skew detected. |
| Description | Ceph Monitor clock drift exceeds the configured threshold on the Ceph cluster. |
CephMonQuorumAtRisk¶
| Severity | Major |
|---|---|
| Summary | Ceph cluster quorum at risk. |
| Description | The Ceph Monitors quorum on the Ceph cluster is low. |
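
The alert expression itself is not shown in this section. As a rough illustration of why a low Monitor count matters, the sketch below assumes the usual majority rule for Ceph Monitor quorum (more than half of the configured Monitors must be available) and computes how many further Monitors can fail before quorum is lost. The function names are hypothetical and are not part of any StackLight or Ceph API.

```python
def quorum_size(total_monitors: int) -> int:
    """Smallest number of Monitors that forms a strict majority."""
    if total_monitors < 1:
        raise ValueError("a cluster needs at least one Monitor")
    return total_monitors // 2 + 1


def failures_tolerated(total_monitors: int, monitors_in_quorum: int) -> int:
    """How many more Monitors can drop out before the majority is lost.

    A negative result means quorum is already lost (assumed majority rule).
    """
    return monitors_in_quorum - quorum_size(total_monitors)


if __name__ == "__main__":
    # Three Monitors configured, two currently in quorum: no headroom left.
    print(failures_tolerated(total_monitors=3, monitors_in_quorum=2))
```

Under this assumption, a three-Monitor cluster with only two Monitors in quorum cannot tolerate another Monitor failure, which is the kind of situation the alert is meant to surface.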
CephOSDDown¶
Removed in 17.0.0, 16.0.0, and 14.1.0
| Severity | Critical |
|---|---|
| Summary | Ceph OSDs are down. |
| Description | |
CephOSDFlapping¶
Available since 15.0.0 and 14.0.0
| Severity | Warning |
|---|---|
| Summary | Ceph OSDs flap due to network issues. |
| Description | The Ceph OSD |
CephOSDDiskNotResponding¶
| Severity | Critical |
|---|---|
| Summary | Disk not responding. |
| Description | The |
CephOSDSlowClusterNetwork¶
Available since 15.0.0 and 14.0.0
| Severity | Warning |
|---|---|
| Summary | Cluster network slows down Ceph OSD heartbeats. |
| Description | Ceph OSD heartbeats on the cluster network (back end) are slow. |
CephOSDSlowPublicNetwork¶
Available since 15.0.0 and 14.0.0
| Severity | Warning |
|---|---|
| Summary | Public network slows down Ceph OSD heartbeats. |
| Description | Ceph OSD heartbeats on the public network (front end) are slow. |
CephClusterFullWarning¶
| Severity | Warning |
|---|---|
| Summary | Ceph cluster is nearly full. |
| Description | The Ceph cluster utilization has crossed 85%. Expansion is required. |
CephClusterFullCritical¶
| Severity | Critical |
|---|---|
| Summary | Ceph cluster is full. |
| Description | The Ceph cluster utilization has crossed 95% and needs immediate expansion. |
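
The two capacity alerts differ only in their threshold. The sketch below shows one way to map a utilization figure to the severity that would apply, using the 85% and 95% values from the descriptions above. The function name and its inputs are hypothetical, and treating "crossed" as strictly greater than the threshold is an assumption; the actual alert expressions are not shown in this section.

```python
from typing import Optional

WARNING_THRESHOLD = 0.85   # CephClusterFullWarning, per the description above
CRITICAL_THRESHOLD = 0.95  # CephClusterFullCritical, per the description above


def full_severity(used_bytes: int, total_bytes: int) -> Optional[str]:
    """Return "critical", "warning", or None for a given cluster utilization.

    Assumes an alert applies only when utilization is strictly above
    its threshold.
    """
    if total_bytes <= 0:
        raise ValueError("total_bytes must be positive")
    utilization = used_bytes / total_bytes
    if utilization > CRITICAL_THRESHOLD:
        return "critical"
    if utilization > WARNING_THRESHOLD:
        return "warning"
    return None


if __name__ == "__main__":
    # 90 TiB used out of 100 TiB: above 85%, below 95%.
    print(full_severity(used_bytes=90 * 2**40, total_bytes=100 * 2**40))
```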
CephOSDPgNumTooHighWarning¶
| Severity | Warning |
|---|---|
| Summary | Ceph OSDs have more than 200 PGs. |
| Description | Some Ceph OSDs contain more than 200 Placement Groups. This may have a negative impact on the cluster performance. For details, run `ceph pg dump`. |
CephOSDPgNumTooHighCritical¶
| Severity | Critical |
|---|---|
| Summary | Ceph OSDs have more than 300 PGs. |
| Description | Some Ceph OSDs contain more than 300 Placement Groups. This may have a negative impact on the cluster performance. For details, run `ceph pg dump`. |
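
To see which OSDs fall into each band, the per-OSD Placement Group counts can be bucketed against the 200 and 300 limits named above. The sketch below is illustrative only: the function name is hypothetical, the input is assumed to be a plain mapping of OSD name to PG count that you have already collected, and each OSD is reported only under its highest applicable severity.

```python
from typing import Dict, List

PG_WARNING_LIMIT = 200   # CephOSDPgNumTooHighWarning, per the description above
PG_CRITICAL_LIMIT = 300  # CephOSDPgNumTooHighCritical, per the description above


def classify_osd_pg_counts(pg_counts: Dict[str, int]) -> Dict[str, List[str]]:
    """Group OSD names by the highest PG-count limit they exceed."""
    result: Dict[str, List[str]] = {"warning": [], "critical": []}
    for osd_name in sorted(pg_counts):
        count = pg_counts[osd_name]
        if count > PG_CRITICAL_LIMIT:
            result["critical"].append(osd_name)
        elif count > PG_WARNING_LIMIT:
            result["warning"].append(osd_name)
    return result


if __name__ == "__main__":
    # Hypothetical counts for three OSDs.
    sample = {"osd.0": 150, "osd.1": 250, "osd.2": 320}
    print(classify_osd_pg_counts(sample))
```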
CephMonHighNumberOfLeaderChanges¶
| Severity | Major |
|---|---|
| Summary | Ceph cluster has too many leader changes. |
| Description | The Ceph Monitor |
CephOSDNodeDown¶
Available since 17.0.0, 16.0.0, and 14.1.0 as a replacement for CephNodeDown
| Severity | Critical |
|---|---|
| Summary | Ceph node |
| Description | The Ceph OSD node |
CephNodeDown¶
Renamed to CephOSDNodeDown in 17.0.0, 16.0.0, and 14.1.0
| Severity | Critical |
|---|---|
| Summary | Ceph node |
| Description | The Ceph node |
CephOSDVersionMismatch¶
| Severity | Warning |
|---|---|
| Summary | Multiple versions of Ceph OSDs running. |
| Description | |
CephMonVersionMismatch¶
| Severity | Warning |
|---|---|
| Summary | Multiple versions of Ceph Monitors running. |
| Description | |
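
Both version mismatch alerts come down to the same check: more than one distinct version is reported across daemons of one type. The sketch below illustrates that idea only. The function name is hypothetical, and the input is assumed to be a mapping of daemon name to version string gathered by some other means; the actual alert expressions are not shown in this section.

```python
from collections import Counter
from typing import Dict


def version_counts(daemon_versions: Dict[str, str]) -> Dict[str, int]:
    """Count how many daemons report each version string."""
    return dict(Counter(daemon_versions.values()))


def has_version_mismatch(daemon_versions: Dict[str, str]) -> bool:
    """True when the daemons report more than one distinct version."""
    return len(set(daemon_versions.values())) > 1


if __name__ == "__main__":
    # Hypothetical Monitor versions during a rolling upgrade.
    mons = {"mon.a": "16.2.10", "mon.b": "16.2.10", "mon.c": "16.2.11"}
    print(has_version_mismatch(mons), version_counts(mons))
```

A mismatch is expected for a short time during a rolling upgrade, so the per-version counts are usually more informative than the boolean alone.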
CephPGInconsistent¶
| Severity | Warning |
|---|---|
| Summary | Too many inconsistent Ceph PGs. |
| Description | The Ceph cluster detects inconsistencies in one or more replicas of an object in |
CephPGUndersized¶
| Severity | Warning |
|---|---|
| Summary | Too many undersized Ceph PGs. |
| Description | The Ceph cluster reports |