Core services
This section describes the alerts available for the core services.
Apache
ApacheServiceDown
ApacheServiceOutage
ApacheWorkersAbsent
Bond interfaces
BondInterfaceDown
BondInterfaceSlaveDown
BondInterfaceSlaveDownMajor
BondInterfaceSingleSlave
Docker
DockerdProcessDown
DockerServiceOutage
DockerService {{ camel_case_name }} ReplicasDownMinor
DockerService {{ camel_case_name }} ReplicasDownMajor
DockerService {{ camel_case_name }} Outage
DockerdServiceReplicaFlapping
Galera
GaleraServiceDown
GaleraServiceOutage
GaleraNodeNotReady
GaleraNodeNotConnected
GlusterFS
GlusterfsServiceMinor
GlusterfsServiceOutage
GlusterfsInodesUsedMinor
GlusterfsInodesUsedMajor
GlusterfsSpaceUsedMinor
GlusterfsSpaceUsedMajor
GlusterfsMountMissing
HAProxy
HaproxyServiceDown
HaproxyServiceDownMajor
HaproxyServiceOutage
HaproxyHTTPResponse5xxTooHigh
HaproxyBackendDown
HaproxyBackendDownMajor
HaproxyBackendOutage
Keepalived
KeepalivedProcessDown
KeepalivedProcessNotResponsive
KeepalivedFailedState
KeepalivedUnknownState
KeepalivedMultipleIPAddr
KeepalivedServiceOutage
libvirt
LibvirtDown
Memcached
MemcachedServiceDown
MemcachedServiceRespawn
MemcachedConnectionThrottled
MemcachedConnectionsNoneMinor
MemcachedConnectionsNoneMajor
MemcachedItemsNoneMinor
MemcachedEvictionsLimit
NGINX
NginxServiceDown
NginxServiceOutage
NginxDroppedIncomingConnections
NTP
NtpOffsetTooHigh
Open vSwitch
ProcessOVSVswitchdMemoryWarning
ProcessOVSVswitchdMemoryCritical
OVSInstanceArpingCheckDown
OVSTooManyPortRunningOnAgent
OVSErrorOnPort
OVSNonInternalPortDown
OVSGatherFailed
RabbitMQ
RabbitmqServiceDown
RabbitmqServiceOutage
RabbitMQUnequalQueueCritical
RabbitmqDiskFullWarning
RabbitmqDiskFullCritical
RabbitmqMemoryLowWarning
RabbitmqMemoryLowCritical
RabbitmqMessagesTooHigh
RabbitmqErrorLogsTooHigh
RabbitmqErrorLogsMajor
RabbitmqFdUsageWarning
RabbitmqFdUsageCritical
Reclass
ReclassUnstagedChanges
ReclassStagedChanges
ReclassRemoteDesync
Salt
SaltMasterServiceDown
SaltMinionServiceDown
SMART disks
SystemSMARTDiskUDMACrcErrorsTooHigh
SystemSMARTDiskHealthStatus
SystemSMARTDiskReadErrorRate
SystemSMARTDiskSeekErrorRate
SystemSMARTDiskTemperatureHigh
SystemSMARTDiskReallocatedSectorsCount
SystemSMARTDiskCurrentPendingSectors
SystemSMARTDiskReportedUncorrectableErrors
SystemSMARTDiskOfflineUncorrectableSectors
SystemSMARTDiskEndToEndError
SSL certificates
CertificateExpirationWarning
CertificateExpirationCritical
System
SystemCpuFullWarning
SystemLoadTooHighWarning
SystemLoadTooHighCritical
SystemDiskFullWarning
SystemDiskFullMajor
SystemDiskInodesFullWarning
SystemDiskInodesFullMajor
SystemDiskErrorsTooHigh
SystemDiskBacklogWarning
SystemDiskBacklogCritical
SystemDiskRequestQueuedWarning
SystemDiskRequestQueuedCritical
SystemMemoryFullWarning
SystemMemoryFullMajor
SystemSwapFullWarning
SystemSwapFullMinor
SystemRxPacketsDroppedTooHigh
SystemTxPacketsDroppedTooHigh
SystemRxPacketsErrorTooHigh
SystemTxPacketsErrorTooHigh
CronProcessDown
SshdProcessDown
SshFailedLoginsTooHigh
PacketsDroppedByCpuWarning
PacketsDroppedByCpuMinor
NetdevBudgetRanOutsWarning
SystemCpuIoWaitWarning
SystemCpuIoWaitCritical
SystemCpuStealTimeWarning
SystemCpuStealTimeCritical
updated: 2024-09-16 10:21