MOSK management

This section describes the alerts for MOSK management. These alerts are based on metrics from the MOSK management Metric Exporter (MCC Exporter) service.

For troubleshooting guidelines, see Troubleshoot MOSK management Exporter alerts.


ClusterBackupDeleteFailed

Severity

Critical

Summary

Cluster backup deletion has failed

Description

The deletion of the backup {{ $labels.backup_name }} ({{ $labels.backup_uid }}) of the {{ $labels.cluster_namespace }}/{{ $labels.cluster_name }} ({{ $labels.cluster_uid }}) cluster has failed.

ClusterBackupStuck

Severity

Critical

Summary

Cluster backup is stuck

Description

The backup {{ $labels.backup_name }} ({{ $labels.backup_uid }}) of the {{ $labels.cluster_namespace }}/{{ $labels.cluster_name }} ({{ $labels.cluster_uid }}) cluster is stuck.

ClusterBackupFailed

Severity

Critical

Summary

Cluster backup has failed

Description

The backup {{ $labels.backup_name }} ({{ $labels.backup_uid }}) for the {{ $labels.cluster_namespace }}/{{ $labels.cluster_name }} ({{ $labels.cluster_uid }}) cluster has failed.

ClusterUpdateAutoPaused

TechPreview

Severity

Warning

Summary

The cluster update is auto-paused

Description

The {{ $labels.cluster_namespace }}/{{ $labels.cluster_name }} ({{ $labels.cluster_uid }}) cluster update to {{ $labels.target }} is auto-paused.

ClusterUpdateInProgress

TechPreview

Severity

Informational

Summary

The cluster is updating

Description

The {{ $labels.cluster_namespace }}/{{ $labels.cluster_name }} ({{ $labels.cluster_uid }}) cluster update to {{ $labels.target }} is in progress.

ClusterUpdateStepAutoPaused

TechPreview

Severity

Warning

Summary

Step {{ $labels.step_id }} of the cluster update is auto-paused

Description

Step {{ $labels.step_id }} of the {{ $labels.cluster_namespace }}/{{ $labels.cluster_name }} ({{ $labels.cluster_uid }}) cluster update to {{ $labels.target }} is auto-paused.

ClusterUpdateStepInProgress

TechPreview

Severity

Informational

Summary

Step {{ $labels.step_id }} of the cluster update is in progress

Description

Step {{ $labels.step_id }} of the {{ $labels.cluster_namespace }}/{{ $labels.cluster_name }} ({{ $labels.cluster_uid }}) cluster update to {{ $labels.target }} is in progress.

ClusterUpdateStepStuck

TechPreview

Severity

Critical

Summary

Step {{ $labels.step_id }} of the cluster update is stuck

Description

Step {{ $labels.step_id }} of the {{ $labels.cluster_namespace }}/{{ $labels.cluster_name }} ({{ $labels.cluster_uid }}) cluster update to {{ $labels.target }} is stuck.

ClusterUpdateStuck

TechPreview

Severity

Critical

Summary

The cluster update is stuck

Description

The {{ $labels.cluster_namespace }}/{{ $labels.cluster_name }} ({{ $labels.cluster_uid }}) cluster update to {{ $labels.target }} is stuck.

MCCClusterLCMUnhealthy

Severity

Major

Summary

LCM of the cluster is unhealthy

Description

Some LCM operations have issues on the {{ $labels.namespace }}/{{ $labels.name }} cluster.

MCCClusterUpdating

Severity

Informational

Summary

The cluster is updating

Description

The {{ $labels.namespace }}/{{ $labels.name }} cluster is in the updating state.

MCCExporterTargetDown

Severity

Critical

Summary

MCC Exporter Prometheus target is down

Description

Prometheus fails to scrape metrics from the MCC Exporter service.

MCCLicenseExpirationHigh

Severity

Critical

Summary

MOSK management license expires on {{ $value | humanizeTimestamp }}

Description

The MOSK management license expires on {{ $value | humanizeTimestamp }}, less than 10 days are left.

MCCLicenseExpirationMedium

Severity

Warning

Summary

MOSK management license expires on {{ $value | humanizeTimestamp }}

Description

The MOSK management license expires on {{ $value | humanizeTimestamp }}, less than 30 days are left.

MCCUpdateBlocked

Severity

Warning

Summary

MOSK management update is blocked

Description

The MOSK management update from {{ $labels.active_kaasrelease_version }} to {{ $labels.pending_kaasrelease_version }} is available but blocked. For details, see Troubleshoot MOSK management Exporter alerts.

MCCUpdateScheduled

Severity

Informational

Summary

MOSK management update is scheduled

Description

The MOSK management update from {{ $labels.active_kaasrelease_version }} to {{ $labels.pending_kaasrelease_version }} is available and scheduled for {{ $value | humanizeTimestamp }}. For details, see Schedule MOSK management updates.