Troubleshooting Guide
The MOSK Troubleshooting Guide helps operators and cloud administrators diagnose and resolve issues that may occur during the deployment or operation of MOSK management and MOSK clusters. It provides practical diagnostic workflows, verification steps, and remediation procedures to identify root causes and restore normal functionality.
The goal of this guide is to enable quick, effective issue resolution and ensure a stable, reliable MOSK environment.
For the list of known issues that you may encounter in the cluster, refer to the Release Notes for the corresponding MOSK version.
- Collect cluster logs
- Inspect the history of a cluster and machine deployment or update
- Cluster deletion freezes
- Keycloak admin console becomes inaccessible after changing the theme
- Unresponsive OpenSDN API due to excessive Cassandra tombstone records
- The ‘database space exceeded’ error on large clusters
- The auditd events cause ‘backlog limit exceeded’ messages
- Troubleshoot a management cluster bootstrap
- Troubleshoot bare metal
- Troubleshoot Ceph
- Troubleshoot StackLight