Perform a graceful reboot of a cluster¶
Available since 2.23.0
You can perform a graceful reboot on a management, regional, or managed cluster. Use the below procedure to cordon, drain, and reboot the required cluster machines using a rolling reboot without workloads interruption. The procedure is also useful for a bulk reboot of machines, for example, on large clusters.
The reboot occurs in the order of cluster upgrade policy that you can change for managed clusters as described in Change the upgrade order of a machine or machine pool.
Caution
The cluster and machines must have the Ready
status to
perform a graceful reboot.
Perform a rolling reboot of a cluster using web UI¶
Available since 2.24.0 and 2.24.2 for MOSK 23.2
Log in to the Container Cloud web UI with the
m:kaas:namespace@operator
orm:kaas:namespace@writer
permissions.Switch to the required project using the Switch Project action icon located on top of the main left-side navigation panel.
On the Clusters page, verify that the status of the required cluster is Ready. Otherwise, the Reboot machines option is greyed out.
Click the More action icon in the last column of the required cluster and select Reboot machines. Confirm the selection.
Note
While a graceful reboot is in progress, the Reboot machines option is greyed out.
To monitor the cluster readiness, hover over the status icon of a specific cluster in the Status column of the Clusters page.
Once the orange blinking status icon becomes green and Ready, the cluster deployment or update is complete.
You can monitor live deployment status of the following cluster components:
Component
Description
Bastion
For the OpenStack-based management or regional clusters, the Bastion node IP address status that confirms the Bastion node creation
Helm
Installation or upgrade status of all Helm releases
Kubelet
Readiness of the node in a Kubernetes cluster, as reported by kubelet
Kubernetes
Readiness of all requested Kubernetes objects
Nodes
Equality of the requested nodes number in the cluster to the number of nodes having the
Ready
LCM statusOIDC
Readiness of the cluster OIDC configuration
StackLight
Health of all StackLight-related objects in a Kubernetes cluster
Swarm
Readiness of all nodes in a Docker Swarm cluster
LoadBalancer
Readiness of the Kubernetes API load balancer
ProviderInstance
Readiness of all machines in the underlying infrastructure (virtual or bare metal, depending on the provider type)
Graceful Reboot
Readiness of a cluster during a scheduled graceful reboot, available since Container Cloud 2.24.0 for non-MOSK clusters
For the history of a cluster deployment or update, refer to Inspect the history of a cluster and machine deployment or update.
Perform a rolling reboot of a cluster using CLI¶
Available since 2.23.0
Create a
GracefulRebootRequest
resource with a name that matches the name of the required cluster. For the resource fields description, see GracefulRebootRequest resource.In
spec:machines
, add the machine list or leave it empty to reboot all cluster machines.Wait until all specified machines are rebooted. You can monitor the reboot status of the cluster and machines using the
Conditions:GracefulReboot
fields of the correspondingCluster
andMachine
objects.The
GracefulRebootRequest
object is automatically deleted once the reboot on all target machines completes.To monitor the live machine status:
kubectl get machines <machineName> -o wide
Example of system response:
NAME READY LCMPHASE NODENAME UPGRADEINDEX REBOOTREQUIRED WARNINGS demo-0 true Ready kaas-node-c6aa8ad3 1 true
Note
To reboot a single node, for example, for maintenance purposes, refer to Enable cluster and machine maintenance mode.