Mirantis Container Cloud (MCC) becomes part of Mirantis OpenStack for Kubernetes (MOSK)!

Starting with MOSK 25.2, the MOSK documentation set will cover all product layers, including MOSK management (formerly MCC). This means everything you need will be in one place. The separate MCC documentation site will be retired, so please update your bookmarks for continued easy access to the latest content.

View Grafana dashboards¶

Using the Grafana web UI, you can view the visual representation of the metric graphs based on the time series databases.

Most Grafana dashboards include a View logs in OpenSearch Dashboards link to immediately view relevant logs in the OpenSearch Dashboards web UI. The OpenSearch Dashboards web UI displays logs filtered using the Grafana dashboard variables, such as the drop-downs. Once you amend the variables, wait for Grafana to generate a new URL.

Note

Due to the known issue, the View logs in OpenSearch Dashboards link does not work in Container Cloud 2.26.0 (Cluster releases 17.1.0 and 16.1.0). The issue is addressed in Container Cloud 2.26.1 (Cluster releases 17.1.1 and 16.1.1).

Caution

The Grafana dashboards that contain drop-down lists are limited to 1000 lines. Therefore, if you require data on a specific item, use the filter by name instead.

Note

Grafana dashboards that present node data have an additional Node identifier drop-down menu. By default, it is set to machine to display short names for Kubernetes nodes. To display Kubernetes node name labels, change this option to node.

To view the Grafana dashboards:

From the drop-down list, select the required dashboard to inspect the status and statistics of the corresponding service in your management or MOSK cluster:

Component	Dashboard	Description
Ceph cluster	Ceph Cluster	Provides the overall health status of the Ceph cluster, capacity, latency, and recovery metrics.
	Ceph Nodes	Provides an overview of the host-related metrics, such as the number of Ceph Monitors, Ceph OSD hosts, average usage of resources across the cluster, network and hosts load. This dashboard is deprecated since Container Cloud 2.25.0 (Cluster releases 17.0.0 and 16.0.0) and is removed in Container Cloud 2.26.0 (Cluster releases 17.1.0 and 16.1.0). Therefore, Mirantis recommends switching to the following dashboards in the current release: For Ceph stats, use the Ceph Cluster dashboard. For resource utilization, use the System dashboard, which includes filtering by Ceph node labels, such as `ceph_role_osd`, `ceph_role_mon`, and `ceph_role_mgr`.
	Ceph OSDs	Provides metrics for Ceph OSDs, including the Ceph OSD read and write latencies, distribution of PGs per Ceph OSD, Ceph OSDs and physical device performance.
	Ceph Pools	Provides metrics for Ceph pools, including the client IOPS and throughput by pool and pools capacity usage.
Ironic	Ironic BM	Provides graphs on Ironic health, HTTP API availability, provisioned nodes by state and installed `ironic-conductor` backend drivers.
Container Cloud	Clusters Overview	Represents the main cluster capacity statistics for all clusters of a Container Cloud deployment where StackLight is installed. Note Due to the known issue, the Prometheus Targets Unavailable panel of the Clusters Overview dashboard does not display data for managed clusters of the 11.7.0, 11.7.4, 12.5.0, and 12.7.x series Cluster releases after update to Container Cloud 2.24.0.
	Etcd	Available since Container Cloud 2.21.0 (Cluster release 11.5.0). Provides graphs on database size, leader elections, requests duration, incoming and outgoing traffic.
	MCC Applications Performance	Available since Container Cloud 2.23.0 (Cluster release 11.7.0). Provides information on the Container Cloud internals work based on Golang, controller runtime, and custom metrics. You can use it to verify performance of applications and for troubleshooting purposes.
Kubernetes resources	Kubernetes Calico	Provides metrics of the entire Calico cluster usage, including the cluster status, host status, and Felix resources.
	Kubernetes Cluster	Provides metrics for the entire Kubernetes cluster, including the cluster status, host status, and resources consumption.
	Kubernetes Containers	Provides charts showing resource consumption per deployed Pod containers running on Kubernetes nodes.
	Kubernetes Deployments	Provides information on the desired and current state of all service replicas deployed on a Container Cloud cluster.
	Kubernetes Namespaces	Provides the Pods state summary and the CPU, MEM, network, and IOPS resources consumption per name space.
	Kubernetes Nodes	Provides charts showing resources consumption per Container Cloud cluster node.
	Kubernetes Pods	Provides charts showing resources consumption per deployed Pod.
OpenStack	OpenStack - Overview	Provides general information on OpenStack services resources consumption, API errors, deployed OpenStack compute nodes and block storage usage.
	OpenStack Ingress controller	Available since MOSK 23.3. Monitors the number of requests, response times and statuses, as well as the number of Ingress SSL certificates including expiration time and resources usage.
	OpenStack Instances Availability	Available since MOSK 23.2. Provides information about the availability of instance floating IPs per OpenStack compute node and project. Also, enables monitoring of probe statistics for individual instance floating IPs.
	OpenStack Network IP Capacity	Available since MOSK 25.1. Provides information about the statistics of IP address allocation for external networks and subnets on non-Tungsten Fabric based MOSK clusters. For configuration details, see Start monitoring IP address capacity.
	OpenStack PortProber	Available since MOSK 24.2. Provides information about the availability of Neutron ports per OpenStack compute node, project, and port owner.
	OpenStack PortProber [Deprecated]	Available since MOSK 25.1. Provides information about the availability of Neutron ports per OpenStack compute node, project, and port owner. Deprecated in favor of the OpenStack PortProber dashboard. Use this deprecated dashboard only to access old data collected before MOSK 25.1.
	OpenStack PowerDNS	Available since MOSK 24.3. Provides different stats about OpenStack PowerDNS servers such as connections, resources, queries, rings, errors, and other.
	OpenStack Usage Efficiency	Available since MOSK 23.3. Provides information about requested (allocated) CPU and memory usage efficiency on a per-project and per-flavor basis. Aims to identify flavors that specific projects are not effectively using, with allocations significantly exceeding actual usage. Also, evaluates per-instance underuse for specific projects.
	KPI - Provisioning	Provides provisioning statistics for OpenStack compute instances, including graphs on VM creation results by day.
	Cinder	Provides graphs on the OpenStack Block Storage service health, HTTP API availability, pool capacity and utilization, number of created volumes and snapshots.
	Glance	Provides graphs on the OpenStack Image service health, HTTP API availability, number of created images and snapshots.
	Gnocchi	Provides panels and graphs on the Gnocchi health and HTTP API availability.
	Heat	Provides graphs on the OpenStack Orchestration service health, HTTP API availability and usage.
	Ironic OpenStack	Provides graphs on the OpenStack Bare Metal Provisioning service health, HTTP API availability, provisioned nodes by state and installed ironic-conductor backend drivers.
	Keystone	Provides graphs on the OpenStack Identity service health, HTTP API availability, number of tenants and users by state.
	Neutron	Provides graphs on the OpenStack networking service health, HTTP API availability, agents status and usage of Neutron L2 and L3 resources.
	NGINX Ingress controller	Not recommended. Deprecated since MOSK 23.3 and is removed in MOSK 24.1. Use OpenStack Ingress controller instead. Monitors the number of requests, response times and statuses, as well as the number of Ingress SSL certificates including expiration time and resources usage.
	Nova - Availability Zones	Provides detailed graphs on the OpenStack availability zones and hypervisor usage.
	Nova - Hypervisor Overview	Provides a set of single-stat panels presenting resources usage by host.
	Nova - Instances	Provides graphs on libvirt Prometheus exporter health and resources usage. Monitors the number of running instances and tasks and allows sorting the metrics by top instances.
	Nova - Overview	Provides graphs on the OpenStack compute services (`nova-scheduler`, `nova-conductor`, and `nova-compute`) health, as well as HTTP API availability.
	Nova - Tenants	Provides graphs on CPU, RAM, disk throughput, IOPS, and space usage and allocation and allows sorting the metrics by top tenants.
	Nova - Users	Provides graphs on CPU, RAM, disk throughput, IOPS, and space usage and allocation and allows sorting the metrics by top users.
	Nova - Utilization	Provides detailed graphs on Nova hypervisor resources capacity and consumption.
	Memcached	Memcached Prometheus exporter dashboard. Monitors Kubernetes Memcached pods and displays memory usage, hit rate, evicts and reclaims rate, items in cache, network statistics, and commands rate.
	MySQL	MySQL Prometheus exporter dashboard. Monitors Kubernetes MySQL pods, resources usage and provides details on current connections and database performance.
	RabbitMQ [Deprecated]	Not recommended. Deprecated since MOSK 25.1. RabbitMQ Prometheus exporter dashboard. Monitors Kubernetes RabbitMQ pods, resources usage and provides details on cluster utilization and performance. Caution This dashboard is renamed from RabbitMQ to RabbitMQ [Deprecated] in MOSK 25.1 and will be removed in one of the following releases for the sake of the RabbitMQ Overview and RabbitMQ Erlang dashboards. For deprecation details, see Deprecation notes: RabbitMQ Prometheus Exporter.
	RabbitMQ Erlang	Available since MOSK 25.1. Monitors RabbitMQ BEAM performance, memory details, load and distribution metrics using native Prometheus plugin metrics.
	RabbitMQ Overview	Available since MOSK 25.1. Monitors RabbitMQ node performance, resource usage, message queue, channel, and connection statistics using native Prometheus plugin metrics.
	Cassandra	Provides graphs on Cassandra clusters’ health, ongoing operations, and resource consumption.
	Kafka	Provides graphs on Kafka clusters’ and broker health, as well as broker and topic usage.
	Redis	Provides graphs on Redis clusters’ and pods’ health, connections, command calls, and resource consumption.
Tungsten Fabric	Tungsten Fabric Controller	Provides graphs on the overall Tungsten Fabric Controller cluster processes and usage.
	Tungsten Fabric vRouter	Provides graphs on the overall Tungsten Fabric vRouter cluster processes and usage.
	ZooKeeper	Provides graphs on ZooKeeper clusters’ quorum health and resource consumption.
StackLight	Alertmanager	Provides performance metrics on the overall health status of the Prometheus Alertmanager service, the number of firing and resolved alerts received for various periods, the rate of successful and failed notifications, and the resources consumption.
	OpenSearch	Provides information about the overall health status of the OpenSearch cluster, including the resources consumption, number of operations and their performance.
	OpenSearch Indices	Provides detailed information about the state of indices, including their size, the number and the size of segments.
	Grafana	Provides performance metrics for the Grafana service, including the total number of Grafana entities, CPU and memory consumption.
	PostgreSQL	Provides PostgreSQL statistics, including read (DQL) and write (DML) row operations, transaction and lock, replication lag and conflict, and checkpoint statistics, as well as PostgreSQL performance metrics.
	Prometheus	Provides the availability and performance behavior of the Prometheus servers, the sample ingestion rate, and system usage statistics per server. Also, provides statistics about the overall status and uptime of the Prometheus service, the chunks number of the local storage memory, target scrapes, and queries duration.
	Prometheus Relay	Provides service status and resources consumption metrics.
	Telemeter Server	Provides statistics and the overall health status of the Telemeter service. Note Due to the known issue, the Telemeter Client Status panel of the Telemeter Server dashboard does not display data for managed clusters of the 11.7.0, 11.7.4, 12.5.0, and 12.7.x series Cluster releases after update to Container Cloud 2.24.0.
System	System	Provides a detailed resource consumption and operating system information per Container Cloud cluster node.
Mirantis Kubernetes Engine (MKE)	MKE Cluster	Provides a global overview of an MKE cluster: statistics about the number of the worker and manager nodes, containers, images, Swarm services.
	MKE Containers	Provides per container resources consumption metrics for the MKE containers such as CPU, RAM, network.