Using the Grafana web UI, you can view the visual representation of the metric graphs based on the time series databases.
To view the Grafana dashboards:
Log in to the Grafana web UI as described in Access StackLight web UIs.
From the drop-down list, select the required dashboard to inspect the status and statistics of the corresponding service in your management or managed cluster:
Component |
Dashboard |
Description |
---|---|---|
Ceph cluster |
Ceph Cluster |
Provides the overall health status of the Ceph cluster, capacity, latency, and recovery metrics. |
Ceph Nodes |
Provides an overview of the host-related metrics, such as the number of Ceph Monitors, Ceph OSD hosts, average usage of resources across the cluster, network and hosts load. |
|
Ceph OSD |
Provides metrics for Ceph OSDs, including the Ceph OSD read and write latencies, distribution of PGs per Ceph OSD, Ceph OSDs and physical device performance. |
|
Ceph Pools |
Provides metrics for Ceph pools, including the client IOPS and throughput by pool and pools capacity usage. |
|
Ironic bare metal |
Ironic BM |
Provides graphs on Ironic health, HTTP API availability, provisioned
nodes by state and installed |
Container Cloud clusters |
Clusters Overview |
Represents the main cluster capacity statistics for all clusters of a Mirantis Container Cloud deployment where StackLight is installed. |
Kubernetes resources |
Kubernetes Calico |
Provides metrics of the entire Calico cluster usage, including the cluster status, host status, and Felix resources. |
Kubernetes Cluster |
Provides metrics for the entire Kubernetes cluster, including the cluster status, host status, and resources consumption. |
|
Kubernetes Deployments |
Provides information on the desired and current state of all service replicas deployed on a Container Cloud cluster. |
|
Kubernetes Namespaces |
Provides the pods state summary and the CPU, MEM, network, and IOPS resources consumption per name space. |
|
Kubernetes Nodes |
Provides charts showing resources consumption per Container Cloud cluster node. |
|
Kubernetes Pods |
Provides charts showing resources consumption per deployed pod. |
|
NGINX |
NGINX |
Provides the overall status of the NGINX cluster and information about NGINX requests and connections. |
StackLight |
Alertmanager |
Provides performance metrics on the overall health status of the Prometheus Alertmanager service, the number of firing and resolved alerts received for various periods, the rate of successful and failed notifications, and the resources consumption. |
Elasticsearch |
Provides information about the overall health status of the Elasticsearch cluster, including the resources consumption and the state of the shards. |
|
Grafana |
Provides performance metrics for the Grafana service, including the total number of Grafana entities, CPU and memory consumption. |
|
PostgreSQL |
Provides PostgreSQL statistics, including read (DQL) and write (DML) row operations, transaction and lock, replication lag and conflict, and checkpoint statistics, as well as PostgreSQL performance metrics. |
|
Prometheus |
Provides the availability and performance behavior of the Prometheus servers, the sample ingestion rate, and system usage statistics per server. Also, provides statistics about the overall status and uptime of the Prometheus service, the chunks number of the local storage memory, target scrapes, and queries duration. |
|
Pushgateway |
Provides performance metrics and the overall health status of the service, the rate of samples received for various periods, and the resources consumption. |
|
Prometheus Relay |
Provides service status and resources consumption metrics. |
|
Telemeter Server |
Provides statistics and the overall health status of the Telemeter service. |
|
System |
System |
Provides a detailed resource consumption and operating system information per Container Cloud cluster node. |
Mirantis Kubernetes Engine (MKE) |
MKE Cluster |
Provides a global overview of an MKE cluster: statistics about the number of the worker and manager nodes, containers, images, Swarm services. |
MKE Containers |
Provides per container resources consumption metrics for the MKE containers such as CPU, RAM, network. |