[28352] Improved the Messages panel of the RabbitMQ Grafana dashboard to display absolute values instead of rates.
[28066] Fixed the issue with the Host API Status graph being unavailable in the Cinder Grafana dashboard.
[26450] Fixed the Apache meta for Telegraf to use the parameters from server.mods.status instead of apache:server:bind.
[28123] Fixed the issue with the absent() function causing the Ceph Grafana dashboards to malfunction if one of the Prometheus servers had no data for a particular period of time.
[27250] Added support for the containerd log format to fix the issue with the inability to parse the Kubernetes container logs.
[27142] Fixed the discrepancy in RAM usage data between the Horizon web UI and the Nova - utilization dashboard in Grafana.
[26918] Fixed the issue with the false negative http_response_status metric for the Aodh URL by adding support for the HTTP response code 200 for the Aodh checks in OpenStack version Pike and newer.
[27982] Fixed the issue with the Apache Grafana dashboard incorrectly displaying a high percentage (thousands of percent) in the CPU Load panel for the ctl nodes.
[27474] Removed the low-value ContrailFlow* alerts to prevent them from raising false positives.
[27342] Adjusted the NginxServiceDown alert by adding the for: 1m parameter to prevent false positive alerts for the NGINX service being down.
[27298] Fixed the issue with the inability to resolve the PacketsDroppedByCpuMajor alert in a time frame of less than 24 hours.
[26842] Updated the monitoring interval in Telegraf to 40 seconds for Ceph Jewel to prevent timeouts in Telegraf while gathering the data.
[24810] Improved the regular expression for the HDD metrics to prevent the generation of false positives for HDD errors.
[26116] Added the Fluentd label for Telegraf to fix the issues with processing the severity of the Telegraf logs.
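
As an illustration of item 27250, the sketch below parses a single line in the containerd (CRI) log layout, which is commonly written as a timestamp, a stream name, a partial/full tag, and the message. It is not the parser shipped with the product: the names CRI_LINE, CriRecord, and parse_cri_line are hypothetical, and the pattern assumes a single P or F tag.

```python
import re
from typing import NamedTuple, Optional

# Assumed CRI layout: "<RFC 3339 timestamp> <stdout|stderr> <P|F> <message>"
CRI_LINE = re.compile(
    r"^(?P<time>\S+) (?P<stream>stdout|stderr) (?P<tag>[PF]) (?P<message>.*)$"
)


class CriRecord(NamedTuple):
    """Hypothetical container for one parsed log line."""
    time: str
    stream: str
    partial: bool  # True when the runtime split a long line (tag "P")
    message: str


def parse_cri_line(line: str) -> Optional[CriRecord]:
    """Return the parsed record, or None if the line is not in the CRI layout."""
    match = CRI_LINE.match(line.rstrip("\n"))
    if match is None:
        return None
    return CriRecord(
        time=match["time"],
        stream=match["stream"],
        partial=match["tag"] == "P",
        message=match["message"],
    )
```

A line that does not follow this layout, for example a JSON-encoded log entry, yields None, which is the kind of mismatch that would leave container logs unparsed until the format is supported.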
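
To show why the for: 1m parameter in item 27342 suppresses false positives, the following sketch simulates how a Prometheus-style rule with a hold duration behaves: the alert fires only after its condition has been true continuously for at least that duration. The function name firing_times, the 15-second evaluation interval, and the sample data are assumptions for illustration; the actual NginxServiceDown expression is not shown in these notes.

```python
from typing import Iterable, List, Optional, Tuple


def firing_times(samples: Iterable[Tuple[int, bool]], for_seconds: int) -> List[int]:
    """Return the evaluation timestamps (in seconds) at which the alert fires.

    samples: (timestamp, condition_is_true) pairs in chronological order.
    for_seconds: how long the condition must hold before the alert fires.
    """
    firing: List[int] = []
    pending_since: Optional[int] = None
    for ts, condition_true in samples:
        if not condition_true:
            # Condition cleared: the pending timer resets.
            pending_since = None
            continue
        if pending_since is None:
            pending_since = ts
        if ts - pending_since >= for_seconds:
            firing.append(ts)
    return firing


# A single failed check (a blip) followed by recovery, evaluated every 15 s.
blip = [(0, True), (15, False), (30, False)]
# A sustained outage lasting more than one minute.
outage = [(0, True), (15, True), (30, True), (45, True), (60, True), (75, True)]

print(firing_times(blip, 0))     # without a hold duration the blip fires
print(firing_times(blip, 60))    # with a 60 s hold duration it does not
print(firing_times(outage, 60))  # a sustained outage still fires
```

The trade-off is a delay of up to the hold duration before a genuine outage is reported.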