IPMI

This section describes the IPMI alerts for hardware monitoring on bare metal hosts.


IPMICollectorsDown

Severity

Warning

Summary

IPMI {{ $labels.collector }} collector is failing.

Description

The {{ $labels.collector }} collector is failing on {{ $value }} target(s) in the {{ $labels.ipmi_namespace }}/{{ $labels.cluster_name }} cluster.
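
As an illustration of how the placeholders in the Summary and Description fields are filled in when an alert fires, the following is a minimal Python sketch. It is not the Prometheus template engine. The `render_annotation` helper and the label values (`bmc`, `example-ns`, `example-cluster`) are hypothetical and chosen only for this example.

```python
import re

# Matches placeholders such as {{ $labels.collector }} or {{ $value }}.
# This covers only the two placeholder forms used on this page.
_PLACEHOLDER = re.compile(r"\{\{\s*\$(labels\.(\w+)|value)\s*\}\}")


def render_annotation(template, labels, value):
    """Substitute alert labels and the alert value into an annotation template.

    Hypothetical helper for illustration; a missing label renders as an
    empty string in this sketch.
    """
    def substitute(match):
        if match.group(1) == "value":
            return str(value)
        return labels.get(match.group(2), "")

    return _PLACEHOLDER.sub(substitute, template)


description = (
    "The {{ $labels.collector }} collector is failing on {{ $value }} target(s) "
    "in the {{ $labels.ipmi_namespace }}/{{ $labels.cluster_name }} cluster."
)

# Example label values are assumptions, not taken from a real deployment.
example_labels = {
    "collector": "bmc",
    "ipmi_namespace": "example-ns",
    "cluster_name": "example-cluster",
}

rendered = render_annotation(description, example_labels, 2)
print(rendered)
```

With these example values, the rendered description reads: The bmc collector is failing on 2 target(s) in the example-ns/example-cluster cluster.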

IPMIExporterTargetsDown

Severity

Warning

Summary

IPMI exporter targets are down.

Description

{{ $value }} IPMI exporter target(s) are down in the {{ $labels.ipmi_namespace }}/{{ $labels.cluster_name }} cluster. This may indicate Prometheus cannot scrape the exporter or the exporter cannot probe IPMI targets.
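
The two possible causes named in this description can be told apart during triage. The sketch below assumes that the scrape result is available as the standard Prometheus `up` value and that the exporter reports probe success through a metric such as `ipmi_up`. Both the metric names and the `triage` helper are assumptions for illustration, since this section does not state the expression behind the alert.

```python
def triage(scrape_up, ipmi_up):
    """Classify a target from two sample values (hypothetical helper).

    scrape_up: assumed value of the Prometheus `up` series for the scrape
               (1 = scrape succeeded), or None if no sample exists.
    ipmi_up:   assumed value of an exporter-side probe metric such as
               `ipmi_up` (1 = probe succeeded), or None if no sample exists.
    """
    if scrape_up != 1:
        # Prometheus could not scrape the exporter at all.
        return "scrape_failed"
    if ipmi_up != 1:
        # The exporter answered, but could not probe the IPMI target.
        return "probe_failed"
    return "healthy"


# Example samples; the values are invented for this sketch.
print(triage(0, None))  # scrape_failed
print(triage(1, 0))     # probe_failed
print(triage(1, 1))     # healthy
```

A "scrape_failed" result points toward the exporter or the network path from Prometheus, while "probe_failed" points toward the BMC, its credentials, or the path between the exporter and the IPMI target.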

IPMIPowerSupplyCritical

Severity

Critical

Summary

{{ $labels.ipmi_namespace }}/{{ $labels.cluster_name }} {{ $labels.name }} sensor on {{ $labels.ipmi_host }} is in critical state.

Description

The {{ $labels.name }} sensor on the {{ $labels.ipmi_host }} host in the {{ $labels.ipmi_namespace }}/{{ $labels.cluster_name }} cluster is reporting a critical state.

IPMIPowerSupplyWarning

Severity

Warning

Summary

{{ $labels.ipmi_namespace }}/{{ $labels.cluster_name }} {{ $labels.name }} sensor on {{ $labels.ipmi_host }} is in warning state.

Description

The {{ $labels.name }} sensor on the {{ $labels.ipmi_host }} host in the {{ $labels.ipmi_namespace }}/{{ $labels.cluster_name }} cluster is reporting a warning state.