Telegraf

This section lists the alerts for the Telegraf service.


TelegrafGatherErrors

Severity

Major

Summary

{{ $labels.job }} failed to gather metrics.

Description

The {{ $labels.job }} Prometheus target has gathering errors for the last 10 minutes.

TelegrafDockerSwarmTargetDown

Severity

Critical

Summary

Telegraf Docker Swarm Prometheus target is down.

Description

Prometheus fails to scrape metrics from the {{ $labels.pod }} Pod on the {{ $labels.node }} node.

TelegrafOpenstackTargetDown

Severity

Critical

Summary

Telegraf OpenStack Prometheus target is down.

Description

Prometheus fails to scrape metrics from the Telegraf OpenStack service.

TelegrafSMARTTargetDown

Severity

Major

Summary

Telegraf SMART Prometheus target is down.

Description

Prometheus fails to scrape metrics from the Telegraf SMART endpoint on the {{ $labels.node }} node.

TelegrafSMARTTargetsOutage

Severity

Critical

Summary

Telegraf SMART Prometheus targets outage.

Description

Prometheus fails to scrape metrics from all Telegraf SMART endpoints.