Telegraf

This section lists the alerts for the Telegraf service.


TelegrafDockerSwarmGatherErrors

Severity

Major

Summary

Telegraf Docker Swarm failed to gather metrics.

Description

The Telegraf Docker Swarm Prometheus target contains gathering errors for the last 30 minutes.

TelegrafDockerSwarmTargetDown

Severity

Critical

Summary

Telegraf Docker Swarm Prometheus target is down.

Description

Prometheus fails to scrape metrics from the {{ $labels.pod }} Pod on the {{ $labels.node }} node.

TelegrafSMARTGatherErrors

Severity

Major

Summary

Telegraf SMART failed to gather metrics.

Description

The Telegraf SMART Prometheus target contains gathering errors for the last 10 minutes.

TelegrafSMARTTargetDown

Severity

Major

Summary

Telegraf SMART Prometheus target is down.

Description

Prometheus fails to scrape metrics from the Telegraf SMART endpoint on the {{ $labels.node }} node.

TelegrafSMARTTargetsOutage

Severity

Critical

Summary

Telegraf SMART Prometheus targets outage.

Description

Prometheus fails to scrape metrics from all Telegraf SMART endpoints.