Heka

Heka

This section describes the alerts for the Heka service.

HekaOutputQueueStalled

Severity Warning
Summary The {{ $labels.queue }} queue is stalled on node {{$labels.host }} for more than 1 hour. The corresponding Heka service is either down or stuck.
Raise condition heka_output_queue_size > 134217728
Description Raises when Heka freezes and the output queue is larger than 134217728 (128 MB). The host label in the raised alert contains the name of the affected node.
Troubleshooting Restart the corresponding Heka log collector using service log_collector restart.
Tuning Not required