Table of Contents

runbooks:coustom_alerts:HostUnusualDiskReadRate

HostUnusualDiskReadRate

Meaning

This alert is triggered when a host node experiences high disk read activity, with IO wait greater than 80% over a 5-minute window. It indicates that the disk may be a bottleneck or under heavy load.

Impact

High disk read rates can lead to:

This alert is warning, as prolonged high IO can degrade performance or trigger other alerts.

Diagnosis

Check disk IO statistics:

iostat -x 1 5
iotop -o

Check system-wide IO wait:

top
vmstat 1 5

Check disk usage and filesystem health:

df -h
lsblk
smartctl -a /dev/sdX

Check pods consuming disk on the node:

kubectl top pod --all-namespaces --field-selector spec.nodeName={{ $labels.instance }}

Possible Causes

Mitigation

  1. Identify and reduce disk-intensive workloads
  2. Move high IO workloads to other nodes or storage
  3. Monitor disk health and replace failing disks
  4. Tune filesystem or storage configuration if needed
  5. Scale out storage for critical workloads

Escalation