runbooks:coustom_alerts:KubernetesNodeDiskPressure
This alert is triggered when a Kubernetes node reports the DiskPressure condition for more than 2 minutes. DiskPressure indicates that the node is running low on available disk space, and Kubernetes may evict pods to free space.
Disk pressure on a node can cause:
This alert is critical, as sustained disk pressure can affect cluster stability and production workloads.
Check node status:
kubectl get nodes kubectl describe node <NODE_NAME>
Check disk usage:
df -h du -sh /var/lib/kubelet/*
Check pods consuming disk space:
kubectl get pvc --all-namespaces kubectl describe pod <POD_NAME> -n <NAMESPACE>
Check recent events:
kubectl get events --sort-by=.lastTimestamp
Drain node if immediate relief is needed:
kubectl drain <NODE_NAME> --ignore-daemonsets --delete-emptydir-data
After mitigation:
kubectl uncordon <NODE_NAME>