runbooks:coustom_alerts:KubernetesNodeMemoryPressure
This alert is triggered when a Kubernetes node reports the MemoryPressure condition for more than 2 minutes. MemoryPressure indicates that the node is running low on available memory and may start evicting pods.
Memory pressure on a node can lead to:
This alert is critical because sustained memory pressure directly affects workload stability.
Check node memory status:
kubectl get nodes kubectl describe node <NODE_NAME>
Check node memory usage:
kubectl top node <NODE_NAME> free -m
List pods consuming high memory:
kubectl top pod --all-namespaces --sort-by=memory
Check recent pod evictions:
kubectl get events --sort-by=.lastTimestamp
If immediate relief is needed, drain the node:
kubectl drain <NODE_NAME> --ignore-daemonsets --delete-emptydir-data
After mitigation and stabilization:
kubectl uncordon <NODE_NAME>