I have been working with GKE a lot for a while, I created the standard cluster and deployed my image and some dependencies as well. However, it was running fine for sometime and all of a sudden started throwing disk pressure error.
Any thoughts on this? Additionally, I saw image pull back off error.
First check in the logging if you can see the error message (combined from similar events): Failed to garbage collect the required amount of images. Attempted to free 9041880678 bytes, but only found 0 bytes eligible to free. if so, then this message comes from here,indicating that the Kubelet failed to remove the unused images. This could happen if any one of the following conditions is met:
The image must not currently be in use by an existing container running on the node
The image must not have been recently pulled (to avoid thrashing), etc.
Only images that are "eligible" to be removed are actually garbage collected and end up being deleted.
As mentioned in this document, the Node condition 'DiskPressure' implies that available disk space and inodes on either the node's root filesystem or image filesystem have met an eviction threshold.
You may follow the suggestion mentioned here.