I'm facing scaling issues in gke autopilot cluster. Getting error:
node scale up failed:pod is at risk of not being scheduled
I am facing this issue only with autopilot cluster
Try to do basic troubleshooting steps but issue not at sort out
As per this official doc, the mentioned issue occurs when serial port logging is disabled in your Google Cloud project. GKE Autopilot clusters require serial port logging to effectively debug node issues. If serial port logging is disabled, Autopilot can't provision nodes to run your workloads.
Serial port logging might be disabled at the organization level through an organization policy that enforces the compute.disableSerialPortLogging constraint. Serial port logging could also be disabled at the project or virtual machine (VM) instance level.
To resolve this issue do the following:
- Ask your Google Cloud organization policy administrator to remove the compute.disableSerialPortLogging constraint in the project with your Autopilot cluster.
- If you don't have an organization policy that enforces this constraint, try to enable serial port logging in your project metadata. This action requires the compute.projects.setCommonInstanceMetadata IAM permission.
Refer to this doc for more information about Troubleshooting Autopilot clusters