google-cloud-platform google-compute-engine google-cloud-sdk

Starting a GPU instance on Google Cloud Compute


I am trying to create a GPU instance on Google Compute Engine like this:

gcloud compute instances create another-ubuntu-instance \
 --maintenance-policy TERMINATE --restart-on-failure \
 --image-project=ubuntu-os-cloud \
 --image-family=ubuntu-2004-lts --machine-type=a2-highgpu-1g --zone europe-west4-b

but I get an error message:

ERROR: (gcloud.compute.instances.create) Could not fetch resource:
 - Quota 'NVIDIA_A100_GPUS' exceeded.  Limit: 0.0 in region europe-west4.

even though I have a quota (I think):

[Image: screenshot of the poster's quota]

So, what am I doing wrong?


Solution

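  • The error refers to the regional quota NVIDIA_A100_GPUS in europe-west4, so that is the value to check. You can list the quotas of a particular region for the active project with this command: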

    gcloud compute regions describe europe-west4 | grep -C 1 A100
    

    To read more about GPU quotas, see the Compute Engine quota documentation.
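
If you want the numbers on their own instead of grep context lines, something like the helper below might work. It is only a sketch: `quota_for_metric` is a hypothetical name, not part of gcloud, and it assumes the default YAML output of the describe command lists each quota entry as a `- limit:` line followed by `metric:` and `usage:` lines.

```shell
# Hypothetical helper (not part of gcloud): prints "<limit> <usage>" for one
# quota metric. It reads the YAML printed by
# `gcloud compute regions describe REGION` on standard input and assumes each
# quota entry is laid out as "- limit:", "metric:", "usage:" lines, in that order.
quota_for_metric() {
  awk -v metric="$1" '
    # A new quota entry starts with "- limit: N"; remember N.
    $1 == "-" && $2 == "limit:" { limit = $3 }
    # Note whether this entry is the metric we were asked about.
    $1 == "metric:"             { found = ($2 == metric) }
    # The usage line closes the entry; print only for the matching metric.
    $1 == "usage:" && found     { print limit, $2; found = 0 }
  '
}
```

Usage would be `gcloud compute regions describe europe-west4 | quota_for_metric NVIDIA_A100_GPUS`. If the first number printed is 0.0, the regional limit really is zero, which matches the error message in the question.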