Search code examples
google-cloud-platforminstancegoogle-cloud-runstress-testing

Min Instance Google Cloud Run


What if I set my minimum instance to 1, 4, 10. What is the difference between minimum instances 1, 4, and 10? I'm curious about this one, because when I stress testing it. I didn't see any difference from this?

Is it only effect only in first-time HTTP Request?


Solution

  • This seems to be good explained in documentation:

    By default, Cloud Run scales in to the number of instances based on the number of incoming requests. However, if your service requires reduced latency and you want to limit the number of cold starts, you can change this default behavior by specifying a minimum number of container instances to be kept warm and ready to serve requests.

    So, this is important to avoid cold starts when new instance is created. The maximum performance tested in stress testing should be on the same level.