From the course: Google Cloud Associate Cloud Engineer Cert Prep

Setting scaling parameters for autoscaling instances

- Setting scaling parameters for autoscaling instances. Deploying an application with autoscaling in Cloud Run is super simple. In the Cloud Run console, when you are creating an application, the autoscaling part of the menu, here in the bottom right corner, lets you specify the minimum number of instances and the maximum number of instances. This is effectively the number of containers. You can choose to have CPU allocated only during request processing, which will probably be cheaper but may create the problem of a cold start, or you can choose to have CPU always allocated. The number of instances is affected by, first of all, the rate of incoming requests or events. It is also affected by the CPU utilization of existing instances over a one-minute window, with a target of 60% CPU utilization. So that's the target that Google Cloud uses to determine when to scale Cloud Run services. And the number of instances is also affected by the current…
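
To make the idea of a CPU utilization target concrete, here is a minimal Python sketch of a generic proportional scaling rule, clamped to the minimum and maximum instance counts described above. It is an illustration only: Cloud Run's actual autoscaling algorithm is not shown in this transcript and also weighs other signals, such as the request rate. The function name `desired_instances` and its parameters are hypothetical.

```python
import math


def desired_instances(current_instances, avg_cpu_percent,
                      min_instances, max_instances, target_cpu_percent=60):
    """Estimate an instance count that moves average CPU toward the target.

    Hypothetical helper for illustration; NOT Cloud Run's real algorithm.
    avg_cpu_percent is assumed to be the average CPU utilization of the
    existing instances over the observation window (for example, one minute).
    """
    if min_instances > max_instances:
        raise ValueError("min_instances must not exceed max_instances")
    if target_cpu_percent <= 0:
        raise ValueError("target_cpu_percent must be positive")

    # Proportional rule: if instances run hotter than the target, add
    # instances in the same ratio; if they run cooler, remove some.
    raw = math.ceil(current_instances * avg_cpu_percent / target_cpu_percent)

    # Never go below the configured minimum or above the configured maximum.
    return max(min_instances, min(max_instances, raw))


if __name__ == "__main__":
    # 4 instances at 90% average CPU against a 60% target, limits 1 to 10.
    print(desired_instances(4, 90, min_instances=1, max_instances=10))
```

Under this simplified rule, four instances averaging 90% CPU would grow to six, and the result is always held between the minimum and maximum you set in the console.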
