LinkedIn respects your privacy

LinkedIn and 3rd parties use essential and non-essential cookies to provide, secure, analyze and improve our Services, and to show you relevant ads (including professional and job ads) on and off LinkedIn. Learn more in our Cookie Policy.

Select Accept to consent or Reject to decline non-essential cookies for this use. You can update your choices at any time in your settings.

Join now Sign in

From the course: Google Cloud Associate Cloud Engineer Cert Prep

Unlock this course with a free trial

Join today to access over 24,900 courses taught by industry experts.

Setting scaling parameters for autoscaling instances

Setting scaling parameters for autoscaling instances - Google Cloud Tutorial

From the course: Google Cloud Associate Cloud Engineer Cert Prep

Start my 1-month free trial Buy for my team

Setting scaling parameters for autoscaling instances

“

- Setting scaling parameters for autoscaling instances. So, to deploy an application with autoscaling in Cloud Run, it is super simple. In a Cloud Run console, when you are creating an application, you can determine in the autoscaling part of the menu here on the bottom right corner, you can specify the minimum number of instances and the maximum number of instances. This will be number of containers effectively, and you can choose to have CPU only allocated during request processing, which will probably be cheaper, but it may create the problem of a cold start. Or you can choose to have CPU always allocated. The number of instances is affected by, first of all, the rate of incoming requests or events, the CPU utilization of existing instances over a one-minute window with a target of 60% CPU utilization. So that's the threshold or the target that the Google Cloud platform uses to determine when to scale Cloud Run services. And the number of instances is also affected by the current…

Contents