-
公开(公告)号:US10761893B1
公开(公告)日:2020-09-01
申请号:US16199014
申请日:2018-11-23
Applicant: Amazon Technologies, Inc.
Inventor: Vivek Bhadauria , Praveenkumar Udayakumar , Jonathan Andrew Hedley , Vasant Manohar , Andrea Olgiati , Rakesh Madhavan Nambiar , Gowtham Jeyabalan , Shubham Chandra Gupta , Palak Mehta
Abstract: Techniques are described for automatically scaling (or “auto scaling”) compute resources—for example, virtual machine (VM) instances, containers, or standalone servers—used to support execution of service-oriented software applications and other types of applications that may process heterogeneous workloads. The resource requirements for a software application can be approximated by measuring “worker pool” utilization of instances of each service, where a worker pool represents a number of requests that the service can process concurrently. A scaling service can thus be configured to scale the compute instances provisioned for a service in proportion to worker pool utilization, that is, compute instances can be added as the fleet's worker pools become more “busy,” while compute instances can be removed when worker pools become inactive.