- Patent title: Automatically scaling compute resources for heterogeneous workloads
- Application number: US16199014
- Filing date: 2018-11-23
- Publication number: US10761893B1
- Publication date: 2020-09-01
- Inventors: Vivek Bhadauria, Praveenkumar Udayakumar, Jonathan Andrew Hedley, Vasant Manohar, Andrea Olgiati, Rakesh Madhavan Nambiar, Gowtham Jeyabalan, Shubham Chandra Gupta, Palak Mehta
- Applicant: Amazon Technologies, Inc.
- Applicant address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current assignee: Amazon Technologies, Inc.
- Current assignee address: US WA Seattle
- Agency: Nicholson De Vos Webster & Elliott LLP
- Main classification: G06F9/46
- IPC classification: G06F9/46; G06F9/50
Abstract:
Techniques are described for automatically scaling (or “auto scaling”) compute resources—for example, virtual machine (VM) instances, containers, or standalone servers—used to support execution of service-oriented software applications and other types of applications that may process heterogeneous workloads. The resource requirements for a software application can be approximated by measuring “worker pool” utilization of instances of each service, where a worker pool represents a number of requests that the service can process concurrently. A scaling service can thus be configured to scale the compute instances provisioned for a service in proportion to worker pool utilization, that is, compute instances can be added as the fleet's worker pools become more “busy,” while compute instances can be removed when worker pools become inactive.
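The proportional scaling idea in the abstract can be illustrated with a minimal Python sketch. It is not taken from the patent: the names `InstanceStats`, `fleet_utilization`, and `desired_instance_count`, the 50% target utilization, and the instance bounds are all hypothetical choices made for illustration. The sketch assumes each instance reports how many of its workers are busy and how large its worker pool is, and it uses a simple target-tracking rule (resize the fleet so that utilization would return to the target) as one possible reading of "in proportion to worker pool utilization."

```python
import math
from dataclasses import dataclass


@dataclass
class InstanceStats:
    """Hypothetical per-instance report: busy workers out of a fixed pool."""
    busy_workers: int
    pool_size: int


def fleet_utilization(fleet):
    """Fraction of all workers in the fleet that are currently busy."""
    total_workers = sum(i.pool_size for i in fleet)
    if total_workers == 0:
        return 0.0
    return sum(i.busy_workers for i in fleet) / total_workers


def desired_instance_count(fleet, target_utilization=0.5,
                           min_instances=1, max_instances=100):
    """Assumed rule: scale the instance count in proportion to utilization.

    If the fleet is busier than the target, the count grows; if it is
    idler, the count shrinks. The result is clamped to the given bounds.
    """
    if not 0 < target_utilization <= 1:
        raise ValueError("target_utilization must be in (0, 1]")
    utilization = fleet_utilization(fleet)
    desired = math.ceil(len(fleet) * utilization / target_utilization)
    return max(min_instances, min(max_instances, desired))


if __name__ == "__main__":
    # Four instances, each with a pool of 10 workers, 8 of them busy.
    busy_fleet = [InstanceStats(busy_workers=8, pool_size=10) for _ in range(4)]
    print(fleet_utilization(busy_fleet))
    print(desired_instance_count(busy_fleet))
```

Under these assumptions, a fleet whose worker pools are mostly busy yields a larger desired count, while a fleet with inactive worker pools shrinks toward `min_instances`. A production scaling service would likely also smooth the metric over time and apply cooldowns, which this sketch omits.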