-
Publication number: US11995466B1
Publication date: 2024-05-28
Application number: US17305145
Application date: 2021-06-30
Applicant: Amazon Technologies, Inc.
Inventor: Archana Srikanta, Onur Filiz, Prashant Prahlad, Amit Gupta, Song Hu
CPC classification number: G06F9/5005, G06F9/455, G06F9/45558, G06F9/48, G06F9/4806, G06F9/4843, G06F9/485, G06F9/4881, G06F9/50, G06F9/5011, G06F9/5022, G06F9/5027, G06F9/505, G06F9/5061, G06F9/5072, G06F9/5077, G06F9/5083, H04L67/63, G06F8/60
Abstract: The present application relates to performing a scale-down of the computing resources allocated to executing a software application. For example, the software application for implementing a web server may be packaged as a container image, and one or more instances of the container images may be executed as one or more tasks. The individual tasks may be allocated a set of computing resources such as CPU and memory, and the incoming requests sent to the web server may be distributed across the tasks. If the volume of incoming requests drops below a threshold level, one or more of the tasks may be placed in standby mode, and the amount of computing resources allocated to such tasks may be reduced. When the volume of incoming requests returns above the threshold level, the amount of computing resources allocated to such tasks can be scaled back up to the full amount.
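The threshold behavior described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. The names `Task`, `rebalance`, `STANDBY_FRACTION`, and `min_active` are hypothetical, as is the choice to keep 25% of the resources in standby. The abstract does not say what happens when the request volume equals the threshold exactly, so the sketch assumes full allocation in that case.

```python
from dataclasses import dataclass
from typing import List

# Assumption: a standby task keeps this fraction of its full allocation.
STANDBY_FRACTION = 0.25


@dataclass
class Task:
    """One running instance of the container image (hypothetical model)."""
    name: str
    full_cpu: float        # CPU units at full allocation
    full_memory_mb: int    # memory at full allocation
    standby: bool = False

    @property
    def cpu(self) -> float:
        # Reduced CPU while in standby, full CPU otherwise.
        return self.full_cpu * STANDBY_FRACTION if self.standby else self.full_cpu

    @property
    def memory_mb(self) -> int:
        # Reduced memory while in standby, full memory otherwise.
        if self.standby:
            return int(self.full_memory_mb * STANDBY_FRACTION)
        return self.full_memory_mb


def rebalance(tasks: List[Task], requests_per_second: float,
              threshold: float, min_active: int = 1) -> List[Task]:
    """Place tasks in standby below the threshold, restore them otherwise."""
    if requests_per_second < threshold:
        # Keep the first `min_active` tasks at full size; shrink the rest.
        for index, task in enumerate(tasks):
            task.standby = index >= min_active
    else:
        # Volume is back at or above the threshold: restore every task.
        for task in tasks:
            task.standby = False
    return tasks


if __name__ == "__main__":
    fleet = [Task("web-1", 1.0, 2048), Task("web-2", 1.0, 2048)]
    rebalance(fleet, requests_per_second=10, threshold=100)
    print([(t.name, t.standby, t.cpu, t.memory_mb) for t in fleet])
    rebalance(fleet, requests_per_second=200, threshold=100)
    print([(t.name, t.standby, t.cpu, t.memory_mb) for t in fleet])
```

Keeping at least one task at full size is a design choice made for this sketch so that incoming requests always have somewhere to go. The abstract only says that "one or more" tasks may be placed in standby.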
-
Publication number: US11989586B1
Publication date: 2024-05-21
Application number: US17305143
Application date: 2021-06-30
Applicant: Amazon Technologies, Inc.
Inventor: Archana Srikanta, Onur Filiz, Prashant Prahlad, Amit Gupta, Song Hu
CPC classification number: G06F9/5005, G06F8/60, G06F9/455, G06F9/45558, G06F9/48, G06F9/4806, G06F9/4843, G06F9/485, G06F9/4881, G06F9/50, G06F9/5011, G06F9/5022, G06F9/5027, G06F9/505, G06F9/5061, G06F9/5072, G06F9/5077, G06F9/5083, H04L67/02, H04L67/30
Abstract: The present application relates to performing a scale-up of the computing resources allocated to executing a software application. For example, the software application for implementing a web server may be packaged as a container image, and one or more instances of the container images may be executed as one or more tasks. The individual tasks may be allocated a set of computing resources such as CPU and memory, and the incoming requests sent to the web server may be distributed across the tasks. If the volume of incoming requests drops below a threshold level, one or more of the tasks may be placed in standby mode, and the amount of computing resources allocated to such tasks may be reduced. When the volume of incoming requests returns above the threshold level, the amount of computing resources allocated to such tasks can be scaled back up to the full amount.
-