-
公开(公告)号:US11669365B1
公开(公告)日:2023-06-06
申请号:US16551555
申请日:2019-08-26
Applicant: Amazon Technologies, Inc.
Inventor: Anirudh Balachandra Aithal , Onur Filiz
CPC classification number: G06F9/4881 , G06F9/5027 , G06F11/1448 , G06F2201/84
Abstract: A task pool may be created and maintained, in association with a container management service, which stores a snapshot of a one or more tasks, the performance of which is suspended or paused. Tasks may be defined as applications executable in one or more software containers instantiated on a virtual machine or compute instance, where the snapshot of a task may include an image of the one or more containers associated with the task container and state information of the one or more containers. A task may be initially provisioned in a paused state or paused after executing at least a portion of the task. Paused tasks in the task pool may be readily restarted or resumed by loading the snapshot onto a compute instance. The task pool may manage provisioning new tasks and pause tasks based on changes in demand.
-
2.
公开(公告)号:US11422844B1
公开(公告)日:2022-08-23
申请号:US16698655
申请日:2019-11-27
Applicant: Amazon Technologies, Inc.
Inventor: Onur Filiz , Micah William Hausler , Archana Srikanta , Sri Saran Balaji Vellore Rajakumar , Eswar Chander Balasubramanian , Anirudh Balachandra Aithal , Ryan Marchand
Abstract: The present application relates to providing a mechanism for specifying multiple network interfaces to be configured for on-demand compute capacity and for obtaining the credentials needed to configure the multiple network interfaces. For example, a container orchestration service may specify, in its code execution request to a serverless container management service, that multiple network interfaces are needed for the requested compute capacity. In response, the serverless container management service can identify the multiple network interfaces, determine the credential to be used to configure each of the multiple network interfaces, and attach the multiple network interfaces to the compute capacity acquired on behalf of the container orchestration service. During execution, the code running inside the compute capacity can utilize the multiple network interfaces as specified by the owner of the code.
-
公开(公告)号:US11989586B1
公开(公告)日:2024-05-21
申请号:US17305143
申请日:2021-06-30
Applicant: Amazon Technologies, Inc.
Inventor: Archana Srikanta , Onur Filiz , Prashant Prahlad , Amit Gupta , Song Hu
CPC classification number: G06F9/5005 , G06F8/60 , G06F9/455 , G06F9/45558 , G06F9/48 , G06F9/4806 , G06F9/4843 , G06F9/485 , G06F9/4881 , G06F9/50 , G06F9/5011 , G06F9/5022 , G06F9/5027 , G06F9/505 , G06F9/5061 , G06F9/5072 , G06F9/5077 , G06F9/5083 , H04L67/02 , H04L67/30
Abstract: The present application relates to performing a scale-up of the computing resources allocated to executing a software application. For example, the software application for implementing a web server may be packaged as a container image, and one or more instances of the container images may be executed as one or more tasks. The individual tasks may be allocated a set of computing resources such as CPU and memory, and the incoming requests sent to the web server may be distributed across the tasks. If the volume of incoming requests drops below a threshold level, one or more of the tasks may be placed in standby mode, and the amount of computing resources allocated to such tasks may be reduced. When the volume of incoming requests returns above the threshold level, the amount of computing resources allocated to such tasks can be scaled back up to the full amount.
-
公开(公告)号:US11995466B1
公开(公告)日:2024-05-28
申请号:US17305145
申请日:2021-06-30
Applicant: Amazon Technologies, Inc.
Inventor: Archana Srikanta , Onur Filiz , Prashant Prahlad , Amit Gupta , Song Hu
CPC classification number: G06F9/5005 , G06F9/455 , G06F9/45558 , G06F9/48 , G06F9/4806 , G06F9/4843 , G06F9/485 , G06F9/4881 , G06F9/50 , G06F9/5011 , G06F9/5022 , G06F9/5027 , G06F9/505 , G06F9/5061 , G06F9/5072 , G06F9/5077 , G06F9/5083 , H04L67/63 , G06F8/60
Abstract: The present application relates to performing a scale-down of the computing resources allocated to executing a software application. For example, the software application for implementing a web server may be packaged as a container image, and one or more instances of the container images may be executed as one or more tasks. The individual tasks may be allocated a set of computing resources such as CPU and memory, and the incoming requests sent to the web server may be distributed across the tasks. If the volume of incoming requests drops below a threshold level, one or more of the tasks may be placed in standby mode, and the amount of computing resources allocated to such tasks may be reduced. When the volume of incoming requests returns above the threshold level, the amount of computing resources allocated to such tasks can be scaled back up to the full amount.
-
-
-