-
1.
公开(公告)号:US20240333658A1
公开(公告)日:2024-10-03
申请号:US18642717
申请日:2024-04-22
Applicant: Amazon Technologies, Inc.
Inventor: Satya Naga Satis Kumar Gunuputi Alluri Venka , John Baker , Shahab Shekari , Kartik Natarajan , Ruhaab Markas , Ganesh Kumar Gella , Santosh Kumar Ameti
IPC: H04L47/762
CPC classification number: H04L47/762
Abstract: Based on analysis of a workload associated with a throttling key of a client request directed to a first service, a scale-out requirement of the throttling key is obtained at respective resource managers of a plurality of other services which are utilized by the first service to respond to client requests. The resource managers initiate, asynchronously with respect to one another, resource provisioning tasks at each of the other services to fulfill the scale-out requirement. A throttling limit associated with the throttling key is updated to a second throttling key after the resource provisioning tasks are completed by the resource managers, and the updated limit is used to determine whether to accept another client request associated with the throttling key.
-
公开(公告)号:US11997021B1
公开(公告)日:2024-05-28
申请号:US18193502
申请日:2023-03-30
Applicant: Amazon Technologies, Inc.
Inventor: Satya Naga Satis Kumar Gunuputi Alluri Venka , John Baker , Shahab Shekari , Kartik Natarajan , Ruhaab Markas , Ganesh Kumar Gella , Santosh Kumar Ameti
IPC: H04L47/762
CPC classification number: H04L47/762
Abstract: Based on analysis of a workload associated with a throttling key of a client request directed to a first service, a scale-out requirement of the throttling key is obtained at respective resource managers of a plurality of other services which are utilized by the first service to respond to client requests. The resource managers initiate, asynchronously with respect to one another, resource provisioning tasks at each of the other services to fulfill the scale-out requirement. A throttling limit associated with the throttling key is updated to a second throttling key after the resource provisioning tasks are completed by the resource managers, and the updated limit is used to determine whether to accept another client request associated with the throttling key.
-
公开(公告)号:US12175966B1
公开(公告)日:2024-12-24
申请号:US17361003
申请日:2021-06-28
Applicant: Amazon Technologies, Inc.
Inventor: Yi-An Lai , Yi Zhang , Roger Scott Jenke , Meghana Puvvadi , Shang-Wen Daniel Li , Peng Zhang , Jason P. Krone , Garima Lalwani , Niranjhana Nayar , Kartik Natarajan
Abstract: Techniques for updating a machine learning model based on user interactions are described. In particular, in some examples, user interactions with a chatbot provide aspects of a data set to be used to train or fine-tune a ML model. In some examples, this is accomplished by collecting data from a first plurality of interactions with a machine learning (ML) model; generating a variant of the ML model using the collected data by: filtering the collected data to create a first data set, training the ML model based on the first data set to generate an adapted ML model, and fine-tuning the adapted ML model on a second data set, different than the first data set to generate the variant of the ML model.
-
-