-
公开(公告)号:US11900169B1
公开(公告)日:2024-02-13
申请号:US17230784
申请日:2021-04-14
Applicant: Amazon Technologies, Inc.
Inventor: Anand Dhandhania , Thomas Loockx
IPC: G06F9/46 , G06F9/50 , G06F16/901 , G06N20/00
CPC classification number: G06F9/505 , G06F16/9024 , G06N20/00
Abstract: Descriptors of machine learning tasks to be used to respond to analysis requests, indicating acceptable categories of runtime environments for the tasks and metrics to be collected from the tasks, are received via programmatic interfaces. In response to an analysis request, an orchestrator receives results from individual tasks as they become available, provides the results to other tasks, and causes a response to the request to be prepared using results from at least a subset of the tasks. Metrics collected from the tasks, and a visual representation of the tasks indicating their runtime environments are presented.
-
公开(公告)号:US11366660B1
公开(公告)日:2022-06-21
申请号:US16447882
申请日:2019-06-20
Applicant: Amazon Technologies, Inc.
Inventor: Anand Dhandhania
Abstract: An API latency estimation system estimates latencies as a function of subcomponent parameters. The system may obtain first information indicative of at least a characteristic of data of a request provided to an API and second information indicative of at least a utilization of a first subcomponent of the API used to fulfill a subtask of a task of the request. An estimated latency for the first subcomponent to fulfill the subtask is determined at least in part by applying a latency estimation model for the API to at least the first information and the second information. If a comparison of the estimated latency to a measured latency for the first subcomponent to perform the subtask indicates a potential anomaly, then an indication of the potential anomaly may be outputted. The model may be updated with API request fulfillment data that is not anomalous.
-
公开(公告)号:US12197958B2
公开(公告)日:2025-01-14
申请号:US18408405
申请日:2024-01-09
Applicant: Amazon Technologies, Inc.
Inventor: Anand Dhandhania , Thomas Loockx
IPC: G06F9/50 , G06F16/901 , G06N20/00
Abstract: Descriptors of machine learning tasks to be used to respond to analysis requests, indicating acceptable categories of runtime environments for the tasks and metrics to be collected from the tasks, are received via programmatic interfaces. In response to an analysis request, an orchestrator receives results from individual tasks as they become available, provides the results to other tasks, and causes a response to the request to be prepared using results from at least a subset of the tasks. Metrics collected from the tasks, and a visual representation of the tasks indicating their runtime environments are presented.
-
公开(公告)号:US11610143B1
公开(公告)日:2023-03-21
申请号:US16915744
申请日:2020-06-29
Applicant: Amazon Technologies, Inc.
Inventor: Vivek Bhadauria , Vasant Manohar , Anand Dhandhania
Abstract: A network-based service may provide a machine learning model for different clients. The network-based service may implement an interface that allows a client to identify a test data set for validating versions of the machine learning model specifically for the client. When a new version of the machine learning model is created, a validation test using the test data set identified by the client may be used. Results of the validation test may be used to make a decision regard whether to migrate workloads for the client to the new version of the machine learning model.
-
公开(公告)号:US20240152402A1
公开(公告)日:2024-05-09
申请号:US18408405
申请日:2024-01-09
Applicant: Amazon Technologies, Inc.
Inventor: Anand Dhandhania , Thomas Loockx
IPC: G06F9/50 , G06F16/901 , G06N20/00
CPC classification number: G06F9/505 , G06F16/9024 , G06N20/00
Abstract: Descriptors of machine learning tasks to be used to respond to analysis requests, indicating acceptable categories of runtime environments for the tasks and metrics to be collected from the tasks, are received via programmatic interfaces. In response to an analysis request, an orchestrator receives results from individual tasks as they become available, provides the results to other tasks, and causes a response to the request to be prepared using results from at least a subset of the tasks. Metrics collected from the tasks, and a visual representation of the tasks indicating their runtime environments are presented.
-
公开(公告)号:US10949661B2
公开(公告)日:2021-03-16
申请号:US16198040
申请日:2018-11-21
Applicant: Amazon Technologies, Inc.
Inventor: Rahul Bhotika , Shai Mazor , Amit Adam , Wendy Tse , Andrea Olgiati , Bhavesh Doshi , Gururaj Kosuru , Patrick Ian Wilson , Umar Farooq , Anand Dhandhania
IPC: G06K9/00
Abstract: Techniques for layout-agnostic complex document processing are described. A document processing service can analyze documents that do not adhere to defined layout rules in an automated manner to determine the content and meaning of a variety of types of segments within the documents. The service may chunk a document into multiple chunks, and operate upon the chunks in parallel by identifying segments within each chunk, classifying the segments into segment types, and processing the segments using special-purpose analysis engines adapted for the analysis of particular segment types to generate results that can be aggregated into an overall output for the entire document that captures the meaning and context of the document text.
-
7.
公开(公告)号:US11373119B1
公开(公告)日:2022-06-28
申请号:US16369884
申请日:2019-03-29
Applicant: Amazon Technologies, Inc.
Inventor: Bhavesh A. Doshi , Anand Dhandhania
Abstract: Techniques for a framework for building, orchestrating, and deploying complex, large-scale Machine Learning (ML) or deep learning (DL) inference applications is described. A ML application orchestration service is disclosed that enables the construction, orchestration, and deployment of complex ML inference applications in a provider network. The disclosed service provides customers with the ability to define machine learning (ML) models and define transformation operations on data before and/or after being provided to the ML models to construct a complex ML inference application. The service provides a framework for the orchestration (co-ordination) of the workflow logic (e.g., of the request and/or response flows) involved in building and deploying a complex ML inference application in the provider network.
-
-
-
-
-
-