-
Publication No.: US11886422B1
Publication date: 2024-01-30
Application No.: US17216347
Filing date: 2021-03-29
Applicant: Amazon Technologies, Inc.
Inventor: Bohou Li , Vijayan Prabhakaran , Mehul A. Shah , Benjamin Sowell , Douglas Brian Terry
IPC: G06F16/23 , G06F16/245 , G06F16/21 , G06F16/22
CPC classification number: G06F16/2379 , G06F16/219 , G06F16/2282 , G06F16/2343 , G06F16/245
Abstract: A protocol for implementing ACID transactions that provides snapshot isolation in a distributed setting that does not require synchronized clocks is described. The protocol ensures at commit time that transactions touching common objects do not commit out of order. The protocol can be used in the context of a distributed data lake built on an object store in which clients can transactionally add or remove objects from logical tables.
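The commit-time ordering check described in this abstract can be illustrated with a minimal sketch. It is not the patented protocol: it assumes a single in-process coordinator, and the names `TableLog`, `ConflictError`, `begin`, and `commit` are hypothetical. A logical counter replaces wall-clock timestamps, and a transaction is rejected if any object it touches was committed after the transaction's snapshot was taken.

```python
import threading


class ConflictError(Exception):
    """Raised when a commit would land out of order on a shared object."""


class TableLog:
    """Toy single-process commit log; a logical counter stands in for clocks."""

    def __init__(self):
        self._lock = threading.Lock()
        self._version = 0        # logical commit counter (hypothetical design)
        self._last_commit = {}   # object key -> version that last touched it
        self._live = set()       # object keys currently in the logical table

    def begin(self):
        # The snapshot is simply the counter value at transaction start.
        with self._lock:
            return {"start": self._version, "add": set(), "remove": set()}

    def commit(self, txn):
        touched = txn["add"] | txn["remove"]
        with self._lock:
            # Validate: no touched object may have been committed
            # after this transaction's snapshot.
            for key in touched:
                if self._last_commit.get(key, 0) > txn["start"]:
                    raise ConflictError(key)
            self._version += 1
            for key in touched:
                self._last_commit[key] = self._version
            self._live |= txn["add"]
            self._live -= txn["remove"]
            return self._version

    def objects(self):
        with self._lock:
            return set(self._live)
```

Two transactions that start from the same snapshot and touch the same object cannot both commit under this check; the second raises `ConflictError`, while transactions on disjoint objects proceed independently.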
-
Publication No.: US20230362265A1
Publication date: 2023-11-09
Application No.: US18310461
Filing date: 2023-05-01
Applicant: Amazon Technologies, Inc.
Inventor: George Steven McPherson , Mehul A. Shah , Supratik Chakraborty , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta
IPC: H04L67/01 , H04L67/10 , G06F9/50 , G06F9/4401 , H04L67/00
CPC classification number: H04L67/34 , G06F9/4411 , G06F9/5005 , G06F9/5072 , H04L67/01 , H04L67/10
Abstract: Code may be dynamically routed to computing resources for execution. Code may be received for execution on behalf of a client. Execution criteria for the code may be determined and computing resources that satisfy the execution criteria may be identified. The identified computing resources may then be procured for executing the code and then the code may be routed to the procured computing resources for execution. Permissions or authorization to execute the code may be shared to ensure that computing resources executing the code have the same permissions or authorization when executing the code.
-
Publication No.: US11036560B1
Publication date: 2021-06-15
Application No.: US15385781
Filing date: 2016-12-20
Applicant: Amazon Technologies, Inc.
Inventor: George Steven McPherson , Mehul A. Shah , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta , Sandhya Edupuganti
Abstract: Isolation types may be determined for resources that execute portions of code. Code may be received via a network-based interface from a client for execution. An execution plan for the code may be generated and evaluated to determine one or more isolation types for computing resources that execute the code. The computing resources that are configured to provide the determined isolation types may then be identified and execution of the code initiated at the identified computing resources.
-
Publication No.: US20200242135A1
Publication date: 2020-07-30
Application No.: US16846141
Filing date: 2020-04-10
Applicant: Amazon Technologies, Inc.
Inventor: Mehul A. Shah , George Steven McPherson , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta
Abstract: Recognizing unknown data objects may be implemented for data objects stored in a data store. Data objects that are identified as unknown may be accessed to retrieve a portion of the data object. Different representations of the data object may be generated for recognizing different data schemas. An analysis of the representations may be performed to identify a data schema for the unknown data object. The data schema may be stored in a metadata store for the unknown data object.
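A rough illustration of the idea in this abstract, trying several representations of a sampled portion of an object until one yields a schema, might look like the following. It is only a sketch under simplifying assumptions: the sample is already decoded text, only JSON-lines and CSV are attempted, and the function names `as_json`, `as_csv`, and `recognize` are hypothetical.

```python
import csv
import io
import json


def as_json(sample):
    """Interpret the sample as JSON lines; return field names or None."""
    try:
        rows = [json.loads(line) for line in sample.splitlines() if line.strip()]
    except json.JSONDecodeError:
        return None
    if rows and all(isinstance(r, dict) for r in rows):
        return sorted({key for r in rows for key in r})
    return None


def as_csv(sample):
    """Interpret the sample as CSV with a header row; return columns or None."""
    rows = [r for r in csv.reader(io.StringIO(sample)) if r]
    consistent = all(len(r) == len(rows[0]) for r in rows) if rows else False
    if len(rows) >= 2 and len(rows[0]) > 1 and consistent:
        return rows[0]
    return None


def recognize(sample):
    """Try each representation in turn; return the first schema that fits."""
    for fmt, parse in (("json", as_json), ("csv", as_csv)):
        fields = parse(sample)
        if fields is not None:
            return {"format": fmt, "fields": fields}
    return None
```

The returned dictionary could then be written to whatever metadata store holds the object's catalog entry; objects that match no representation stay marked as unknown.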
-
Publication No.: US12118395B1
Publication date: 2024-10-15
Application No.: US17116405
Filing date: 2020-12-09
Applicant: Amazon Technologies, Inc.
Inventor: Anthony A. Virtuoso , Santosh Chandrachood , Bijay Singh Bisht , Mehul A. Shah
CPC classification number: G06F9/5027 , G06F9/4881 , G06F9/5072 , G06F9/5077
Abstract: Techniques for self-tuning an analytics system via observed execution optimization are described. Upon a need for execution resources, a resource manager can select a type of executor from multiple candidate executor types based at least in part on one or more of current execution data associated with the execution of tasks of a user application and/or historic execution data associated with one or more other applications. The current execution data may include event log data originated by the driver application based on the execution of the user application and/or metric data describing characteristics of one or more worker nodes involved with executing the user application or characteristics of one or more other executors implemented by the one or more worker nodes in executing the user application.
-
Publication No.: US11695840B2
Publication date: 2023-07-04
Application No.: US17694585
Filing date: 2022-03-14
Applicant: Amazon Technologies, Inc.
Inventor: George Steven McPherson , Mehul A. Shah , Supratik Chakraborty , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta
IPC: H04L67/00 , H04L67/10 , G06F9/50 , G06F9/4401 , H04L67/01
CPC classification number: H04L67/34 , G06F9/4411 , G06F9/5005 , G06F9/5072 , H04L67/01 , H04L67/10
Abstract: Code may be dynamically routed to computing resources for execution. Code may be received for execution on behalf of a client. Execution criteria for the code may be determined and computing resources that satisfy the execution criteria may be identified. The identified computing resources may then be procured for executing the code and then the code may be routed to the procured computing resources for execution. Permissions or authorization to execute the code may be shared to ensure that computing resources executing the code have the same permissions or authorization when executing the code.
-
Publication No.: US11423041B2
Publication date: 2022-08-23
Application No.: US16752022
Filing date: 2020-01-24
Applicant: Amazon Technologies, Inc.
Inventor: George Steven McPherson , Mehul A. Shah , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta
IPC: G06F17/00 , G06F16/25 , G06F16/23 , G06F16/2455
Abstract: History for data objects may be maintained to detect data events. An indication of an Extract, Transform, Load (ETL) process applied to one or more source data objects to generate one or more transformed data objects may be received. History for the source data objects may be updated to include the transformed data objects and the ETL process that generated the transformed data objects. An evaluation of the update may be performed to determine whether an event associated with the data lineage is triggered. If the event is triggered, a notification of the event may be sent to one or more subscribers for the event.
-
Publication No.: US20220100774A1
Publication date: 2022-03-31
Application No.: US17491985
Filing date: 2021-10-01
Applicant: Amazon Technologies, Inc.
Inventor: Mehul A. Shah , George Steven McPherson , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta , Benjamin Albert Sowell , Bohou Li
Abstract: Data transformation workflows may be generated to transform data objects. A source data schema for a data object and a target data format or target data schema for a data object may be identified. A comparison of the source data schema and the target data format or schema may be made to determine what transformations can be performed to transform the data object into the target data format or schema. Code to execute the transformation operations may then be generated. The code may be stored for subsequent modification or execution.
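The compare-then-generate flow in this abstract can be sketched as follows. This is an illustrative toy, not the claimed method: schemas are assumed to be plain `{column: type}` dictionaries whose type names are Python builtins (`"int"`, `"float"`, `"str"`), and `plan_transformations` and `generate_code` are hypothetical names.

```python
def plan_transformations(source, target):
    """Compare two {column: type} schemas and list the steps needed."""
    steps = []
    for col, typ in target.items():
        if col not in source:
            steps.append(("add_null", col, typ))   # column missing in source
        elif source[col] != typ:
            steps.append(("cast", col, typ))       # type differs
    for col in source:
        if col not in target:
            steps.append(("drop", col, None))      # column absent from target
    return steps


def generate_code(steps):
    """Render the plan as Python source for a row-level transform function."""
    lines = ["def transform(row):", "    out = dict(row)"]
    for op, col, typ in steps:
        if op == "cast":
            lines.append(f"    out[{col!r}] = {typ}(out[{col!r}])")
        elif op == "add_null":
            lines.append(f"    out[{col!r}] = None")
        elif op == "drop":
            lines.append(f"    out.pop({col!r}, None)")
    lines.append("    return out")
    return "\n".join(lines)
```

Because `generate_code` returns source text rather than a callable, the result can be saved, edited by hand, and executed later, which mirrors the abstract's point that generated code may be stored for subsequent modification or execution.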
-
Publication No.: US10545979B2
Publication date: 2020-01-28
Application No.: US15385789
Filing date: 2016-12-20
Applicant: Amazon Technologies, Inc.
Inventor: George Steven McPherson , Mehul A. Shah , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta
IPC: G06F16/25 , G06F16/23 , G06F16/2455
Abstract: History for data objects may be maintained to detect data events. An indication of an Extract, Transform, Load (ETL) process applied to one or more source data objects to generate one or more transformed data objects may be received. History for the source data objects may be updated to include the transformed data objects and the ETL process that generated the transformed data objects. An evaluation of the update may be performed to determine whether an event associated with the data lineage is triggered. If the event is triggered, a notification of the event may be sent to one or more subscribers for the event.
-
Publication No.: US11941016B2
Publication date: 2024-03-26
Application No.: US17687492
Filing date: 2022-03-04
Applicant: Amazon Technologies, Inc.
Inventor: Timothy Jones , Andrew Borthwick , Sergei Dobroshinsky , Shehzad Qureshi , Stephen Michael Ash , Pedrito Uriah Maynard-Zhang , Chethan Kommaranahalli Rudramuni , Abhishek Sharma , Juliana Saussy , Adam Lawrence Joseph Heinermann , Alaykumar Navinchandra Desai , Mehul A. Shah , Mehul Y. Shah , Anurag Windlass Gupta , Prajakta Datta Damle
CPC classification number: G06F16/254 , G06F9/543 , G06N20/00
Abstract: Specified performance attributes may be used to configure machine learning transformations for ETL jobs. Performance attributes for a machine learning pipeline that applies a model as part of a transformation for an ETL job may be used to configure a parameter in a stage of the machine learning pipeline. The configured stage may then be used when training the model. The trained machine learning pipeline may then be applied as part of a transformation operation included in an ETL job performed by the ETL system.