-
公开(公告)号:US20220279049A1
公开(公告)日:2022-09-01
申请号:US17694585
申请日:2022-03-14
Applicant: Amazon Technologies, Inc.
Inventor: George Steven McPherson , Mehul A. Shah , Supratik Chakraborty , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta
IPC: H04L67/00 , H04L67/10 , G06F9/4401 , G06F9/50 , H04L67/01
Abstract: Code may be dynamically routed to computing resources for execution. Code may be received for execution on behalf of a client. Execution criteria for the code may be determined and computing resources that satisfy the execution criteria may be identified. The identified computing resources may then be procured for executing the code and then the code may be routed to the procured computing resources for execution. Permissions or authorization to execute the code may be shared to ensure that computing resources executing the code have the same permissions or authorization when executing the code.
-
公开(公告)号:US12056516B2
公开(公告)日:2024-08-06
申请号:US17491296
申请日:2021-09-30
Applicant: Amazon Technologies, Inc.
Inventor: Anthony A. Virtuoso , Eric Mills , Bijay Singh Bisht , Santosh Chandrachood , Mehul A. Shah , Wesley Blumenthal , Linchi Zhang , Rahul Pathak , Matt Bachman
CPC classification number: G06F9/45558 , G06F2009/4557 , G06F2009/45575 , G06F2009/45595
Abstract: Techniques for a distributed data processing application service in a cloud provider network are described. A virtual machine bundle is obtained, the virtual machine bundle including a copy of a memory and one or more registers of a paused virtual machine executing a distributed data processing application runtime. A request to launch a virtual machine to execute a distributed data processing application runtime is received, the request to launch the virtual machine including a bundle identifier that identifies the virtual machine bundle. The virtual machine is launched based at least in part on the virtual machine bundle. The virtual machine sends a message to register the virtual machine with a cluster of virtual machines.
-
公开(公告)号:US11709809B1
公开(公告)日:2023-07-25
申请号:US17216359
申请日:2021-03-29
Applicant: Amazon Technologies, Inc.
Inventor: Bohou Li , Vijayan Prabhakaran , Mehul A. Shah , Benjamin Sowell , Douglas Brian Terry
CPC classification number: G06F16/219 , G06F16/2246
Abstract: Techniques for using tree data structures to maintain a transactionally consistent set with support for time-travel queries are described. When a transaction commits, a new version of the tree data structure is created using a copy-on-write based method such that the tree shares internal nodes with previous trees to save space. This approach may be used in the implementation of a transactional data catalog in which the files that make up a table are stored in a transactional set.
-
公开(公告)号:US11704331B2
公开(公告)日:2023-07-18
申请号:US16926537
申请日:2020-07-10
Applicant: Amazon Technologies, Inc.
Inventor: Andrew Edward Caldwell , Anurag Windlass Gupta , Mehul A. Shah , Prajakta Datta Damle , George Steven McPherson
IPC: G06F16/00 , G06F16/25 , G06F16/28 , G06F16/951 , G06F16/23
CPC classification number: G06F16/254 , G06F16/2358 , G06F16/283 , G06F16/951
Abstract: Dynamic generation of data catalogs may be implemented for accessing data sets in different storage locations. Data sets may be accessed in order to extract portions of data. Structure recognition techniques may be applied to the extracted data in order to determine structural information for the data sets. The structural information may then be stored as part of a data catalog for the data sets. Requests to access the data catalog from different clients may be received and the requested structural data supplied so that the clients may access different data sets utilizing the supplied structural data. Data catalogs may be updated as changes to data sets are made.
-
公开(公告)号:US20230169086A1
公开(公告)日:2023-06-01
申请号:US18048645
申请日:2022-10-21
Applicant: Amazon Technologies, Inc.
Inventor: George Steven McPherson , Mehul A. Shah , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta
CPC classification number: G06F16/254 , G06F9/542
Abstract: Extract, Transform, Load (ETL) processing may be initiated by detected events. A trigger event may be associated with an ETL process apply one or more transformations to a source data object. The trigger event may be detected for the ETL process and evaluated with respect to one or more execution conditions for the ETL process. If the execution conditions for the ETL process are satisfied, then the ETL process may be executed. At least some of the source data object may be obtained, the one or more transformations of the ETL process may be applied, and one or more transformed data objects may be stored.
-
公开(公告)号:US11277494B1
公开(公告)日:2022-03-15
申请号:US15385784
申请日:2016-12-20
Applicant: Amazon Technologies, Inc.
Inventor: George Steven McPherson , Mehul A. Shah , Supratik Chakraborty , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta
Abstract: Code may be dynamically routed to computing resources for execution. Code may be received for execution on behalf of a client. Execution criteria for the code may be determined and computing resources that satisfy the execution criteria may be identified. The identified computing resources may then be procured for executing the code and then the code may be routed to the procured computing resources for execution. Permissions or authorization to execute the code may be shared to ensure that computing resources executing the code have the same permissions or authorization when executing the code.
-
17.
公开(公告)号:US11269911B1
公开(公告)日:2022-03-08
申请号:US16199115
申请日:2018-11-23
Applicant: Amazon Technologies, Inc.
Inventor: Timothy Jones , Andrew Borthwick , Sergei Dobroshinsky , Shehzad Qureshi , Stephen Michael Ash , Pedrito Uriah Maynard-Zhang , Chethan Kommaranahalli Rudramuni , Abhishek Sharma , Juliana Saussy , Adam Lawrence Joseph Heinermann , Alaykumar Navinchandra Desai , Mehul A. Shah , Mehul Y. Shah , Anurag Windlass Gupta , Prajakta Datta Damle
Abstract: Specified performance attributes may be used to configure machine learning transformations for ETL jobs. Performance attributes for a machine learning pipeline that applies a model to as part of a transformation for an ETL job may be used to configure a parameter in a stage of the machine learning pipeline. The configured stage may then be used when training the model. The trained machine learning pipeline may then be applied as part of a transformation operation included in an ETL job performed by the ETL system.
-
公开(公告)号:US11138220B2
公开(公告)日:2021-10-05
申请号:US15385764
申请日:2016-12-20
Applicant: Amazon Technologies, Inc.
Inventor: Mehul A. Shah , George Steven McPherson , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta , Benjamin Albert Sowell , Bohou Li
Abstract: Data transformation workflows may be generated to transform data objects. A source data schema for a data object and a target data format or target data schema for a data object may be identified. A comparison of the source data schema and the target data format or schema may be made to determine what transformations can be performed to transform the data object into the target data format or schema. Code to execute the transformation operations may then be generated. The code may be stored for subsequent modification or execution.
-
公开(公告)号:US10963479B1
公开(公告)日:2021-03-30
申请号:US15385777
申请日:2016-12-20
Applicant: Amazon Technologies, Inc.
Inventor: Mehul A. Shah , George Steven McPherson , Supratik Chakraborty , Anurag Windlass Gupta , Benjamin Albert Sowell
Abstract: Version controlled Extract, Transform, Load (ETL) code may be hosted for developing or executing the ETL job in an ETL system. A version of ETL code may be obtained from version controlled code store and maintained in a data store. Development or execution clients may submit access requests for the version of ETL code which may be serviced from the version stored in the data store. Updates to the version of the ETL code may be eventually committed to the version controlled code store. The latest version of ETL code may also be obtained from the version controlled code store when providing the ETL code in response to a request to retrieve the ETL code.
-
公开(公告)号:US20200159742A1
公开(公告)日:2020-05-21
申请号:US16752022
申请日:2020-01-24
Applicant: Amazon Technologies, Inc.
Inventor: George Steven McPherson , Mehul A. Shah , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta
IPC: G06F16/25 , G06F16/2455 , G06F16/23
Abstract: History for data objects may be maintained to detect data events. An indication of an Extract, Transform, Load (ETL) process applied to one or more source data objects to generate one or more transformed data objects may be received. History for the source data objects may be updated to include the transformed data objects and the ETL process that generated the transformed data objects. An evaluation of the update may be performed to determine whether an event associated with the data lineage is triggered. If the event is triggered, a notification of the event may be sent to one or more subscribers for the event.
-
-
-
-
-
-
-
-
-