-
公开(公告)号:US11886422B1
公开(公告)日:2024-01-30
申请号:US17216347
申请日:2021-03-29
Applicant: Amazon Technologies, Inc.
Inventor: Bohou Li , Vijayan Prabhakaran , Mehul A. Shah , Benjamin Sowell , Douglas Brian Terry
IPC: G06F16/23 , G06F16/245 , G06F16/21 , G06F16/22
CPC classification number: G06F16/2379 , G06F16/219 , G06F16/2282 , G06F16/2343 , G06F16/245
Abstract: A protocol for implementing ACID transactions that provides snapshot isolation in a distributed setting that does not require synchronized clocks is described. The protocol ensures at commit time that transactions touching common objects do not commit out of order. The protocol can be used in the context of a distributed data lake built on an object store in which clients can transactionally add or remove objects from logical tables.
-
公开(公告)号:US11709809B1
公开(公告)日:2023-07-25
申请号:US17216359
申请日:2021-03-29
Applicant: Amazon Technologies, Inc.
Inventor: Bohou Li , Vijayan Prabhakaran , Mehul A. Shah , Benjamin Sowell , Douglas Brian Terry
CPC classification number: G06F16/219 , G06F16/2246
Abstract: Techniques for using tree data structures to maintain a transactionally consistent set with support for time-travel queries are described. When a transaction commits, a new version of the tree data structure is created using a copy-on-write based method such that the tree shares internal nodes with previous trees to save space. This approach may be used in the implementation of a transactional data catalog in which the files that make up a table are stored in a transactional set.
-
公开(公告)号:US11714573B1
公开(公告)日:2023-08-01
申请号:US17216373
申请日:2021-03-29
Applicant: Amazon Technologies, Inc.
Inventor: Shashank Bhardwaj , Roman Gavrilov , Brian Scott Ross , Mehul A. Shah , Benjamin Sowell , Anthony A. Virtuoso , Linan Zheng
IPC: G06F3/06
CPC classification number: G06F3/0659 , G06F3/067 , G06F3/0613 , G06F3/0614 , G06F3/0653
Abstract: Techniques for storage optimization in a distributed object store are described. A storage optimization service of a provider network monitors changes to data objects in a distributed object store that are part of a data lake and are referenced by a table index. The storage optimization service determines whether particular storage optimizations involving the data objects would be beneficial, prioritizes the ordering of these optimizations with a focus on performing impactful optimizations first, while intelligently scheduling the optimizations to avoid overutilization of available resources.
-
公开(公告)号:US11599514B1
公开(公告)日:2023-03-07
申请号:US17216326
申请日:2021-03-29
Applicant: Amazon Technologies, Inc.
Inventor: Vinay Agrawal , Bohou Li , Vikas Malik , Tushar Poddar , Vijayan Prabhakaran , Mukesh Punhani , Mehul A. Shah , Benjamin Sowell , Douglas Brian Terry
IPC: G06F16/00 , G06F16/21 , G06F16/23 , G06F16/2458 , G06F16/22 , G06F16/27 , G06F16/248 , G06F16/2457
Abstract: Techniques for implementing systems using transactional version sets are described. Transactional version sets or t-sets include a collection of elements, each having a collection of metadata. A t-set is transactional in that a sequence of updates to one or more t-sets are made within an atomic transaction. A t-set is versioned since each committed transaction that updates it produces a new timestamped version that can be accessed via time-travel queries.
-
-
-