-
Publication number: US11709809B1
Publication date: 2023-07-25
Application number: US17216359
Filing date: 2021-03-29
Applicant: Amazon Technologies, Inc.
Inventor: Bohou Li, Vijayan Prabhakaran, Mehul A. Shah, Benjamin Sowell, Douglas Brian Terry
CPC classification number: G06F16/219, G06F16/2246
Abstract: Techniques for using tree data structures to maintain a transactionally consistent set with support for time-travel queries are described. When a transaction commits, a new version of the tree data structure is created using a copy-on-write based method such that the tree shares internal nodes with previous trees to save space. This approach may be used in the implementation of a transactional data catalog in which the files that make up a table are stored in a transactional set.
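The copy-on-write idea in this abstract can be illustrated with a minimal path-copying sketch. This is not the patented implementation: the unbalanced binary search tree, the `VersionedSet` class, and all method names are assumptions chosen for brevity. Each commit copies only the nodes on the search path and shares every other subtree with the previous version, and a time-travel lookup picks the newest version at or before the requested timestamp.

```python
from bisect import bisect_right
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Node:
    key: str
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def insert(root, key):
    """Return the root of a new tree containing key, copying only the search path."""
    if root is None:
        return Node(key)
    if key < root.key:
        return Node(root.key, insert(root.left, key), root.right)
    if key > root.key:
        return Node(root.key, root.left, insert(root.right, key))
    return root  # key already present: reuse the existing subtree unchanged


def contains(root, key):
    while root is not None:
        if key == root.key:
            return True
        root = root.left if key < root.key else root.right
    return False


class VersionedSet:
    """Hypothetical helper: one immutable tree root per committed transaction."""

    def __init__(self):
        self._times = []  # commit timestamps, strictly ascending
        self._roots = []  # tree root committed at the matching timestamp

    def commit(self, timestamp, added_keys):
        if self._times and timestamp <= self._times[-1]:
            raise ValueError("commit timestamps must increase")
        root = self._roots[-1] if self._roots else None
        for key in added_keys:
            root = insert(root, key)
        self._times.append(timestamp)
        self._roots.append(root)

    def contains_as_of(self, timestamp, key):
        """Time-travel query: membership in the newest version at or before timestamp."""
        i = bisect_right(self._times, timestamp) - 1
        return i >= 0 and contains(self._roots[i], key)
```

In this sketch a query as of a time between two commits sees only the earlier commit's files, and older roots stay valid because nodes are never mutated. A production structure would presumably use a balanced, wide tree and support removals, which are omitted here.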
-
Publication number: US11842085B1
Publication date: 2023-12-12
Application number: US17710444
Filing date: 2022-03-31
Applicant: Amazon Technologies, Inc.
IPC: G06F3/06
CPC classification number: G06F3/0683, G06F3/061, G06F3/064, G06F3/0647
Abstract: Methods for modeling performance of tiered storage of a data processing service given an increase in the storage capacity of a warm storage tier of the tiered storage are disclosed. Buffers in the warm storage tier are used to store data block identifiers corresponding to a set of data blocks that would be stored in the warm storage tier, in addition to those already stored there, given the increase in storage capacity. When an incoming query targets a data block that has a corresponding data block identifier in one of the buffers, a hit counter is incremented in order to track the hit rate that would be achieved on the up-sized warm storage tier. In response to adding the data block targeted by the query to the warm storage tier, one or more evictions from the warm storage tier may additionally be triggered.
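A rough way to picture the buffer-and-counter mechanism is a "ghost" list that holds identifiers only. The sketch below is an illustration under stated assumptions, not the claimed method: it assumes LRU eviction, and the class `WarmTierModel` and its fields are hypothetical names. A hit on an identifier in the ghost buffer counts as a hit the larger tier would have served, and admitting the block to the real warm tier can trigger an eviction.

```python
from collections import OrderedDict


class WarmTierModel:
    """Hypothetical model: an LRU warm tier plus a buffer of block identifiers."""

    def __init__(self, capacity, extra_capacity):
        self.capacity = capacity              # blocks the warm tier holds today
        self.extra_capacity = extra_capacity  # additional blocks being modeled
        self.warm = OrderedDict()             # block id -> None, least recent first
        self.ghost = OrderedDict()            # identifiers only, no block data
        self.hits = 0                         # hits served by the real warm tier
        self.ghost_hits = 0                   # hits the up-sized tier would add
        self.queries = 0

    def access(self, block_id):
        self.queries += 1
        if block_id in self.warm:
            self.hits += 1
            self.warm.move_to_end(block_id)
            return
        if block_id in self.ghost:
            # The block would still be warm if the tier were larger.
            self.ghost_hits += 1
            del self.ghost[block_id]
        # Admit the block; this may evict the least recently used block.
        self.warm[block_id] = None
        if len(self.warm) > self.capacity:
            evicted, _ = self.warm.popitem(last=False)
            self.ghost[evicted] = None
            if len(self.ghost) > self.extra_capacity:
                self.ghost.popitem(last=False)

    def projected_hit_rate(self):
        """Estimated hit rate if the warm tier grew by extra_capacity blocks."""
        if self.queries == 0:
            return 0.0
        return (self.hits + self.ghost_hits) / self.queries
```

Because only identifiers are buffered, the estimate costs far less memory than actually provisioning the larger tier. Under a different eviction policy the ghost buffer would need to mirror that policy to give a meaningful estimate.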
-
Publication number: US11599514B1
Publication date: 2023-03-07
Application number: US17216326
Filing date: 2021-03-29
Applicant: Amazon Technologies, Inc.
Inventor: Vinay Agrawal, Bohou Li, Vikas Malik, Tushar Poddar, Vijayan Prabhakaran, Mukesh Punhani, Mehul A. Shah, Benjamin Sowell, Douglas Brian Terry
IPC: G06F16/00, G06F16/21, G06F16/23, G06F16/2458, G06F16/22, G06F16/27, G06F16/248, G06F16/2457
Abstract: Techniques for implementing systems using transactional version sets are described. Transactional version sets, or t-sets, include a collection of elements, each having a collection of metadata. A t-set is transactional in that a sequence of updates to one or more t-sets is made within an atomic transaction. A t-set is versioned in that each committed transaction that updates it produces a new timestamped version that can be accessed via time-travel queries.
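The two properties named in this abstract, atomic updates across several t-sets and timestamped versions, can be sketched in a few lines. Everything here is an assumption made for illustration: the `TSet` and `Transaction` classes, the use of a process-local counter as the timestamp source, and the choice to copy the whole element map per version instead of sharing structure.

```python
import itertools


class TSet:
    """Hypothetical t-set: a list of (commit timestamp, {element: metadata})."""

    def __init__(self):
        self.versions = [(0, {})]

    def as_of(self, timestamp):
        """Time-travel query: the newest version committed at or before timestamp."""
        result = {}
        for ts, snapshot in self.versions:
            if ts > timestamp:
                break
            result = snapshot
        return result


class Transaction:
    _clock = itertools.count(1)  # assumed logical timestamp source

    def __init__(self):
        self._pending = []  # buffered (tset, op, element, metadata) updates

    def put(self, tset, element, metadata):
        self._pending.append((tset, "put", element, metadata))

    def remove(self, tset, element):
        self._pending.append((tset, "remove", element, None))

    def commit(self):
        # Stage every new snapshot first, so an error leaves all t-sets untouched.
        staged = {}
        for tset, op, element, metadata in self._pending:
            if id(tset) not in staged:
                staged[id(tset)] = (tset, dict(tset.versions[-1][1]))
            snapshot = staged[id(tset)][1]
            if op == "put":
                snapshot[element] = metadata
            else:
                del snapshot[element]  # KeyError aborts the whole transaction
        timestamp = next(Transaction._clock)
        for tset, snapshot in staged.values():
            tset.versions.append((timestamp, snapshot))
        return timestamp
```

All t-sets touched by one transaction receive the same commit timestamp, so a time-travel read at that timestamp sees either every update from the transaction or none of them.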
-
Publication number: US11886422B1
Publication date: 2024-01-30
Application number: US17216347
Filing date: 2021-03-29
Applicant: Amazon Technologies, Inc.
Inventor: Bohou Li, Vijayan Prabhakaran, Mehul A. Shah, Benjamin Sowell, Douglas Brian Terry
IPC: G06F16/23, G06F16/245, G06F16/21, G06F16/22
CPC classification number: G06F16/2379, G06F16/219, G06F16/2282, G06F16/2343, G06F16/245
Abstract: A protocol is described for implementing ACID transactions with snapshot isolation in a distributed setting, without requiring synchronized clocks. The protocol ensures at commit time that transactions touching common objects do not commit out of order. The protocol can be used in the context of a distributed data lake built on an object store in which clients can transactionally add or remove objects from logical tables.
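A commit-time check of this general kind can be sketched with a logical sequence number in place of wall-clock time. The code below is a generic first-committer-wins validation, swapped in for illustration; it is not the protocol the patent claims. The single in-process `Catalog` sequencer and the names `Txn`, `touch`, and `ConflictError` are assumptions, and a real distributed deployment would need the check-and-commit step to be atomic across nodes.

```python
class ConflictError(Exception):
    """Raised when another transaction already committed a touched object."""


class Catalog:
    """Hypothetical sequencer: orders commits with a counter, not a clock."""

    def __init__(self):
        self.commit_seq = 0
        self.last_writer = {}  # object key -> sequence of the last commit touching it

    def begin(self):
        # The snapshot is identified by the latest commit visible at start.
        return Txn(self, self.commit_seq)


class Txn:
    def __init__(self, catalog, start_seq):
        self.catalog = catalog
        self.start_seq = start_seq
        self.touched = set()

    def touch(self, key):
        """Record that this transaction adds or removes the object named key."""
        self.touched.add(key)

    def commit(self):
        catalog = self.catalog
        # Reject the commit if any touched object changed after our snapshot.
        for key in self.touched:
            if catalog.last_writer.get(key, 0) > self.start_seq:
                raise ConflictError(key)
        catalog.commit_seq += 1
        for key in self.touched:
            catalog.last_writer[key] = catalog.commit_seq
        return catalog.commit_seq
```

Two concurrent transactions that touch the same object cannot both commit in this sketch: the second one fails validation and must retry against a newer snapshot, while transactions on disjoint objects proceed independently.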
-
Publication number: US12216653B1
Publication date: 2025-02-04
Application number: US17710476
Filing date: 2022-03-31
Applicant: Amazon Technologies, Inc.
IPC: G06F15/16, G06F16/2453
Abstract: Methods for improving performance of tiered storage of a data processing service by detecting and reducing thrashing of data blocks between warm and cold storage tiers are disclosed. In order to understand the frequency of hits by incoming queries to data blocks that are not currently stored in the warm storage tier, the elapsed time between query hits to the respective data blocks may be tracked using timers. Elapsed times below a given amount of time may be used to indicate thrashing. For example, a recently evicted data block that is hit by a query within a short amount of time after eviction may indicate thrashing. In scenarios in which thrashing may be occurring, a threshold corresponding to the number of times a given data block in the cold storage tier must receive a query hit before being added to the warm storage tier may be enabled.
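The eviction timer and the hit-count threshold can be sketched together as a small admission guard. This is a simplified illustration, not the disclosed method: the `ThrashGuard` class, its parameters, and the decision to pass the current time in explicitly (instead of running real timers) are all assumptions, and the sketch never switches the threshold back off.

```python
class ThrashGuard:
    """Hypothetical admission guard for promoting cold blocks to the warm tier."""

    def __init__(self, window_seconds, hits_required):
        self.window = window_seconds        # re-hit sooner than this suggests thrashing
        self.hits_required = hits_required  # cold hits needed once the threshold is on
        self.evicted_at = {}                # block id -> time of last eviction
        self.cold_hits = {}                 # block id -> hits received while cold
        self.threshold_on = False

    def record_eviction(self, block_id, now):
        self.evicted_at[block_id] = now
        self.cold_hits.pop(block_id, None)

    def should_promote(self, block_id, now):
        """Call when a query hits a block that is currently in the cold tier."""
        evicted = self.evicted_at.get(block_id)
        if evicted is not None and now - evicted < self.window:
            # A recently evicted block was hit again quickly: likely thrashing.
            self.threshold_on = True
        if not self.threshold_on:
            return True  # no thrashing observed, so promote on the first hit
        count = self.cold_hits.get(block_id, 0) + 1
        if count >= self.hits_required:
            self.cold_hits.pop(block_id, None)
            self.evicted_at.pop(block_id, None)
            return True
        self.cold_hits[block_id] = count
        return False
```

Requiring several cold hits before promotion keeps a block that is queried only once from displacing a block that is about to be reused, which is the churn the threshold is meant to dampen.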