-
公开(公告)号:US20240184785A1
公开(公告)日:2024-06-06
申请号:US18441948
申请日:2024-02-14
Applicant: Amazon Technologies, Inc.
Inventor: Lonnie J. Princehouse , Timothy A. Rath , Gaurav Gupta , Mustafa Ozan Ozen , Omer Ahmed Zaki , Karthik Gurumoorthy Subramanya Bharathy , Gaurav Saxena
IPC: G06F16/2455 , G06F3/06 , G06F16/248 , G06F16/27
CPC classification number: G06F16/24568 , G06F3/0607 , G06F3/0644 , G06F3/067 , G06F16/248 , G06F16/278
Abstract: Methods, systems, and computer-readable media for continuous functions in a time-series database are disclosed. A plurality of data points of a time series are stored into one or more storage tiers of a time-series database. The plurality of data points comprise a plurality of discrete measurements at respective timestamps. Using one or more query processors of the time-series database, a query of the time series is initiated. The query indicates a time range. Using the one or more query processors, a continuous function is determined that represents a segment of the time series in the time range. The continuous function is determined based at least in part on the plurality of data points. An operation is performed using the continuous function as an input.
-
公开(公告)号:US11934409B2
公开(公告)日:2024-03-19
申请号:US16199102
申请日:2018-11-23
Applicant: Amazon Technologies, Inc.
Inventor: Lonnie J. Princehouse , Timothy A. Rath , Gaurav Gupta , Mustafa Ozan Ozen , Omer Ahmed Zaki , Karthik Gurumoorthy Subramanya Bharathy , Gaurav Saxena
IPC: G06F16/00 , G06F3/06 , G06F16/2455 , G06F16/248 , G06F16/27
CPC classification number: G06F16/24568 , G06F3/0607 , G06F3/0644 , G06F3/067 , G06F16/248 , G06F16/278
Abstract: Methods, systems, and computer-readable media for continuous functions in a time-series database are disclosed. A plurality of data points of a time series are stored into one or more storage tiers of a time-series database. The plurality of data points comprise a plurality of discrete measurements at respective timestamps. Using one or more query processors of the time-series database, a query of the time series is initiated. The query indicates a time range. Using the one or more query processors, a continuous function is determined that represents a segment of the time series in the time range. The continuous function is determined based at least in part on the plurality of data points. An operation is performed using the continuous function as an input.
-
公开(公告)号:US10997137B1
公开(公告)日:2021-05-04
申请号:US16219686
申请日:2018-12-13
Applicant: Amazon Technologies, Inc.
Inventor: Dumanshu Goyal , Timothy A. Rath
IPC: G06F16/00 , G06F16/21 , G06F16/2457 , G06F16/2458 , G06F16/248
Abstract: Methods, systems, and computer-readable media for two-dimensional partition splitting in a time-series database are disclosed. Stream processor(s) write elements of time-series data to a first replica group of a first tile associated with a first set of spatial and temporal boundaries. A second replica group is initialized with the same boundaries. A control plane generates updated metadata for the first tile. The updated metadata indicates a modified first set of spatial and temporal boundaries for the first replica group associated with the first tile and a second set of spatial and temporal boundaries for the second replica group associated with a second tile. The modified first set represents a first portion of the first set, and the second set represents a second portion of the first set. The updated metadata is obtained by the stream processor(s) which write, to the second tile, time-series data within the second set of boundaries.
-
公开(公告)号:US20200167360A1
公开(公告)日:2020-05-28
申请号:US16199078
申请日:2018-11-23
Applicant: Amazon Technologies, Inc.
Inventor: Timothy A. Rath , Gaurav Gupta , Mustafa Ozan Ozen , Omer Ahmed Zaki
IPC: G06F16/2455 , G06F16/27 , G06F3/06 , G06F16/25
Abstract: Methods, systems, and computer-readable media for a scalable architecture for a distributed time-series database are disclosed. Using a fleet of ingestion routers, time-series data generated by a plurality of client devices is stored into a plurality of durable partitions. The time-series data comprises a plurality of time series, and an amount of the ingestion routers is determined based at least in part on an ingestion rate of the time-series data. Using a fleet of stream processors, the time-series data from the durable partitions is stored into a plurality of storage tiers including a first storage tier and a second storage tier. A retention period for the first storage tier differs from a retention period for the second storage tier. An amount of the stream processors is determined based at least in part on the time-series data in the durable partitions.
-
公开(公告)号:US12169487B1
公开(公告)日:2024-12-17
申请号:US17548293
申请日:2021-12-10
Applicant: Amazon Technologies, Inc.
Inventor: Kanishka Chaturvedi , Timothy A. Rath , Rolf Petter Skog , Zhihao Li , Zhong Ren
Abstract: Techniques for performing a scratch space flush of cold tier upsert data into cold tier storage while allowing continued hot tier ingestion are described. Data points to be written into a time series database having a hot tier and a cold tier may be designated to be written directly into the cold tier, bypassing the hot tier that typically stores new data points that eventually are aged out into the cold tier. New data points for insertion directly into the cold tier may be stored to a scratch space storage and moved into the cold tier according to a non-blocking protocol that allows for continued hot tier ingestion without significant service disruptions.
-
公开(公告)号:US11989186B2
公开(公告)日:2024-05-21
申请号:US16199078
申请日:2018-11-23
Applicant: Amazon Technologies, Inc.
Inventor: Timothy A. Rath , Gaurav Gupta , Mustafa Ozan Ozen , Omer Ahmed Zaki
IPC: G06F16/2455 , G06F16/25 , G06F16/27
CPC classification number: G06F16/24568 , G06F16/252 , G06F16/278
Abstract: Methods, systems, and computer-readable media for a scalable architecture for a distributed time-series database are disclosed. Using a fleet of ingestion routers, time-series data generated by a plurality of client devices is stored into a plurality of durable partitions. The time-series data comprises a plurality of time series, and an amount of the ingestion routers is determined based at least in part on an ingestion rate of the time-series data. Using a fleet of stream processors, the time-series data from the durable partitions is stored into a plurality of storage tiers including a first storage tier and a second storage tier. A retention period for the first storage tier differs from a retention period for the second storage tier. An amount of the stream processors is determined based at least in part on the time-series data in the durable partitions.
-
公开(公告)号:US11609933B1
公开(公告)日:2023-03-21
申请号:US16039258
申请日:2018-07-18
Applicant: Amazon Technologies, Inc.
Inventor: Mustafa Ozan Ozen , Sandeep Bhatia , Lonnie J. Princehouse , Timothy A. Rath , Gaurav Saxena
IPC: G06F16/00 , G06F16/27 , G06F16/21 , G06F16/23 , G06F16/2458
Abstract: Atomic partition scheme updates to partition items may be implemented by a time series database. A time threshold may be assigned to partition scheme update so that the time threshold may be applied across a set of ingestion nodes that may apply the partition scheme update the same. A request to store an item with a timestamp less than the time threshold may be stored in one partition of the time series database, while the item may be stored in a different partition of the time series database if the item has timestamp greater than or equal to the time threshold.
-
公开(公告)号:US11068537B1
公开(公告)日:2021-07-20
申请号:US16216580
申请日:2018-12-11
Applicant: Amazon Technologies, Inc.
Inventor: Dumanshu Goyal , Timothy A. Rath , Eric Coll , Gaurav Gupta , Lonnie J. Princehouse
IPC: G06F16/00 , G06F16/901 , G06F16/27
Abstract: Methods, systems, and computer-readable media for partition segmenting in a distributed time-series database are disclosed. A first host receives a request to store a data element in a tile comprising linked segments including a head segment and a tail segment. The head and tail segments are associated with the same spatial and temporal boundaries, and the data element is within those boundaries. The first host determines that the data element is not stored in the head segment and forwards the data element to a next segment of the tile. An additional host receives a request to store the data element in the tail segment. The additional host determines that the data element is not stored in the tail segment and then stores the data element in the tail segment.
-
公开(公告)号:US11263184B1
公开(公告)日:2022-03-01
申请号:US16116791
申请日:2018-08-29
Applicant: Amazon Technologies, Inc.
Inventor: Mustafa Ozan Ozen , Dumanshu Goyal , Lonnie J. Princehouse , Gaurav Saxena , Atilim Cetin , Gaurav Gupta , Sandeep Bhatia , Nilesh Shahdadpuri , Timothy A. Rath , Eric Coll , Nirmesh Khandelwal
IPC: G06F16/00 , G06F16/21 , G06F16/27 , G06F16/2458 , G06F16/248 , G06F16/901 , G06F16/2457
Abstract: Methods, systems, and computer-readable media for partition splitting in a distributed database are disclosed. A partition of data is split into a first sub-partition and a second sub-partition. A first portion of the data is assigned to the first sub-partition, and a second portion of the data is assigned to the second sub-partition. One or more elements of the first portion of the data from the partition and an additional one or more elements of the first portion of the data from the first sub-partition are stored into a first node. One or more elements of the second portion of the data from the partition and an additional one or more elements of the second portion of the data from the second sub-partition are stored into a second node. The partition is prevented from receiving new data after the partition is split into the first and second sub-partitions.
-
公开(公告)号:US20200167361A1
公开(公告)日:2020-05-28
申请号:US16199102
申请日:2018-11-23
Applicant: Amazon Technologies, Inc.
Inventor: Lonnie J. Princehouse , Timothy A. Rath , Gaurav Gupta , Mustafa Ozan Ozen , Omer Ahmed Zaki , Karthik Gurumoorthy Subramanya Bharathy , Gaurav Saxena
IPC: G06F16/2455 , G06F16/27 , G06F3/06 , G06F16/248
Abstract: Methods, systems, and computer-readable media for continuous functions in a time-series database are disclosed. A plurality of data points of a time series are stored into one or more storage tiers of a time-series database. The plurality of data points comprise a plurality of discrete measurements at respective timestamps. Using one or more query processors of the time-series database, a query of the time series is initiated. The query indicates a time range. Using the one or more query processors, a continuous function is determined that represents a segment of the time series in the time range. The continuous function is determined based at least in part on the plurality of data points. An operation is performed using the continuous function as an input.
-
-
-
-
-
-
-
-
-