-
Publication number: US10754844B1
Publication date: 2020-08-25
Application number: US15717614
Filing date: 2017-09-27
Applicant: Amazon Technologies, Inc.
Inventor: Vaibhav Jain , Tate Andrew Certain , James Christopher Sorenson, III , Sharatkumar Nagesh Kuppahally , Akshat Vig , Go Hori , Chase Kernan , Naveen Ganapathi Subramanian , Nicholas Gordon
Abstract: Various systems and methods to efficiently store backups of partitions of a database are disclosed. An indication to take a current snapshot of a partition of the database is received. A log size and operation type statistics are determined for a portion of the change log for the partition. A snapshot type is determined for the partition based at least in part on the log size and the operation type statistics, where the snapshot type specifies a log-based snapshot or a full snapshot. The snapshot is stored as metadata indicating a point in the change log if the determined snapshot type is log-based snapshot or stored as complete data of the partition if the determined snapshot type is full snapshot.
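The decision described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the `LogStats` fields, the comparison against partition size, and both thresholds are assumptions introduced here, since the abstract only says that the log size and operation type statistics inform the choice.

```python
from dataclasses import dataclass


@dataclass
class LogStats:
    # Hypothetical statistics for the change-log portion since the last snapshot.
    log_size_bytes: int
    puts: int
    deletes: int


def choose_snapshot_type(stats, partition_size_bytes,
                         max_log_ratio=0.5, max_delete_fraction=0.5):
    """Return "log-based" or "full" (thresholds are illustrative assumptions)."""
    total_ops = stats.puts + stats.deletes
    delete_fraction = stats.deletes / total_ops if total_ops else 0.0
    # A long log is expensive to replay, so prefer a full snapshot.
    if stats.log_size_bytes > max_log_ratio * partition_size_bytes:
        return "full"
    # Assumed heuristic: a delete-heavy log suggests the partition has shrunk.
    if delete_fraction > max_delete_fraction:
        return "full"
    return "log-based"


def take_snapshot(stats, partition_size_bytes, log_position, read_partition):
    """Store either a pointer into the change log or the complete partition data."""
    kind = choose_snapshot_type(stats, partition_size_bytes)
    if kind == "log-based":
        return {"type": kind, "log_position": log_position}
    return {"type": kind, "data": read_partition()}
```

A log-based snapshot here is only metadata (a position in the change log), while a full snapshot copies the partition contents.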
-
Publication number: US11755620B1
Publication date: 2023-09-12
Application number: US16844888
Filing date: 2020-04-09
Applicant: Amazon Technologies, Inc.
Inventor: Vaibhav Jain , Rishabh Jain , Amit Gupta , Palak Agrawal , Sagar Mundra , Benjamin Donald Wood , Lewis Bruck
IPC: G06F16/2453 , G06F16/28 , G06F16/242
CPC classification number: G06F16/28 , G06F16/2445 , G06F16/2453
Abstract: Non-relational database operations are invoked to provide results consistent with commands specified in a relational query language. A request to access data in a non-relational database is received that includes one or more commands specified in a query language for accessing data according to a relational data scheme. The request may be evaluated to determine one or more Application Programming Interfaces (APIs) to invoke operations at the non-relational database that provide a result consistent with the commands specified in the query language. The determined APIs are executed and a result returned for the request based on the execution of the APIs.
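As a rough illustration of mapping a relational-style query to a non-relational operation, the sketch below handles only one tiny statement shape. The operation names `get_item` and `scan`, the `key_attribute` parameter, and the supported grammar are assumptions made for this example and are not taken from the patent.

```python
import re

# Only "SELECT * FROM <table> [WHERE <column> = '<value>']" is recognized here.
_SELECT = re.compile(
    r"^\s*SELECT\s+\*\s+FROM\s+(\w+)"
    r"(?:\s+WHERE\s+(\w+)\s*=\s*'([^']*)')?\s*;?\s*$",
    re.IGNORECASE,
)


def plan_api_call(statement, key_attribute):
    """Pick a hypothetical API operation that yields a result consistent with the query."""
    match = _SELECT.match(statement)
    if match is None:
        raise ValueError("unsupported statement")
    table, column, value = match.groups()
    if column is None:
        # No predicate: read the whole table.
        return ("scan", {"table": table})
    if column == key_attribute:
        # Equality on the key can be served by a single-item lookup.
        return ("get_item", {"table": table, "key": {column: value}})
    # Any other predicate falls back to a filtered scan.
    return ("scan", {"table": table, "filter": {column: value}})
```

For example, `plan_api_call("SELECT * FROM Orders WHERE id = '42'", "id")` selects the single-item lookup, while the same query on a non-key column falls back to a filtered scan.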
-
Publication number: US11210462B1
Publication date: 2021-12-28
Application number: US16439572
Filing date: 2019-06-12
Applicant: Amazon Technologies, Inc.
Inventor: Ayan Sircar , Abhishek Mehrotra , Aniruddha Deshpande , Padmini Rajanna , Pawan Kaunth , Vaibhav Jain
IPC: G10L15/00 , G06F40/205 , G06F16/9032 , G06F16/903 , G10L15/08 , G06F3/16 , G06Q30/06 , G06F16/9035 , G06F40/284
Abstract: Systems and methods are described for processing voice input to detect and remove voice recognition errors in the context of a product attribute query. Spoken-word input may be processed to tentatively identify a query regarding a product and an attribute. A hierarchical product catalog is then used to identify categories that include the identified product, and an affinity score is determined for each category to indicate the relative strength of the relationship between the category and the attribute. The affinity score for each category is determined based on historical questions submitted to a question and answer service with regard to other products in the category. An affinity score for the product-attribute pairing is then determined based on a weighted average of the affinity scores for the product categories, and the affinity score is used to determine whether the question is valid and the voice input has been correctly processed.
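The weighted-average step in this abstract could look something like the sketch below. The category names, the weights, and the validity threshold are hypothetical; the abstract does not say how weights are chosen or what cutoff is applied.

```python
def product_attribute_affinity(category_scores, weights):
    """Weighted average of per-category affinity scores for one product-attribute pair."""
    total_weight = sum(weights[c] for c in category_scores)
    if total_weight == 0:
        return 0.0
    weighted_sum = sum(category_scores[c] * weights[c] for c in category_scores)
    return weighted_sum / total_weight


def is_valid_question(score, threshold=0.5):
    """Treat the recognized query as plausible if the affinity clears an assumed threshold."""
    return score >= threshold


# Hypothetical categories from a hierarchical catalog; the more specific
# category is given a larger weight in this example.
scores = {"Electronics": 0.2, "Headphones": 0.8}
weights = {"Electronics": 1.0, "Headphones": 3.0}
affinity = product_attribute_affinity(scores, weights)
```

With these inputs the affinity is (0.2 × 1 + 0.8 × 3) / 4, roughly 0.65, so the question would be accepted under the assumed 0.5 threshold.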
-
Publication number: US11120006B2
Publication date: 2021-09-14
Application number: US16014890
Filing date: 2018-06-21
Applicant: Amazon Technologies, Inc.
Inventor: Douglas Brian Terry , Tate Andrew Certain , Amit Gupta , Rishabh Jain , Vaibhav Jain , Alexander Richard Keyes , Somasundaram Perianayagam , Nathan Pellegrom Riley , Akshat Vig , Ming-Chuan Wu
Abstract: Transaction requests may be ordered in a distributed database according to an independently assigned sequence. Different distributed system nodes, such as a transaction coordinator and a storage node may independently assign sequence numbers to requests to access a distributed database. A storage node may receive the request from a transaction coordinator with an assigned sequence number and another request to which the storage node may assign a sequence number. The storage node can then order performance of the requests based on the sequence numbers.
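One way to picture the ordering step is a storage node that accepts requests already numbered by a transaction coordinator and numbers the rest itself. The class below is a simplified sketch; the `clock` callable, the heap-based queue, and the arrival-order tie-break are assumptions for illustration.

```python
import heapq
import itertools


class StorageNode:
    def __init__(self, clock):
        self._clock = clock              # callable returning this node's next sequence number
        self._queue = []                 # min-heap ordered by sequence number
        self._arrival = itertools.count()  # tie-breaker for equal sequence numbers

    def submit(self, request, sequence=None):
        """Queue a request; assign a local sequence number if the coordinator gave none."""
        if sequence is None:
            sequence = self._clock()
        heapq.heappush(self._queue, (sequence, next(self._arrival), request))
        return sequence

    def drain(self):
        """Return queued requests in sequence-number order."""
        ordered = []
        while self._queue:
            _, _, request = heapq.heappop(self._queue)
            ordered.append(request)
        return ordered
```

Requests submitted with coordinator sequence numbers 20 and 10, plus a local request numbered 15 by the node, would be performed in the order 10, 15, 20.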
-
Publication number: US11314779B1
Publication date: 2022-04-26
Application number: US15994897
Filing date: 2018-05-31
Applicant: Amazon Technologies, Inc.
Inventor: Vaibhav Jain
Abstract: Systems and methods for managing timestamps for a sequential update stream in a distributed system. A system includes a plurality of storage nodes of a data stream service of a provider network. Each storage node includes one or more processors and memory storing instructions executable by the one or more processors. The instructions cause the one or more processors to create a shard, at the storage node, for storing records of a sequential update stream recording changes to a database partition. The instructions also cause the one or more processors to assign a timestamp for the shard based on a local clock of the storage node. The instructions further cause the one or more processors to adjust the timestamp for the shard to a time later than a timestamp for a previous shard of the sequential update stream stored at a different storage node of the plurality of storage nodes.
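The timestamp adjustment can be sketched as a small function. The millisecond units and the `min_gap_ms` parameter are assumptions; the abstract only requires that the new shard's timestamp end up later than the previous shard's.

```python
def assign_shard_timestamp(local_clock_ms, previous_shard_ts_ms=None, min_gap_ms=1):
    """Timestamp a new shard from the local clock, keeping it after the previous shard."""
    timestamp = local_clock_ms
    if previous_shard_ts_ms is not None and timestamp <= previous_shard_ts_ms:
        # The local clock lags the node that stored the previous shard:
        # push the timestamp just past the previous shard's timestamp.
        timestamp = previous_shard_ts_ms + min_gap_ms
    return timestamp
```

If the local clock reads 1000 while the previous shard was stamped 1200 on another node, the new shard is stamped 1201, which keeps shard timestamps increasing along the stream despite clock skew.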
-
Publication number: US11314717B1
Publication date: 2022-04-26
Application number: US15632266
Filing date: 2017-06-23
Applicant: Amazon Technologies, Inc.
Inventor: Tate Andrew Certain , Anshul Gupta , Vaibhav Jain , Sharatkumar Nagesh Kuppahally , Alexander Richard Keyes , Rajaprabhu Thiruchi Loganathan , Ravi Math , Adam Douglas Morley , Lokendra Singh Panwar , Krishnan Seshadrinathan , James Christopher Sorenson, III , Stefano Stefani , Wei Xiao
Abstract: Scalable architecture for propagating updates may be implemented for data replicated from a data set. A node may receive updates to items in a data set that have been committed to the data set. The node may determine whether the update should be applied to a replicated portion of the data set. For updates that should be applied, the node may identify another node that hosts the replicated portion of the data set and send a request to the other node to perform a conditional atomic operation to apply the update to the item in the replicated portion of the data set. The condition may compare a version identifier associated with an update and a current version identifier for the item at the other node. If the condition evaluates true, then the update to the item in the replicated portion may be performed.
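The conditional atomic operation can be illustrated with an in-memory stand-in for the node hosting the replicated portion. The abstract says only that version identifiers are compared; the "apply only if the update's version is newer" rule, the integer versions, and the lock-based atomicity below are assumptions of this sketch.

```python
import threading


class ReplicaNode:
    """Hypothetical node hosting a replicated portion of the data set."""

    def __init__(self):
        self._items = {}               # key -> (version, value)
        self._lock = threading.Lock()  # makes check-and-write atomic in this sketch

    def conditional_put(self, key, value, version):
        """Apply the update only if its version is newer than the stored one."""
        with self._lock:
            current = self._items.get(key)
            if current is not None and current[0] >= version:
                return False           # condition evaluated false: stale update
            self._items[key] = (version, value)
            return True

    def get(self, key):
        with self._lock:
            entry = self._items.get(key)
            return None if entry is None else entry[1]
```

Because the condition is checked at the receiving node, updates that arrive out of order do not overwrite newer values.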
-
Publication number: US20220083529A1
Publication date: 2022-03-17
Application number: US17531560
Filing date: 2021-11-19
Applicant: Amazon Technologies, Inc.
Inventor: Vaibhav Jain , Vikas Jain , Sharatkumar Nagesh Kuppahally
Abstract: Various systems and methods of tracking database partition change log dependencies are described herein. A system is disclosed herein, according to some embodiments. A change log for a database partition of the plurality of database partitions is stored to a log store. An indication that a split of the database partition into two or more child database partitions has occurred is received. Responsive to the split of the database partition, the change log is split into two or more corresponding child change logs. A portion of the change log is deleted responsive to a determination that a respective complete snapshot is stored for each child database partition corresponding to a child change log of the change log and a determination that a retention window for the portion of the change log has passed.
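The two deletion conditions in this abstract combine into a single predicate. In the sketch below, the dictionary of child snapshot flags and the use of plain numeric timestamps are assumptions chosen for brevity.

```python
def can_delete_log_portion(child_snapshot_complete, portion_end_time, now, retention_window):
    """Return True when a portion of a parent change log is safe to delete.

    child_snapshot_complete maps each child partition (hypothetical IDs) to
    whether a complete snapshot has been stored for it.
    """
    every_child_covered = bool(child_snapshot_complete) and all(
        child_snapshot_complete.values()
    )
    retention_passed = (now - portion_end_time) > retention_window
    return every_child_covered and retention_passed
```

Both conditions must hold: a portion outside the retention window is still kept while any child partition lacks a complete snapshot.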
-
Publication number: US11250022B1
Publication date: 2022-02-15
Application number: US17037482
Filing date: 2020-09-29
Applicant: Amazon Technologies, Inc.
Inventor: Sharatkumar Nagesh Kuppahally , Peter Zhivkov , Somasundaram Perianayagam , James Christopher Sorenson, III , Amit Gupta , Chase Kernan , Nicholas Gordon , Ravi Math , Vaibhav Jain , Remi Paucher
Abstract: Offline building of a projected data subset may be performed. A request to create a data set that is a projected subset of data from a source data set may be received. A data store separate from the data store storing the source data set may store a copy of the source data set that is used to replicate items to the projected subset of data according to a schema for the projected data subset. Updates made to the source data set may also be replicated to the projected data subset according to the schema. Conflicts between replicated items and replicated updates to the projected data set may be resolved by comparing a version identifier for the replicated update and replicated item to determine what to store in the projected data subset.
-
Publication number: US10997160B1
Publication date: 2021-05-04
Application number: US16364025
Filing date: 2019-03-25
Applicant: Amazon Technologies, Inc.
Inventor: Akshat Vig , Somasundaram Perianayagam , Vaibhav Jain , Alexander Richard Keyes , Stefano Stefani , Douglas Brian Terry , James Christopher Sorenson, III , Amit Gupta , Rishabh Jain
IPC: G06F16/23
Abstract: Updates performed as part of transaction requests to a data store may be included in a stream of updates. Updates to items in the data store that are included in transactions determined not to be committed to the data store may be excluded from the stream of updates. Records in the stream of updates may include an identifier for the transaction that included the update described by the record. The identifier for the transaction may be used to identify updates to other items in the data store that are included in the same transaction.
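A simplified picture of such a stream: records carry a transaction identifier, and updates from transactions that did not commit are left out. The record layout and function names below are hypothetical.

```python
def build_update_stream(updates, committed_txn_ids):
    """Keep only updates whose transaction committed, tagging each record with its txn id."""
    return [
        {"item": u["item"], "value": u["value"], "txn_id": u["txn_id"]}
        for u in updates
        if u["txn_id"] in committed_txn_ids
    ]


def items_in_same_transaction(stream, txn_id):
    """Use the transaction identifier to find the other items updated together."""
    return [record["item"] for record in stream if record["txn_id"] == txn_id]


updates = [
    {"item": "cart", "value": 1, "txn_id": "t1"},
    {"item": "stock", "value": 9, "txn_id": "t1"},
    {"item": "cart", "value": 2, "txn_id": "t2"},  # t2 is not committed below
]
stream = build_update_stream(updates, committed_txn_ids={"t1"})
```

A consumer reading the `cart` record can then look up `t1` to discover that `stock` changed in the same transaction.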
-
Publication number: US10990581B1
Publication date: 2021-04-27
Application number: US15717606
Filing date: 2017-09-27
Applicant: Amazon Technologies, Inc.
Inventor: Vaibhav Jain
Abstract: Various systems and methods of tracking a size of a database change log are described herein. A system is disclosed herein, according to some embodiments. An indication of a plurality of change events that have occurred at the database is received. A new log segment is generated for the change log for the database based on the received indication including the plurality of change events. Metadata for an end log segment from the change log is retrieved to identify a cumulative size for the change log. A new cumulative size for the change log is determined based on a size of the new log segment and the identified cumulative size. The new log segment is stored to the change log as a new end log segment with metadata indicating the new cumulative size for the change log.
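The bookkeeping described here can be sketched with a list standing in for the change log. Representing events as `bytes` and segments as dictionaries is an assumption of this example.

```python
def append_segment(change_log, events):
    """Append a new end segment whose metadata carries the running total size."""
    segment_size = sum(len(event) for event in events)  # events are bytes objects here
    # Only the current end segment's metadata is read to find the total so far.
    previous_total = change_log[-1]["cumulative_size"] if change_log else 0
    segment = {
        "events": list(events),
        "size": segment_size,
        "cumulative_size": previous_total + segment_size,
    }
    change_log.append(segment)
    return segment


log = []
first = append_segment(log, [b"abc", b"de"])
second = append_segment(log, [b"xyz"])
```

Because each end segment stores the cumulative size, the total size of the log is available from a single metadata read instead of a scan over every segment.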