-
公开(公告)号:US20170228290A1
公开(公告)日:2017-08-10
申请号:US15495827
申请日:2017-04-24
Applicant: Amazon Technologies, Inc.
Inventor: Maximliano Maccanti , Timothy Andrew Rath , Rama Krishna Sandeep Pokkunuri , Akshat Vig , Clarence Wing Yin NG , Srivaths Badrinath Copparam , Rajaprabhu Thiruchi Loganathan , Wei Xiao , William Alexander Stevenson
IPC: G06F11/14
CPC classification number: G06F11/1451 , G06F11/1458 , G06F11/1464 , G06F11/1469 , G06F11/2094 , G06F2201/80
Abstract: A system that implements a data storage service may store data for a database table in multiple replicated partitions on respective storage nodes. In response to a request to back up a table, the service may back up individual partitions of the table to a remote storage system independently and (in some cases) in parallel, and may update (or create) and store metadata about the table and its partitions on storage nodes of the data storage service and/or in the remote storage system. Backing up each partition may include exporting it from the database in which the table is stored, packaging and compressing the exported partition for upload, and uploading the exported, packaged, and compressed partition to the remote storage system. The remote storage system may be a key-value durable storage system in which each backed-up partition is accessible using its partition identifier as the key.
-
公开(公告)号:US09576038B1
公开(公告)日:2017-02-21
申请号:US13865113
申请日:2013-04-17
Applicant: Amazon Technologies, Inc.
Inventor: Xianglong Huang , David Alan Lutz , Wei Xiao , Maximiliano Maccanti , Somasundaram Perianayagam , Rande A. Blackman , Stuart Henry Seelye Marshall
CPC classification number: G06F17/30575
Abstract: A distributed database management system may comprise a plurality of computing nodes. A request to update an item maintained by the system may be acknowledged as durable and committed once an entry corresponding to the request has been written to a log file and quorum among the computing nodes has been achieved. Improved consistency may be achieved by maintaining snapshots of committed item states within queryable in-memory snapshot data structures. Range queries may be performed by merging a secondary index with the snapshots and applying filters. Projections may be completed by retrieving additional data from an item collection maintain on one or more storage devices.
Abstract translation: 分布式数据库管理系统可以包括多个计算节点。 一旦与该请求相对应的条目已经被写入到日志文件中并且已经实现了计算节点之间的仲裁,则更新系统维护的项目的请求可以被确认为持久的并被提交。 通过在可查询的内存中快照数据结构中维护已提交项目状态的快照可以实现改进的一致性。 可以通过将辅助索引与快照合并并应用过滤器来执行范围查询。 可以通过从一个或多个存储设备上的项目集合维护中检索附加数据来完成投影。
-
63.
公开(公告)号:US09385956B2
公开(公告)日:2016-07-05
申请号:US13926697
申请日:2013-06-25
Applicant: Amazon Technologies, Inc.
Inventor: Wei Xiao , Bjorn Patrick Swift , Kiran-Kumar Muniswamy-Reddy , Miguel Mascarenhas Filipe , Yijun Lu , Stefano Stefani , James R. Hamilton
IPC: H04J1/16 , H04L12/26 , H04J3/14 , H04L12/801
CPC classification number: H04L47/12
Abstract: Methods and apparatus for compound token buckets usable for burst-mode admission control are disclosed. A peak burst rate and a sustained burst rate of work requests that are to be supported at a work target are determined. The maximum token populations of a peak-burst token bucket and a sustained-burst token bucket are configured, based on the peak burst rate and the sustained burst rate respectively. In response to receiving a work request directed at the work target, a determination to accept the work request for execution is made based at least in part on the token population of the peak-burst token bucket and/or the sustained-burst token bucket.
Abstract translation: 公开了可用于突发模式准入控制的复合令牌桶的方法和装置。 确定要在工作目标中支持的峰值突发速率和工作请求的持续突发速率。 基于峰值突发速率和持续突发速率,配置峰值突发令牌桶和持续突发令牌桶的最大标记量。 响应于接收到针对工作目标的工作请求,至少部分地基于峰 - 突发令牌桶和/或持续突发令牌桶的令牌总数进行接受执行工作请求的确定。
-
公开(公告)号:US12175968B1
公开(公告)日:2024-12-24
申请号:US17213492
申请日:2021-03-26
Applicant: Amazon Technologies, Inc.
Inventor: Mohamed Farouk AbdelHady , Qian Hu , Mohamed Thahir Peer Mohamed , Wei Xiao , Zheng Gao , Radhika Arava , Xibin Gao
Abstract: Techniques for selecting a skill to execute in response to a natural language input are described. A system may receive a natural language input, determine profile data associated with the natural language input, and determine the profile data indicates a locale and at least first language and second languages. The system determines first and second sets of skills corresponding to the locale/first language and locale/second language, respectively. The system determines a first group of skill candidates corresponding to a portion of the first set of skills, and determines a second group of skill candidates corresponding to a portion of the second set of skills. The system performs ranking processing across the first and second groups of skills to determine a best skill for responding to the natural language input. Thus, in some situations, the skill invoked may not correspond to the same language represented in the natural language input.
-
公开(公告)号:US11940990B1
公开(公告)日:2024-03-26
申请号:US15625976
申请日:2017-06-16
Applicant: Amazon Technologies, Inc.
Inventor: Sharatkumar Nagesh Kuppahally , Ravi Math , Adam Douglas Morley , Ming-chuan Wu , Wei Xiao , Rajaprabhu Thiruchi Loganathan
IPC: G06F16/23 , G06F16/245 , G06F16/27
CPC classification number: G06F16/2379 , G06F16/245 , G06F16/27
Abstract: Consistent queries to replicated data may be performed according to global clock values. Time values for updates performed at a data set may be identified according to a global clock. The updates may be replicated to a replicated portion of a data set. Queries directed to the replicated portion of the data set may be performed in response to determining that the time values for updates indicate that the replicated portion of the data set is consistent with a time value identified for the query according to the global clock.
-
公开(公告)号:US20230400990A1
公开(公告)日:2023-12-14
申请号:US18331087
申请日:2023-06-07
Applicant: Amazon Technologies, Inc.
Inventor: Stefano Stefani , Timothy Andrew Rath , Chiranjeeb Buragahain , Yan Valerie Leshinsky , David Alan Lutz , Jakub Kulesza , Wei Xiao , Jai Vasanth
CPC classification number: G06F3/0619 , G06F3/0644 , G06F16/278 , G06F3/065 , G06F3/067
Abstract: A system that implements a scalable data storage service may maintain tables in a data store on behalf of storage service clients. The service may maintain table data in multiple replicas of partitions that are stored on respective computing nodes in the system. In response to detecting an anomaly in the system, detecting a change in data volume on a partition or service request traffic directed to a partition, or receiving a service request from a client to split a partition, the data storage service may create additional copies of a partition replica using a physical copy mechanism. The data storage service may issue a split command defined in an API for the data store to divide the original and additional replicas into multiple replica groups, and to configure each replica group to maintain a respective portion of the table data that was stored in the partition before the split.
-
公开(公告)号:US20230283681A1
公开(公告)日:2023-09-07
申请号:US18178379
申请日:2023-03-03
Applicant: Amazon Technologies, Inc.
Inventor: Wei Xiao , David Alan Lutz , Timothy Andrew Rath , Maximiliano Maccanti , Miguel Mascarenhas Filipe , David Craig Yanacek
CPC classification number: H04L67/51 , G06F9/505 , G06F11/3485 , G06F11/3433 , G06F11/3452 , H04L47/76 , H04L41/0896 , H04L47/78 , H04L67/01 , G06F2209/504 , Y02D10/00
Abstract: A system that provides services to clients may receive and service requests, various ones of which may require different amounts of work. The system may determine whether it is operating in an overloaded or underloaded state based on a current work throughput rate, a target work throughput rate, a maximum request rate, or an actual request rate, and may dynamically adjust the maximum request rate in response. For example, if the maximum request rate is being exceeded, the maximum request rate may be raised or lowered, dependent on the current work throughput rate. If the target or committed work throughput rate is being exceeded, but the maximum request rate is not being exceeded, a lower maximum request rate may be proposed. Adjustments to the maximum request rate may be made using multiple incremental adjustments. Service request tokens may be added to a leaky token bucket at the maximum request rate.
-
公开(公告)号:US11709600B2
公开(公告)日:2023-07-25
申请号:US16926519
申请日:2020-07-10
Applicant: Amazon Technologies, Inc.
Inventor: Stefano Stefani , Timothy Andrew Rath , Chiranjeeb Buragahain , Yan Valerie Leshinsky , David Alan Lutz , Jakub Kulesza , Wei Xiao , Jai Vasanth
CPC classification number: G06F3/0619 , G06F3/065 , G06F3/067 , G06F3/0644 , G06F16/278
Abstract: A system that implements a scalable data storage service may maintain tables in a data store on behalf of storage service clients. The service may maintain table data in multiple replicas of partitions that are stored on respective computing nodes in the system. In response to detecting an anomaly in the system, detecting a change in data volume on a partition or service request traffic directed to a partition, or receiving a service request from a client to split a partition, the data storage service may create additional copies of a partition replica using a physical copy mechanism. The data storage service may issue a split command defined in an API for the data store to divide the original and additional replicas into multiple replica groups, and to configure each replica group to maintain a respective portion of the table data that was stored in the partition before the split.
-
公开(公告)号:US11609697B2
公开(公告)日:2023-03-21
申请号:US17133009
申请日:2020-12-23
Applicant: Amazon Technologies, Inc.
Inventor: Swaminathan Sivasubramanian , Stefano Stefani , Wei Xiao , Timothy Andrew Rath , Rande A. Blackman , Grant Alexander MacDonald McAlister , Raymond S. Bradford
IPC: G06F12/00 , G06F13/00 , G06F3/06 , H04L67/1097
Abstract: A system that implements a scaleable data storage service may maintain tables in a data store on behalf of storage service clients. The service may maintain data in partitions stored on respective computing nodes in the system. The service may support multiple throughput models, including a committed throughput model and a best effort throughput model. A service request to create a table may specify that requests directed to the table should be serviced under a committed throughput model and may specify the committed throughput level in terms of logical service request units. The service may reserve low-latency storage and other resources sufficient to meet the specified committed throughput level. A client/user may request a modification to the committed throughput level in anticipation of workload changes, such as an increase or decrease in traffic or data volume. In response, the system may increase or decrease the resources reserved for the table.
-
公开(公告)号:US11443232B1
公开(公告)日:2022-09-13
申请号:US16370733
申请日:2019-03-29
Applicant: Amazon Technologies, Inc.
Inventor: Zahid Rahman , Wei Xiao , Stefano Stefani , Rahul Sharma , Siddharth Joshi
IPC: G06F7/00 , G06N20/00 , G06F16/335 , G06F16/383 , H04L67/10 , G06F40/40
Abstract: Techniques for active learning-based data labeling are described. An active learning-based data labeling service enables a user to build and manage large, high accuracy datasets for use in various machine learning systems. Machine learning may be used to automate annotation and management of the datasets, increasing efficiency of labeling tasks and reducing the time required to perform labeling. Embodiments utilize active learning techniques to reduce the amount of a dataset that requires manual labeling. As subsets of the dataset are labeled, this label data is used to train a model which can then identify additional objects in the dataset without manual intervention. The label data can be added to an augmented manifest, the augmented manifest can be used to filter the dataset to perform further labeling jobs on the same or different subsets of the dataset.
-
-
-
-
-
-
-
-
-