Patent search ap:("AMAZON TECHNOLOGIES INC.") AND inv:"Wei Xiao"

71.

Granted invention patent
Real-time monitoring of IO load and latency (Active)

Publication (announcement) number: US10924562B1

Publication (announcement) date: 2021-02-16

Application number: US13886025

Application date: 2013-05-02

Applicant: Amazon Technologies, Inc.

Inventor: Wei Xiao, Kiran-Kumar Muniswamy-Reddy, Yijun Lu, Bjorn Patrick Swift, Miguel Mascarenhas Filipe

IPC: H04L29/08

Abstract: Providers of web services and other types of software as a service may be subject to service-level agreements requiring that response times be within a defined range. For efficiency, multiple services may be hosted on the same set of computing nodes, which may jeopardize adherence to service-level agreements. A control system may involve classifying service requests and determining desired values for measurements such as latency. An error value may be calculated based on the difference between measured and desired values. A controller may adjust a rate of capacity utilization for the computing nodes based on the current error, a history of past errors, and a prediction of future errors.
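
The abstract does not name a specific control law. One common way to combine the current error, a history of past errors, and a trend-based estimate of future error is a PID controller, and the sketch below assumes that reading. The class name, the gains, and the `adjust_rate` helper are hypothetical and not taken from the patent.

```python
from dataclasses import dataclass


@dataclass
class PIDController:
    """Hypothetical PID controller; gains are illustrative assumptions."""
    kp: float  # weight on the current error
    ki: float  # weight on the accumulated (past) error
    kd: float  # weight on the error trend (a simple prediction)
    integral: float = 0.0
    prev_error: float = 0.0

    def update(self, desired: float, measured: float, dt: float = 1.0) -> float:
        # Error is the difference between desired and measured values.
        error = desired - measured
        self.integral += error * dt
        derivative = (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def adjust_rate(rate: float, correction: float, lo: float = 0.0, hi: float = 1.0) -> float:
    """Apply a correction to a capacity-utilization rate, clamped to [lo, hi]."""
    return max(lo, min(hi, rate + correction))
```

For example, if the desired latency is 10 ms and the measured latency is 14 ms, `update` returns a negative correction, which `adjust_rate` would use to lower the utilization rate.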

72.

Granted invention patent
Accelerator based inference service (Active)

Publication (announcement) number: US10853129B1

Publication (announcement) date: 2020-12-01

Application number: US16358355

Application date: 2019-03-19

Applicant: Amazon Technologies, Inc.

Inventor: Sudipta Sengupta, Haifeng He, Pejus Manoj Das, Poorna Chand Srinivas Perumalla, Wei Xiao, Shirley Xue Yi Leung, Vladimir Mitrovic, Yongcong Luo, Jiacheng Guo, Stefano Stefani, Matthew Shawn Wilson

IPC: G06F9/46, G06F9/48, G06N20/00, G06N5/04, G06F9/50, G06N3/08, G06F9/455

Abstract: Implementations detailed herein include description of a computer-implemented method to migrate a machine learning model from one accelerator portion (such as a portion of a graphical processor unit (GPU)) to a different accelerator portion. In some instances, a state of the first accelerator portion is persisted, the second accelerator portion is configured, the first accelerator portion is then detached from a client application instance, and at least a portion of an inference request is performed using the loaded at least a portion of the machine learning model on the second accelerator portion that had been configured.

73.

Granted invention patent
Projection-based updates (Active)

Publication (announcement) number: US10437809B1

Publication (announcement) date: 2019-10-08

Application number: US14868086

Application date: 2015-09-28

Applicant: Amazon Technologies, Inc.

Inventor: Wei Xiao, Jeffrey Hocheng Nieh, Fahad Ahmed, David Craig Yanacek, Andrew Desmond Budiman, Usman Ahmed Shami

IPC: G06F17/30, G06F16/23, G06F16/22

Abstract: A repository of key-value data may store a first object value having an internal structure of a hierarchy of sub-objects. The repository may receive a request to modify the first object, expressed as a projection of locations in the object to be updated and a function that, upon evaluation, returns values to be used to update the projected locations of the object. The repository may determine that the locations specified by the projections correspond to non-overlapping regions of the object and, based on the determination, update the object using the results of evaluating the function.
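
As a rough illustration of the idea in this abstract, the sketch below models an object as a nested dict and a projection as a list of key paths. It assumes that two locations overlap when one path is a prefix of the other; the patent may define overlap differently. All names (`overlaps`, `apply_projection_update`) are hypothetical.

```python
import copy
from typing import Any, Callable, Sequence, Tuple

Path = Tuple[str, ...]


def overlaps(a: Path, b: Path) -> bool:
    """Assumed rule: two paths overlap if one is a prefix of the other."""
    n = min(len(a), len(b))
    return a[:n] == b[:n]


def apply_projection_update(
    obj: dict,
    paths: Sequence[Path],
    fn: Callable[[dict], Sequence[Any]],
) -> dict:
    """Return a copy of obj with each projected location set to fn's result."""
    if any(len(p) == 0 for p in paths):
        raise ValueError("each projected path must name at least one key")
    # Reject the request unless all projected locations are non-overlapping.
    for i, a in enumerate(paths):
        for b in paths[i + 1:]:
            if overlaps(a, b):
                raise ValueError(f"overlapping projections: {a} and {b}")
    values = fn(obj)  # evaluate the function once against the current object
    if len(values) != len(paths):
        raise ValueError("function must return one value per projected path")
    result = copy.deepcopy(obj)
    for path, value in zip(paths, values):
        node = result
        for key in path[:-1]:
            node = node.setdefault(key, {})
        node[path[-1]] = value
    return result
```

Because the function is evaluated before any location is written, every returned value is computed from the same original state of the object.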

74.

Granted invention patent
Table and index communications channels (Active)

Publication (announcement) number: US10216768B1

Publication (announcement) date: 2019-02-26

Application number: US14182258

Application date: 2014-02-17

Applicant: AMAZON TECHNOLOGIES, INC.

Inventor: Xianglong Huang, Yijun Lu, Wei Xiao, Jiandan Zheng

IPC: G06F17/30

Abstract: One or more table partitions may communicate with an index partition that may be a master of a replication group. A communications channel may exist between table partitions and the index partition. Upon splitting the index partition, communications between the table partitions and the index partition may be suspended. Upon completion of the split, communications may be reestablished between the table partitions and a partition, of the replication group of index partitions, designated to be a master following the split. Messages accumulated by the table partitions during the split may be sent to the index partition upon reestablishing communications.
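
The suspend, accumulate, and reestablish sequence in this abstract can be sketched as an in-process channel with a pending queue. This is a minimal illustration under assumed names (`IndexPartition`, `TableToIndexChannel`); it ignores networking, replication, and failure handling, none of which the abstract specifies.

```python
from collections import deque


class IndexPartition:
    """Hypothetical stand-in for an index partition that records messages."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.received = []

    def receive(self, message) -> None:
        self.received.append(message)


class TableToIndexChannel:
    """Channel from a table partition to the current master index partition."""

    def __init__(self, master: IndexPartition) -> None:
        self._master = master
        self._suspended = False
        self._pending = deque()

    def send(self, message) -> None:
        # While the index partition is splitting, accumulate messages locally.
        if self._suspended:
            self._pending.append(message)
        else:
            self._master.receive(message)

    def suspend(self) -> None:
        self._suspended = True

    def reestablish(self, new_master: IndexPartition) -> None:
        # Point at the master designated after the split, then flush in order.
        self._master = new_master
        self._suspended = False
        while self._pending:
            self._master.receive(self._pending.popleft())
```

A first-in, first-out queue is used here so that accumulated messages reach the new master in the order they were produced; that ordering is an assumption of the sketch.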

75.

Invention application
CONSISTENT QUERY OF LOCAL INDEXES (Under examination, published)

Publication (announcement) number: US20180210914A1

Publication (announcement) date: 2018-07-26

Application number: US15925666

Application date: 2018-03-19

Applicant: Amazon Technologies, Inc.

Inventor: Xianglong Huang, David Alan Lutz, Wei Xiao, Maximiliano Maccanti, Somasundaram Perianayagam, Rande A. Blackman, Stuart Henry Seelye Marshall

IPC: G06F17/30

CPC classification number: G06F16/24535, G06F16/22, G06F16/2365, G06F16/2379, G06F16/275

Abstract: A distributed database management system may comprise a plurality of computing nodes. A request to update an item maintained by the system may be acknowledged as durable and committed once an entry corresponding to the request has been written to a log file and quorum among the computing nodes has been achieved. Improved consistency may be achieved by maintaining snapshots of committed item states within queryable in-memory snapshot data structures. Range queries may be performed by merging a secondary index with the snapshots and applying filters. Projections may be completed by retrieving additional data from an item collection maintained on one or more storage devices.

76.

Granted invention patent
Consistent query of local indexes (Active)

Publication (announcement) number: US09922086B1

Publication (announcement) date: 2018-03-20

Application number: US15400175

Application date: 2017-01-06

Applicant: Amazon Technologies, Inc.

Inventor: Xianglong Huang, David Alan Lutz, Wei Xiao, Maximiliano Maccanti, Somasundaram Perianayagam, Rande A. Blackman, Stuart Henry Seelye Marshall

IPC: G06F17/30

CPC classification number: G06F17/30451, G06F17/30312, G06F17/30371, G06F17/30377, G06F17/30581

Abstract: A distributed database management system may comprise a plurality of computing nodes. A request to update an item maintained by the system may be acknowledged as durable and committed once an entry corresponding to the request has been written to a log file and quorum among the computing nodes has been achieved. Improved consistency may be achieved by maintaining snapshots of committed item states within queryable in-memory snapshot data structures. Range queries may be performed by merging a secondary index with the snapshots and applying filters. Projections may be completed by retrieving additional data from an item collection maintained on one or more storage devices.

77.

Granted invention patent
Implicit prioritization to rate-limit secondary index creation for an online table (Active)

Publication (announcement) number: US09898614B1

Publication (announcement) date: 2018-02-20

Application number: US14859072

Application date: 2015-09-18

Applicant: Amazon Technologies, Inc.

Inventor: Kiran Kumar Muniswamy Reddy, Wei Xiao, Adam Douglas Morley, Lokendra Singh Panwar

IPC: G06F21/62, G06F17/30, H04L12/26, H04L12/819

CPC classification number: G06F21/62, G06F17/30321, G06F21/604, H04L43/0876, H04L47/215

Abstract: A data storage system may implement implicit prioritization to rate-limit secondary index creation for an online table. A secondary index may be generated for a table stored in a data store. The table may be incrementally indexed, performing multiple indexing operations to populate the secondary index. Prior to performing an indexing operation, an evaluation of a capacity limitation for performing indexing operations may be made with respect to capacity to process access requests at the data store. If a determination is made that performance of the indexing operation exceeds the capacity limitation, then the indexing operation may be throttled. If a determination is made that performance of the indexing operation does not exceed the capacity limitation, then the indexing operation may be performed.
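
The check-then-throttle decision described in this abstract might look like the sketch below. It assumes capacity is counted in abstract units per interval and that client access requests are served first, so indexing only uses what is left over. The function name, parameters, and unit model are illustrative assumptions, not details from the patent.

```python
from typing import List, Sequence, Tuple


def run_indexing_interval(
    items_to_index: Sequence[str],
    capacity_per_interval: int,
    access_request_load: int,
    cost_per_item: int = 1,
) -> Tuple[List[str], List[str]]:
    """Return (indexed, throttled) items for one interval.

    Access requests implicitly take priority: indexing operations may only
    consume the capacity that access requests leave unused.
    """
    spare = max(0, capacity_per_interval - access_request_load)
    indexed: List[str] = []
    throttled: List[str] = []
    for item in items_to_index:
        # Evaluate the capacity limitation before each indexing operation.
        if cost_per_item <= spare:
            indexed.append(item)
            spare -= cost_per_item
        else:
            throttled.append(item)  # deferred to a later interval
    return indexed, throttled
```

With a capacity of 10 units and 8 units used by access requests, two of three pending items would be indexed and the third throttled.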

78.

Granted invention patent
Backup of partitioned database tables (Active)

Publication (announcement) number: US09633051B1

Publication (announcement) date: 2017-04-25

Application number: US14032883

Application date: 2013-09-20

Applicant: Amazon Technologies, Inc.

Inventor: Maximiliano Maccanti, Timothy Andrew Rath, Rama Krishna Sandeep Pokkunuri, Akshat Vig, Clarence Wing Yin Ng, Srivaths Badrinath Copparam, Rajaprabhu Thiruchi Loganathan, Wei Xiao, William Alexander Stevenson

IPC: G06F7/00, G06F17/00, G06F17/30

CPC classification number: G06F11/1469, G06F11/1451, G06F11/1458, G06F11/2094, G06F2201/80

Abstract: A system that implements a data storage service may store data for a database table in multiple replicated partitions on respective storage nodes. In response to a request to back up a table, the service may back up individual partitions of the table to a remote storage system independently and (in some cases) in parallel, and may update (or create) and store metadata about the table and its partitions on storage nodes of the data storage service and/or in the remote storage system. Backing up each partition may include exporting it from the database in which the table is stored, packaging and compressing the exported partition for upload, and uploading the exported, packaged, and compressed partition to the remote storage system. The remote storage system may be a key-value durable storage system in which each backed-up partition is accessible using its partition identifier as the key.
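
The export, compress, and upload steps in this abstract can be sketched with standard-library pieces. Here a plain dict stands in for the remote key-value store, JSON stands in for the export format, and gzip for the compression; all of these are assumptions made for illustration, as are the function names.

```python
import gzip
import json
from concurrent.futures import ThreadPoolExecutor
from typing import Dict, List


def backup_partition(partition_id: str, rows: List[dict], remote_store: Dict[str, bytes]) -> str:
    """Export one partition, compress it, and upload it under its partition ID."""
    exported = json.dumps(rows, sort_keys=True).encode("utf-8")
    remote_store[partition_id] = gzip.compress(exported)
    return partition_id


def backup_table(partitions: Dict[str, List[dict]], remote_store: Dict[str, bytes]) -> dict:
    """Back up each partition independently and in parallel; return table metadata."""
    with ThreadPoolExecutor() as pool:
        futures = [
            pool.submit(backup_partition, pid, rows, remote_store)
            for pid, rows in partitions.items()
        ]
        done = [f.result() for f in futures]
    return {"partition_ids": sorted(done)}


def restore_partition(partition_id: str, remote_store: Dict[str, bytes]) -> List[dict]:
    """Fetch a backed-up partition by using its partition ID as the key."""
    return json.loads(gzip.decompress(remote_store[partition_id]).decode("utf-8"))
```

The metadata returned by `backup_table` is deliberately minimal; the abstract mentions metadata about the table and its partitions but does not list its fields.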

79.

Granted invention patent
Equitable distribution of excess shared-resource throughput capacity (Active)

Publication (announcement) number: US09553821B2

Publication (announcement) date: 2017-01-24

Application number: US13926684

Application date: 2013-06-25

Applicant: Amazon Technologies, Inc.

Inventor: Wei Xiao, Bjorn Patrick Swift, Kiran-Kumar Muniswamy-Reddy, Miguel Mascarenhas Filipe, Yijun Lu, Stuart Henry Seelye Marshall, Stefano Stefani, James R. Hamilton

IPC: G06F15/173, H04L12/911

CPC classification number: H04L47/215, H04L43/16, H04L47/70, H04L47/80

Abstract: Methods and apparatus for equitable distribution of excess shared-resource throughput capacity are disclosed. A first and a second work target are configured to access a shared resource to implement accepted work requests. Admission control is managed at the work targets using respective token buckets. A first metric indicative of the work request arrival rates at the work targets during a time interval, and a second metric associated with the provisioned capacities of the work targets are determined. A number of tokens determined based on a throughput limit of the shared resource is distributed among the work targets to be used for admission control during a subsequent time interval. The number of tokens distributed to each work target is based on the first metric and/or the second metric.
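
One way to picture the token distribution in this abstract is a weighted split of a fixed token budget. The sketch below assumes a linear blend of each work target's share of arrivals and its share of provisioned capacity, controlled by a hypothetical `alpha` parameter; the patent does not state this formula.

```python
from typing import Dict


def distribute_tokens(
    total_tokens: int,
    arrival_rates: Dict[str, float],
    provisioned: Dict[str, float],
    alpha: float = 0.5,
) -> Dict[str, int]:
    """Split total_tokens among work targets for the next time interval.

    alpha = 1.0 uses only arrival rates (first metric);
    alpha = 0.0 uses only provisioned capacity (second metric).
    """
    names = list(arrival_rates)
    total_arrival = sum(arrival_rates.values()) or 1
    total_prov = sum(provisioned.values()) or 1
    weights = {
        n: alpha * arrival_rates[n] / total_arrival
        + (1 - alpha) * provisioned[n] / total_prov
        for n in names
    }
    total_w = sum(weights.values())
    if total_w == 0:  # no signal at all: fall back to an equal split
        weights = {n: 1.0 for n in names}
        total_w = float(len(names))
    shares = {n: total_tokens * weights[n] / total_w for n in names}
    alloc = {n: int(shares[n]) for n in names}
    # Hand out tokens lost to rounding, largest fractional remainder first.
    leftover = total_tokens - sum(alloc.values())
    by_remainder = sorted(names, key=lambda n: shares[n] - alloc[n], reverse=True)
    for n in by_remainder[:leftover]:
        alloc[n] += 1
    return alloc
```

Each work target would then add its allocation to its own token bucket and admit work requests against it during the following interval.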

80.

Granted invention patent
Database system providing skew metrics across a key space (Active)

Publication (announcement) number: US09384227B1

Publication (announcement) date: 2016-07-05

Application number: US13909418

Application date: 2013-06-04

Applicant: Amazon Technologies, Inc.

Inventor: Wei Xiao, Yijun Lu, Miguel Mascarenhas Filipe, Kiran-Kumar Muniswamy-Reddy, Bjorn Patrick Swift, David Craig Yanacek, Stuart Henry Seelye Marshall

IPC: G06F7/00, G06F17/30

CPC classification number: G06F17/30584, G06F17/30339

Abstract: A database service may maintain tables on behalf of clients and may provision throughput capacity for those tables. A table may be divided into multiple partitions, according to hash of the primary key values for each of the items in the table, and the items in the table may be accessed using the hash of their primary key values. Provisioned throughput capacity for the table may be divided between the partitions and used in servicing requests directed to items in the table. The service (or underlying system) may provide mechanisms for generating skew-related metrics or reports and presenting them to clients via a graphical user interface (GUI). The metrics and reports may indicate the amount of uniformity or skew in the distribution of requests across the key space for the table using histograms, heat maps, or other representations. Clients may initiate actions to correct any skewing via the GUI.
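
A histogram of requests over the hashed key space, plus a single skew number, could be computed along the lines below. The choice of SHA-256, the bucket count, and the max-over-mean ratio are assumptions for illustration; the abstract names histograms and heat maps but no particular hash or metric.

```python
import hashlib
from collections import Counter
from typing import Iterable, List


def bucket_for(key: str, buckets: int) -> int:
    """Map a primary key to a bucket of the hashed key space."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % buckets


def request_histogram(requested_keys: Iterable[str], buckets: int = 8) -> List[int]:
    """Count requests per key-space bucket."""
    counts = Counter(bucket_for(k, buckets) for k in requested_keys)
    return [counts.get(b, 0) for b in range(buckets)]


def skew_ratio(histogram: List[int]) -> float:
    """Busiest bucket divided by the mean bucket load; 1.0 means uniform."""
    total = sum(histogram)
    if total == 0:
        return 0.0
    return max(histogram) / (total / len(histogram))
```

A workload dominated by one hot key yields a ratio close to the number of buckets, while evenly spread requests yield a ratio near 1.0.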
