Real-time monitoring of IO load and latency

    公开(公告)号:US10924562B1

    公开(公告)日:2021-02-16

    申请号:US13886025

    申请日:2013-05-02

    Abstract: Providers of web services and other types of software as a service may be subject to service-level agreements requiring that response times be within a defined range. For efficiency, multiple services may be hosted on the same set of computing nodes, which may jeopardize adherence to service-level agreements. A control system may involve classifying service requests and determining desired values for measurements such as latency. An error value may be calculated based on the difference between measured and desired values. A controller may adjust a rate of capacity utilization for the computing nodes based on the current error, a history of past errors, and a prediction of future errors.

    Equitable distribution of excess shared-resource throughput capacity
    6.
    发明授权
    Equitable distribution of excess shared-resource throughput capacity 有权
    公平分配过剩的共享资源吞吐能力

    公开(公告)号:US09553821B2

    公开(公告)日:2017-01-24

    申请号:US13926684

    申请日:2013-06-25

    CPC classification number: H04L47/215 H04L43/16 H04L47/70 H04L47/80

    Abstract: Methods and apparatus for equitable distribution of excess shared-resource throughput capacity are disclosed. A first and a second work target are configured to access a shared resource to implement accepted work requests. Admission control is managed at the work targets using respective token buckets. A first metric indicative of the work request arrival rates at the work targets during a time interval, and a second metric associated with the provisioned capacities of the work targets are determined. A number of tokens determined based on a throughput limit of the shared resource is distributed among the work targets to be used for admission control during a subsequent time interval. The number of tokens distributed to each work target is based on the first metric and/or the second metric.

    Abstract translation: 披露了公平分配超额共享资源吞吐能力的方法和手段。 第一个和第二个工作目标被配置为访问共享资源以实现接受的工作请求。 使用相应令牌桶在工作目标上管理入学控制。 确定在时间间隔内指示工作目标的工作请求到达率的第一指标,以及与工作目标的提供能力相关联的第二度量。 基于共享资源的吞吐量限制确定的多个令牌被分配在在随后的时间间隔期间用于准入控制的工作目标之间。 分配给每个工作目标的令牌数量基于第一度量和/或第二度量。

    Database system providing skew metrics across a key space
    7.
    发明授权
    Database system providing skew metrics across a key space 有权
    数据库系统在关键空间提供偏斜度量

    公开(公告)号:US09384227B1

    公开(公告)日:2016-07-05

    申请号:US13909418

    申请日:2013-06-04

    CPC classification number: G06F17/30584 G06F17/30339

    Abstract: A database service may maintain tables on behalf of clients and may provision throughput capacity for those tables. A table may be divided into multiple partitions, according to hash of the primary key values for each of the items in the table, and the items in the table may be accessed using the hash of their primary key values. Provisioned throughput capacity for the table may be divided between the partitions and used in servicing requests directed to items in the table. The service (or underlying system) may provide mechanisms for generating skew-related metrics or reports and presenting them to clients via a graphical user interface (GUI). The metrics and reports may indicate the amount of uniformity or skew in the distribution of requests across the key space for the table using histograms, heat maps, or other representations. Clients may initiate actions to correct any skewing via the GUI.

    Abstract translation: 数据库服务可以代表客户端维护表,并且可以为这些表提供吞吐能力。 根据表中每个项目的主键值的散列值,可以将表分成多个分区,并且可以使用其主键值的散列来访问表中的项目。 表的设置吞吐能力可以在分区之间划分,并用于针对表中项目的服务请求。 服务(或底层系统)可以提供用于生成偏斜相关度量或报告并通过图形用户界面(GUI)将其呈现给客户端的机制。 指标和报告可以指示在使用直方图,热图或其他表示的表的关键空间中的请求分布的均匀性或偏差量。 客户端可以通过GUI启动纠正任何偏移的动作。

    Dynamic replica failure detection and healing
    8.
    发明授权
    Dynamic replica failure detection and healing 有权
    动态复制失败检测和愈合

    公开(公告)号:US09304815B1

    公开(公告)日:2016-04-05

    申请号:US13917317

    申请日:2013-06-13

    Abstract: Detecting replica faults within a replica group and dynamically scheduling replica healing operations are described. Status metadata for one or more replica groups may be accessed. Based, at least in part, the status data a number of available replicas for at least one replica group may be determined to incompliant with a healthy state definition for the replica group. One or more healing operations to restore the number of available replicas for the at least one replica group to the respective healthy state definition may be dynamically scheduled. In some embodiments, one or more resource constraints for performing healing operations and one or more resource requirements for each of the one or more healing operations may be used to order the one or more healing operations.

    Abstract translation: 对副本组中的副本故障进行检测并动态调度复制恢复操作。 可以访问一个或多个副本组的状态元数据。 至少部分地基于状态数据,至少一个副本组的可用副本的数量可以被确定为与副本组的健康状态定义不一致。 可以动态地调度用于将至少一个副本组的可用副本的数量恢复到相应的健康状态定义的一个或多个愈合操作。 在一些实施例中,用于执行愈合操作的一个或多个资源约束和针对所述一个或多个愈合操作中的每一个的一个或多个资源需求可用于对一个或多个愈合操作进行排序。

    Cross-zone replicated block storage devices

    公开(公告)号:US11494108B2

    公开(公告)日:2022-11-08

    申请号:US16579626

    申请日:2019-09-23

    Abstract: The present disclosure generally relates to creating virtualized block storage devices whose data is replicated across isolated computing systems to lower risk of data loss even in wide-scale events, such as natural disasters. The virtualized device can include at least two volumes, each of which is implemented in a distinct computing system. Each volume can be implemented by at least two computing devices, a first of which is configured as a primary device to which reads from and writes to the volume are directed. Of the two volumes, one can be indicated as primary, indicating authority to accept reads to and writes from the virtualized device. A primary device of the primary volume, on obtaining a write to the volume, can replicate the write to both a secondary device of a primary volume and to the secondary volume.

    Stream-based logging for distributed storage systems

    公开(公告)号:US11343314B1

    公开(公告)日:2022-05-24

    申请号:US15967309

    申请日:2018-04-30

    Abstract: Generally described, aspects of the present application correspond to maintaining a message stream for a network-based data store, which stream includes messages reflecting modifications to the data store. Messages within the stream may be used to revert a state of the data store to a prior point in time reflected within the messages of the stream, such as by “rewinding” operations on the data store by use of the messages within the stream. Messages in the stream may further be used to asynchronously update a replica of the data store.

Patent Agency Ranking