Failure-decoupled volume-level redundancy coding techniques

    Publication No.: US10089179B2

    Publication date: 2018-10-02

    Application No.: US15640067

    Filing date: 2017-06-30

    Abstract: Techniques described and suggested herein include systems and methods for storing, indexing, and retrieving original data of data archives on data storage systems using redundancy coding techniques. For example, redundancy codes, such as erasure codes, may be applied to archives (such as those received from a customer of a computing resource service provider) so as to allow the storage of original data of the individual archives available on a minimum of volumes, such as those of a data storage system, while retaining availability, durability, and other guarantees imparted by the application of the redundancy code. Sparse indexing techniques may be implemented so as to reduce the footprint of indexes used to locate the original data, once stored. The volumes may be apportioned into failure-decorrelated subsets, and archives stored thereto may be apportioned to such subsets.
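The sparse indexing idea mentioned in this abstract can be illustrated with a minimal sketch. It assumes records are stored in sorted key order and that only every `stride`-th record gets an index entry; the function names, the record layout, and the stride value are all hypothetical and are not taken from the patent.

```python
import bisect

def build_sparse_index(records, stride):
    """Index only every `stride`-th record as a (key, position) pair."""
    return [(records[pos][0], pos) for pos in range(0, len(records), stride)]

def lookup(records, sparse_index, stride, key):
    """Jump to the nearest indexed position at or before `key`, then scan."""
    keys = [k for k, _ in sparse_index]
    slot = bisect.bisect_right(keys, key) - 1
    if slot < 0:
        return None  # key sorts before the first stored record
    start = sparse_index[slot][1]
    # At most `stride` records need to be read sequentially.
    for k, payload in records[start:start + stride]:
        if k == key:
            return payload
    return None

# Hypothetical volume contents: ten archives stored in key order.
records = [(f"archive-{i:03d}", f"data-{i}") for i in range(10)]
index = build_sparse_index(records, stride=4)
found = lookup(records, index, 4, "archive-006")
```

Here the index holds three entries for ten records, at the cost of a short sequential scan per lookup.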

    Incremental backups while minimizing media access

    Publication No.: US10042718B1

    Publication date: 2018-08-07

    Application No.: US15280944

    Filing date: 2016-09-29

    Abstract: Techniques for producing incremental short-term backups while minimizing media access are described herein. A backup request is received that specifies data to back up and a schedule for that backup. The data is then partitioned based on the schedule and, for each of the partitions, it is determined whether to store a full or incremental backup of that partition. Each partition is fully backed up once during a cycle of backups and incrementally backed up at other times. With each full backup of a partition, a reverse delta that can be used to reconstruct the previous full backup for that partition is stored with the full backup.
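The rotation and reverse-delta ideas in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented method: a partition is modeled as a dictionary of file name to content, the rotation rule (partition `i` gets its full backup on run `i` of each cycle) is one possible choice, and all function names are hypothetical.

```python
def backup_kind(partition, run, num_partitions):
    """Assumed rotation: each partition is fully backed up once per cycle."""
    return "full" if run % num_partitions == partition else "incremental"

def reverse_delta(new_full, prev_full):
    """Record what must change in `new_full` to get back to `prev_full`."""
    restore = {k: v for k, v in prev_full.items() if new_full.get(k) != v}
    drop = [k for k in new_full if k not in prev_full]
    return {"restore": restore, "drop": drop}

def apply_reverse_delta(new_full, delta):
    """Rebuild the previous full backup from the new one plus the delta."""
    old = {k: v for k, v in new_full.items() if k not in delta["drop"]}
    old.update(delta["restore"])
    return old

# One cycle of three runs over three partitions.
schedule = [[backup_kind(p, run, 3) for p in range(3)] for run in range(3)]

prev_full = {"a.txt": "1", "b.txt": "2"}
new_full = {"a.txt": "1", "b.txt": "3", "c.txt": "4"}
delta = reverse_delta(new_full, prev_full)
rebuilt = apply_reverse_delta(new_full, delta)
```

Storing the delta alongside the new full backup means the older state can be recovered without keeping the older full backup itself.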

    Incremental media size extension for grid encoded data storage systems

    Publication No.: US09904589B1

    Publication date: 2018-02-27

    Application No.: US14789778

    Filing date: 2015-07-01

    CPC classification number: G06F11/10 H03M13/11 H03M13/1148

    Abstract: Techniques for incrementally increasing media size in data storage systems using grid encoded data storage techniques are described herein. A grid of shards is created where each shard of the grid of shards has a first index and a second index, and each shard also has an associated storage device configured with a storage capacity that is large enough to store the largest set of data on a shard. Upon determining to replace the storage devices of the grid with storage devices that have a different storage capacity, the storage devices can be incrementally replaced within the grid by first padding each shard of the grid of shards with a set of data values, replacing a data shard storage device with a device of the different storage capacity, and replacing a set of derived shard storage devices with devices of the different storage capacity.
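A small sketch may help show why padding first makes incremental replacement possible. It assumes a single row of the grid, a derived shard computed as the XOR of the data shards, and zero bytes as the padding values; the abstract fixes none of these choices, and the helper names are hypothetical.

```python
from functools import reduce

def xor_bytes(a, b):
    """Byte-wise XOR of two equal-length byte strings."""
    return bytes(x ^ y for x, y in zip(a, b))

def parity(shards):
    """Derived shard for one grid row, assumed here to be simple XOR parity."""
    return reduce(xor_bytes, shards)

def pad(shard, size):
    """Extend a shard to `size` bytes with zero padding."""
    return shard + bytes(size - len(shard))

data_shards = [b"\x01\x02", b"\x10\x20", b"\x0f\x0f"]
derived = parity(data_shards)

# Pad every shard, data and derived alike, to the larger capacity.
new_size = 4
padded_data = [pad(s, new_size) for s in data_shards]
padded_derived = pad(derived, new_size)
```

Because XOR of zero bytes is zero, the padded derived shard still equals the parity of the padded data shards, so devices can be swapped one at a time without re-encoding the row.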

    Failure-decoupled volume-level redundancy coding techniques

    Publication No.: US09697080B2

    Publication date: 2017-07-04

    Application No.: US15283023

    Filing date: 2016-09-30

    Abstract: Techniques described and suggested herein include systems and methods for storing, indexing, and retrieving original data of data archives on data storage systems using redundancy coding techniques. For example, redundancy codes, such as erasure codes, may be applied to archives (such as those received from a customer of a computing resource service provider) so as to allow the storage of original data of the individual archives available on a minimum of volumes, such as those of a data storage system, while retaining availability, durability, and other guarantees imparted by the application of the redundancy code. Sparse indexing techniques may be implemented so as to reduce the footprint of indexes used to locate the original data, once stored. The volumes may be apportioned into failure-decorrelated subsets, and archives stored thereto may be apportioned to such subsets.

    Precomputed redundancy code matrices for high-availability data storage

    Type: Invention patent application (status: under examination, published)

    Publication No.: US20170060687A1

    Publication date: 2017-03-02

    Application No.: US15350946

    Filing date: 2016-11-14

    Abstract: Techniques described and suggested herein include systems and methods for precomputing regeneration information for data archives (“archives”) that have been processed and stored using redundancy coding techniques. For example, regeneration information, such as redundancy code-related matrices (such as inverted matrices based on, e.g., a generator matrix for the selected redundancy code) corresponding to subsets of the shards, is computed for each subset and, in some embodiments, stored for use in the event that one or more shards becomes unavailable, e.g., so as to more efficiently and/or quickly regenerate a replacement shard.
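The precomputation described in this abstract can be sketched with a toy code. The sketch assumes arithmetic in the prime field GF(257), a Vandermonde generator matrix, and one symbol per shard; these are illustrative choices, not the patent's, and every function name is hypothetical. It needs Python 3.8 or later for `pow(x, -1, P)`.

```python
from itertools import combinations

P = 257  # assumed prime modulus for the toy field

def invert(matrix):
    """Gauss-Jordan inversion mod P (raises StopIteration if singular)."""
    n = len(matrix)
    aug = [row[:] + [int(i == j) for j in range(n)] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] % P)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        inv = pow(aug[col][col], -1, P)
        aug[col] = [v * inv % P for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col]:
                f = aug[r][col]
                aug[r] = [(a - f * b) % P for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def generator(n, k):
    """n x k Vandermonde matrix; any k rows are invertible mod P."""
    return [[pow(x, j, P) for j in range(k)] for x in range(1, n + 1)]

def multiply(matrix, vector):
    """Matrix-vector product mod P."""
    return [sum(m * v for m, v in zip(row, vector)) % P for row in matrix]

def precompute(G, k):
    """Invert the generator rows of every k-subset of shards ahead of time."""
    return {s: invert([G[i] for i in s]) for s in combinations(range(len(G)), k)}

def regenerate(G, table, shards, lost, k):
    """Rebuild shard `lost` using a stored inverse instead of inverting now."""
    subset = tuple(i for i in range(len(G)) if i != lost)[:k]
    data = multiply(table[subset], [shards[i] for i in subset])
    return multiply([G[lost]], data)[0]

G = generator(5, 3)
data = [72, 101, 108]
shards = multiply(G, data)
table = precompute(G, 3)
replacement = regenerate(G, table, shards, lost=1, k=3)
```

The trade-off shown is storage for latency: the table holds one inverse per subset, so a shard loss needs only a lookup and two small multiplications.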

    Precomputed redundancy code matrices for high-availability data storage

    Type: Granted invention patent (status: in force)

    Publication No.: US09495249B1

    Publication date: 2016-11-15

    Application No.: US14675471

    Filing date: 2015-03-31

    Abstract: Techniques described and suggested herein include systems and methods for precomputing regeneration information for data archives (“archives”) that have been processed and stored using redundancy coding techniques. For example, regeneration information, such as redundancy code-related matrices (such as inverted matrices based on, e.g., a generator matrix for the selected redundancy code) corresponding to subsets of the shards, is computed for each subset and, in some embodiments, stored for use in the event that one or more shards becomes unavailable, e.g., so as to more efficiently and/or quickly regenerate a replacement shard.

    Generation and verification of erasure encoded fragments

    Publication No.: US10592344B1

    Publication date: 2020-03-17

    Application No.: US15654065

    Filing date: 2017-07-19

    Abstract: Erasure encoded fragments are generated by an erasure encoding scheme, represented by an erasure encoding matrix, operating on a data file. A new erasure encoded fragment may be generated from previously-generated erasure encoded fragments without reconstructing the original data file. Available and valid erasure encoded fragments are identified and a set of those fragments is selected. A composite encoding matrix is generated based upon the selected fragments and the fragment specified to be generated. The composite matrix is applied to the selected fragments to produce a plurality of partial sums. The partial sums are then combined to generate the specified fragment. The partial sums may be produced by different devices so as to distribute the computational workload and/or to reduce network traffic. The integrity of a generated fragment may be verified by generating the specified fragment twice, using two different sets of fragments, and then comparing the two results.
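The composite-matrix and partial-sum steps in this abstract can be sketched as follows. The sketch assumes a toy Vandermonde code over the prime field GF(257) with one symbol per fragment, where fragment `x` is encoded with the row `[1, x, x^2, ...]`; these choices and all names are hypothetical, not the patent's. It needs Python 3.8 or later for `pow(x, -1, P)`.

```python
P = 257  # assumed prime modulus for the toy field

def invert(matrix):
    """Gauss-Jordan inversion mod P (raises StopIteration if singular)."""
    n = len(matrix)
    aug = [row[:] + [int(i == j) for j in range(n)] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] % P)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        inv = pow(aug[col][col], -1, P)
        aug[col] = [v * inv % P for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col]:
                f = aug[r][col]
                aug[r] = [(a - f * b) % P for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def encoding_row(x, k):
    """Encoding-matrix row for fragment id `x`."""
    return [pow(x, j, P) for j in range(k)]

def encode_fragment(x, data):
    """Direct encoding from the original data, used only for comparison."""
    return sum(r * d for r, d in zip(encoding_row(x, len(data)), data)) % P

def composite_row(selected, target, k):
    """Row that maps the selected fragments straight to the target fragment."""
    inverse = invert([encoding_row(x, k) for x in selected])
    t = encoding_row(target, k)
    return [sum(t[i] * inverse[i][j] for i in range(k)) % P for j in range(k)]

def generate(fragments, selected, target, k):
    """Combine per-fragment partial sums; the data file is never rebuilt."""
    coeffs = composite_row(selected, target, k)
    # Each partial sum could be computed on the device holding that fragment.
    partial_sums = [c * fragments[x] % P for c, x in zip(coeffs, selected)]
    return sum(partial_sums) % P

data = [5, 17, 200]
fragments = {x: encode_fragment(x, data) for x in range(1, 6)}

# Generate fragment 6 twice from different source sets, then compare.
first = generate(fragments, (1, 2, 3), 6, 3)
second = generate(fragments, (2, 4, 5), 6, 3)
verified = first == second
```

Agreement between the two independently generated results is the integrity check; a corrupted source fragment in either set would normally make them differ.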
