Locality aware, two-level fingerprint caching
    1.
    Granted patent
    Status: in force

    Publication number: US09495379B2

    Publication date: 2016-11-15

    Application number: US13646852

    Application date: 2012-10-08

    CPC classification number: G06F17/30156 G06F11/1453 G06F2201/81 H04L67/2847

    Abstract: The present disclosure provides for implementing a two-level fingerprint caching scheme for a client cache and a server cache. The client cache hit ratio can be improved by pre-populating the client cache with fingerprints that are relevant to the client. Relevant fingerprints include fingerprints used during a recent time period (e.g., fingerprints of segments that are included in the last full backup image and any following incremental backup images created for the client after the last full backup image), and thus are referred to as fingerprints with good temporal locality. Relevant fingerprints also include fingerprints associated with a storage container that has good spatial locality, and thus are referred to as fingerprints with good spatial locality. A pre-set threshold established for the client cache (e.g., threshold Tc) is used to determine whether a storage container (and thus fingerprints associated with the storage container) has good spatial locality.
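The pre-population step described in this abstract can be sketched as follows. This is a minimal illustration, not the patented method: the abstract does not say how the threshold Tc is applied, so the sketch assumes a container has good spatial locality when at least `tc` of its fingerprints also appear in the client's recent backups. The function name, the dictionary layout, and the sample data are all hypothetical.

```python
from typing import Dict, Iterable, Set


def prepopulate_client_cache(
    recent_fingerprints: Iterable[str],
    containers: Dict[str, Set[str]],
    tc: int,
) -> Set[str]:
    """Return the set of fingerprints to load into the client cache."""
    # Temporal locality: fingerprints from the last full backup and the
    # incrementals that followed it.
    temporal = set(recent_fingerprints)
    cache = set(temporal)
    for container_fps in containers.values():
        # Assumption: a container has good spatial locality when at least
        # `tc` of its fingerprints were used by the client recently.
        if len(container_fps & temporal) >= tc:
            cache |= container_fps
    return cache


# Hypothetical data: container c1 shares two fingerprints with the recent
# backups, container c2 shares only one.
recent = ["a", "b", "c"]
containers = {"c1": {"a", "b", "x"}, "c2": {"c", "y", "z"}}
cache = prepopulate_client_cache(recent, containers, tc=2)
```

With `tc=2`, only container `c1` qualifies, so its remaining fingerprint `x` is pulled into the cache alongside the recently used ones.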

    Systems and methods for parallel content-defined data chunking
    2.
    Granted patent
    Status: in force

    Publication number: US09477677B1

    Publication date: 2016-10-25

    Application number: US13889118

    Application date: 2013-05-07

    CPC classification number: G06F17/30159

    Abstract: A computer-implemented method for parallel content-defined data chunking may include (1) identifying a data stream to be chunked, (2) splitting the data stream into a plurality of data sub-streams by alternatingly dividing consecutive bytes of the data stream among the plurality of data sub-streams, and (3) chunking, in parallel, each data sub-stream within the plurality of data sub-streams into a plurality of data segments using a content-defined chunking algorithm. Various other methods, systems, and computer-readable media are also disclosed.
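The three steps in this abstract can be illustrated with a short sketch. It is an assumption-laden toy rather than the patented implementation: the chunker cuts wherever the sum of the last few bytes has its low bits equal to zero, which merely stands in for a real content-defined chunking algorithm, and a thread pool stands in for truly parallel workers. All function names and parameters are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List


def split_round_robin(data: bytes, n: int) -> List[bytes]:
    """Deal consecutive bytes of the stream alternately into n sub-streams."""
    return [data[i::n] for i in range(n)]


def chunk_content_defined(
    data: bytes, window: int = 4, mask: int = 0x0F, max_size: int = 64
) -> List[bytes]:
    """Toy content-defined chunker (assumption, not the patent's algorithm).

    A boundary is placed where the sum of the last `window` bytes has its
    low bits (selected by `mask`) equal to zero, or at `max_size` bytes.
    """
    chunks = []
    start = 0
    for i in range(len(data)):
        size = i - start + 1
        if size < window:
            continue
        window_sum = sum(data[i - window + 1 : i + 1])
        if (window_sum & mask) == 0 or size >= max_size:
            chunks.append(data[start : i + 1])
            start = i + 1
    if start < len(data):
        chunks.append(data[start:])
    return chunks


def chunk_in_parallel(data: bytes, n: int) -> List[List[bytes]]:
    """Split the stream, then chunk every sub-stream concurrently."""
    substreams = split_round_robin(data, n)
    with ThreadPoolExecutor(max_workers=n) as pool:
        return list(pool.map(chunk_content_defined, substreams))


data = bytes(range(256)) * 4
segments = chunk_in_parallel(data, 2)
```

Each entry of `segments` holds the chunks of one sub-stream; concatenating them reproduces that sub-stream exactly, so no bytes are lost or reordered by the split.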

    SYSTEMS AND METHODS FOR PROVIDING INCREASED SCALABILITY IN DEDUPLICATION STORAGE SYSTEMS
    3.
    Patent application
    Status: in force

    Publication number: US20150112950A1

    Publication date: 2015-04-23

    Application number: US14580246

    Application date: 2014-12-23

    Abstract: A computer-implemented method for providing increased scalability in deduplication storage systems may include (1) identifying a database that stores a plurality of reference objects, (2) determining that at least one size-related characteristic of the database has reached a predetermined threshold, (3) partitioning the database into a plurality of sub-databases capable of being updated independent of one another, (4) identifying a request to perform an update operation that updates one or more reference objects stored within at least one sub-database, and then (5) performing the update operation on less than all of the sub-databases to avoid processing costs associated with performing the update operation on all of the sub-databases. Various other systems, methods, and computer-readable media are also disclosed.
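A rough sketch of the partition-on-threshold idea follows. Several details are assumptions made only for illustration: the size-related characteristic is taken to be the entry count, reference objects are modeled as sets of backup names keyed by segment name, and entries are assigned to sub-databases by a CRC32 of the key. The class and its method names are hypothetical.

```python
import zlib
from typing import Dict, List, Set


class ReferenceStore:
    """Reference database that splits into sub-databases once it grows large."""

    def __init__(self, max_entries: int, partitions: int) -> None:
        self.max_entries = max_entries  # assumed size-related threshold
        self.partitions = partitions    # number of sub-databases after the split
        self.subdbs: List[Dict[str, Set[str]]] = [{}]

    def _index(self, key: str) -> int:
        # Stable hash, so a key always maps to the same sub-database.
        return zlib.crc32(key.encode()) % len(self.subdbs)

    def _maybe_partition(self) -> None:
        if len(self.subdbs) == 1 and len(self.subdbs[0]) >= self.max_entries:
            old = self.subdbs[0]
            self.subdbs = [{} for _ in range(self.partitions)]
            for key, refs in old.items():
                self.subdbs[self._index(key)][key] = refs

    def add_references(self, updates: Dict[str, str]) -> Set[int]:
        """Apply {segment key: backup name}; return the sub-databases touched."""
        touched = set()
        for key, ref in updates.items():
            idx = self._index(key)
            self.subdbs[idx].setdefault(key, set()).add(ref)
            touched.add(idx)
        self._maybe_partition()
        return touched


store = ReferenceStore(max_entries=4, partitions=4)
store.add_references({f"seg{i}": "backup1" for i in range(4)})  # triggers the split
touched = store.add_references({"seg0": "backup2"})             # touches one sub-database
```

After the split, an update to a single reference object only visits the sub-database that holds it, which is the cost saving the abstract refers to.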

    Autonomous container group management for scale out deduplication system

    Publication number: US09626253B1

    Publication date: 2017-04-18

    Application number: US14316636

    Application date: 2014-06-26

    Abstract: A method for data container group management in a deduplication system is provided. The method includes arranging a plurality of data container groups according to a plurality of file systems. A subset of the plurality of data container groups correspond to each of the plurality of file systems, each of the plurality of data container groups having a reference database, a plurality of data containers, and a data container group identifier (ID). The method includes performing a first backup process for a first client-policy pair with deduplication via a first one of the plurality of data container groups and performing a second backup process for a second client-policy pair with deduplication via a second one of the plurality of data container groups.

    Processes and Methods For Client-Side Fingerprint Caching To Improve Deduplication System Backup Performance
    5.
    Patent application
    Status: in force

    Publication number: US20150046403A1

    Publication date: 2015-02-12

    Application number: US14524397

    Application date: 2014-10-27

    CPC classification number: G06F11/1458 G06F11/1453

    Abstract: A system and method for caching fingerprints in a client cache is provided. A data object that comprises a set of data segments and describes a backup process is identified. Thereafter, a request referencing the data object is made to a deduplication server to request that a task identifier be added to the data object. If the deduplication server is able to successfully add the task identifier to the data object, then an active identifier is added to each data segment from the set of data segments in a cache that is within a client system.

    DEDUPLICATION STORAGE SYSTEM WITH EFFICIENT REFERENCE UPDATING AND SPACE RECLAMATION
    6.
    Patent application
    Status: under examination (published)

    Publication number: US20140244599A1

    Publication date: 2014-08-28

    Application number: US13774083

    Application date: 2013-02-22

    Abstract: A deduplication storage system and associated methods are described. The deduplication storage system may split data objects into segments and store the segments. A plurality of data segment containers may be maintained. Each of the containers may include two or more of the data segments. Maintaining the containers may include maintaining a respective logical size of each container. In response to detecting that the logical size of a particular container has fallen below a threshold level, the deduplication storage system may perform an operation to reclaim the storage space allocated to one or more of the data segments included in the particular container.
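The threshold-triggered reclamation in this abstract might look roughly like the sketch below. The abstract does not define "logical size", so the sketch assumes it is the total size of segments that are still referenced; the reference counts, class names, and sample sizes are likewise hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class Segment:
    size: int
    refcount: int


@dataclass
class Container:
    segments: Dict[str, Segment] = field(default_factory=dict)

    @property
    def logical_size(self) -> int:
        # Assumption: logical size counts only segments still referenced.
        return sum(s.size for s in self.segments.values() if s.refcount > 0)

    def release(self, fingerprint: str, threshold: int) -> int:
        """Drop one reference; reclaim space if logical size falls below threshold.

        Returns the number of bytes reclaimed (0 if no reclamation ran).
        """
        seg = self.segments[fingerprint]
        seg.refcount = max(0, seg.refcount - 1)
        if self.logical_size < threshold:
            return self.reclaim()
        return 0

    def reclaim(self) -> int:
        """Remove unreferenced segments and report the space freed."""
        dead = [fp for fp, s in self.segments.items() if s.refcount == 0]
        freed = sum(self.segments[fp].size for fp in dead)
        for fp in dead:
            del self.segments[fp]
        return freed


container = Container(
    {"a": Segment(100, 1), "b": Segment(100, 1), "c": Segment(100, 1)}
)
freed_first = container.release("a", threshold=150)   # logical size 200: no reclaim
freed_second = container.release("b", threshold=150)  # logical size 100: reclaim a and b
```

Deferring reclamation until the logical size drops below the threshold means the container is rewritten once for several dead segments rather than once per dereference.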

    Locality Aware, Two-Level Fingerprint Caching
    7.
    Patent application
    Status: in force

    Publication number: US20140101113A1

    Publication date: 2014-04-10

    Application number: US13646852

    Application date: 2012-10-08

    CPC classification number: G06F17/30156 G06F11/1453 G06F2201/81 H04L67/2847

    Abstract: The present disclosure provides for implementing a two-level fingerprint caching scheme for a client cache and a server cache. The client cache hit ratio can be improved by pre-populating the client cache with fingerprints that are relevant to the client. Relevant fingerprints include fingerprints used during a recent time period (e.g., fingerprints of segments that are included in the last full backup image and any following incremental backup images created for the client after the last full backup image), and thus are referred to as fingerprints with good temporal locality. Relevant fingerprints also include fingerprints associated with a storage container that has good spatial locality, and thus are referred to as fingerprints with good spatial locality. A pre-set threshold established for the client cache (e.g., threshold Tc) is used to determine whether a storage container (and thus fingerprints associated with the storage container) has good spatial locality.

    Method and system for efficient space management for single-instance-storage volumes
    8.
    Granted patent
    Status: in force

    Publication number: US08650371B1

    Publication date: 2014-02-11

    Application number: US13668726

    Application date: 2012-11-05

    CPC classification number: G06F11/1453 G06F11/1448

    Abstract: A method and system for efficient space management for single-instance-storage volumes is provided. A backup module storing data within a collection of containers according to access locality and retention time of the data, wherein the retention time defines an amount of time the data is stored within the collection of containers before deletion of the data, and the access locality comprises an order in which the data is to be accessed is further provided. A compaction module compacting the stored data by selecting at least two containers among the collection of containers, wherein the selection is performed using a predetermined criteria that includes access locality and retention time of the data is also provided. The compaction module distributes the data among the at least two containers. The compaction criteria creates an imbalance among the containers to create more empty, full, or nearly full containers.

    Handling data segments in deduplication
    10.
    Granted patent
    Status: in force

    Publication number: US09442807B1

    Publication date: 2016-09-13

    Application number: US13935405

    Application date: 2013-07-03

    CPC classification number: G06F17/30159

    Abstract: In some embodiments, a method of maintaining a reference list for data deduplication is provided. The method includes discarding a newly arriving data segment in response to finding a fingerprint of the newly arriving data segment matches an existing fingerprint in a plurality of fingerprints on a fingerprint-to-file reference list. The method includes adding, in the fingerprint-to-file reference list, to a list for the existing fingerprint, a source for the newly arriving data segment, in response to the fingerprint-to-file reference list indicating the existing fingerprint does not correspond to a hot data segment and setting an indication in the fingerprint-to-file reference list that the existing fingerprint corresponds to the hot data segment in response to the list for the existing fingerprint meeting or exceeding a predetermined number of entries. Other embodiments are included.
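The hot-segment rule in this abstract can be sketched as below. The class, its field names, and the handling of a fingerprint seen for the first time are assumptions added for illustration; the abstract only covers the case where the fingerprint already exists on the reference list.

```python
from typing import Dict, List


class ReferenceList:
    """Fingerprint-to-file reference list with a hot-segment flag."""

    def __init__(self, hot_limit: int) -> None:
        self.hot_limit = hot_limit             # the predetermined number of entries
        self.sources: Dict[str, List[str]] = {}
        self.hot: Dict[str, bool] = {}

    def ingest(self, fingerprint: str, source: str) -> bool:
        """Return True if the segment must be stored, False if it is discarded."""
        if fingerprint not in self.sources:
            # Assumption: a new fingerprint starts a list with its first source.
            self.sources[fingerprint] = [source]
            self.hot[fingerprint] = False
            return True
        # Duplicate segment: the data is discarded in every case.
        if not self.hot[fingerprint]:
            self.sources[fingerprint].append(source)
            if len(self.sources[fingerprint]) >= self.hot_limit:
                self.hot[fingerprint] = True
        # Once a fingerprint is hot, further sources are no longer recorded.
        return False


refs = ReferenceList(hot_limit=3)
results = [refs.ingest("fp1", f"file{i}") for i in range(5)]
```

Once a segment is marked hot, its per-fingerprint list stops growing, which bounds the bookkeeping cost for very widely shared segments.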
