Insertion and eviction schemes for deduplicated cache system of a storage system
    11.
    Granted invention patent
    Legal status: Active

    Publication No.: US09390116B1

    Publication date: 2016-07-12

    Application No.: US14038673

    Filing date: 2013-09-26

    CPC classification number: G06F17/30336

    Abstract: A computer-implemented method for indexing content stored in a cache memory device is disclosed. The method starts with maintaining a fingerprint index having a plurality of fingerprint entries, each mapping a fingerprint to a storage location of a cache memory device, where the cache memory device caches some of the data blocks stored in a persistent storage device of a storage system, and where the fingerprint index is a partial index indexing a portion of the data stored in the cache memory device. In response to receiving a request to insert a new fingerprint, the method continues with evicting one of the fingerprint entries according to a predetermined eviction algorithm and inserting the new fingerprint into the evicted fingerprint entry.
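
The insert-and-evict behavior described in this abstract can be illustrated with a minimal sketch. The abstract does not name the eviction algorithm, so least-recently-used (LRU) is used here purely as a stand-in; the class name, method names, and the integer "location" values are hypothetical.

```python
from collections import OrderedDict


class PartialFingerprintIndex:
    """Fixed-capacity map from fingerprint to cache location (hypothetical names)."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.entries = OrderedDict()  # fingerprint -> cache location, oldest first

    def lookup(self, fingerprint):
        location = self.entries.get(fingerprint)
        if location is not None:
            self.entries.move_to_end(fingerprint)  # mark as recently used
        return location

    def insert(self, fingerprint, location):
        """Insert an entry; return the evicted fingerprint, or None."""
        if fingerprint in self.entries:
            self.entries[fingerprint] = location
            self.entries.move_to_end(fingerprint)
            return None
        evicted = None
        if len(self.entries) >= self.capacity:
            # Assumption: LRU stands in for the "predetermined eviction algorithm".
            evicted, _ = self.entries.popitem(last=False)
        self.entries[fingerprint] = location
        return evicted
```

Because the index is partial, a failed lookup in this sketch only means the fingerprint is not indexed; the corresponding block may still be present in the cache.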

    Memory efficient sanitization of a deduplicated storage system
    12.
    Granted invention patent
    Legal status: Active

    Publication No.: US09430164B1

    Publication date: 2016-08-30

    Application No.: US13763508

    Filing date: 2013-02-08

    Abstract: Techniques for sanitizing a storage system are described herein. In one embodiment, for each file stored in the storage system, a list of fingerprints representing data chunks of the file is obtained. In such an embodiment, for each of the fingerprints, a first container storing a data chunk corresponding to the fingerprint is identified, and a storage location of the first container in which the data chunk is stored is determined. In one embodiment, a bit in a copy bit vector (CBV) is populated based on the identified container and the storage location. In one embodiment, after all of the bits corresponding to the data chunks of the first container have been populated in the CBV, data chunks represented by the CBV are copied from the first container to a second container, and records of the data chunks in the first container are erased.
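
The copy-bit-vector flow in this abstract can be sketched roughly as follows. The data layout is an assumption made for illustration: files map to fingerprint lists, a hypothetical `chunk_index` maps each fingerprint to a (container id, slot) pair, and a container is a plain list of chunks.

```python
def build_cbv(files, chunk_index, container_sizes):
    """Populate one bit per chunk slot for every chunk referenced by a file.

    files:           file name -> list of fingerprints
    chunk_index:     fingerprint -> (container_id, slot)   (hypothetical layout)
    container_sizes: container_id -> number of chunk slots
    """
    cbv = {cid: [False] * n for cid, n in container_sizes.items()}
    for fingerprints in files.values():
        for fp in fingerprints:
            cid, slot = chunk_index[fp]
            cbv[cid][slot] = True  # chunk is live and must be copied forward
    return cbv


def copy_forward(container, bits):
    """Copy live chunks to a new container, then erase the old records."""
    survivors = [chunk for chunk, live in zip(container, bits) if live]
    container.clear()  # stands in for erasing the first container
    return survivors
```

One bit per chunk slot is what keeps the memory footprint small in this sketch: the vector grows with the number of chunks, not with their size.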

    Indexing a deduplicated cache system by integrating fingerprints of underlying deduplicated storage system
    13.
    Granted invention patent
    Legal status: Active

    Publication No.: US09336143B1

    Publication date: 2016-05-10

    Application No.: US14038694

    Filing date: 2013-09-26

    Abstract: A computer-implemented method for indexing content stored in a cache memory device is disclosed. The method starts with maintaining a file index having a plurality of extent entries, each extent entry corresponding to one of a plurality of file extents stored in a cache memory device that caches data stored in a persistent storage device of a storage system. The method continues with maintaining a fingerprint index having a plurality of fingerprint entries, each mapping a fingerprint to a data region of a file indexed in the file index, wherein each fingerprint indexed in the fingerprint index is retrieved from metadata stored in the persistent storage device of the storage system when one or more corresponding data chunks were accessed, and deduplicating and accessing the file extents stored in the cache memory device using the file index and the fingerprint index.
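
A rough sketch of the two-index arrangement in this abstract follows. It assumes that the fingerprint for each extent is handed in by the caller (standing in for metadata read from the persistent storage device) rather than recomputed by the cache; all class, field, and method names are hypothetical.

```python
class DedupCacheIndexes:
    """File index plus fingerprint index for a deduplicated cache (illustrative)."""

    def __init__(self):
        self.file_index = {}         # (file_id, offset) -> cache location
        self.fingerprint_index = {}  # fingerprint -> (file_id, offset) data region
        self.next_location = 0

    def cache_extent(self, file_id, offset, fingerprint):
        """Index an extent; return (cache location, whether new data was stored)."""
        region = self.fingerprint_index.get(fingerprint)
        if region is not None and region in self.file_index:
            # Same content is already cached under another region: share it.
            location = self.file_index[region]
            stored = False
        else:
            location = self.next_location
            self.next_location += 1
            self.fingerprint_index[fingerprint] = (file_id, offset)
            stored = True
        self.file_index[(file_id, offset)] = location
        return location, stored
```

Reusing fingerprints that the storage layer has already computed avoids hashing the data a second time on the cache path, which appears to be the point of integrating the two systems.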

    Deduplicated cache system of a storage system
    14.
    Granted invention patent
    Legal status: Active

    Publication No.: US09304914B1

    Publication date: 2016-04-05

    Application No.: US14038665

    Filing date: 2013-09-26

    CPC classification number: G06F12/0802 G06F12/0871 G06F2212/313 G06F2212/463

    Abstract: A computer-implemented method for caching content in a cache memory device is disclosed. The method starts with receiving a request for accessing a first data block associated with a first file, and a file manager provides access to the first data block in a persistent storage device of a storage system. The file manager then caches the first data block in a cache memory device, including deduplicating the first data block, wherein at least some of the data blocks stored in the cache memory device are deduplicated data blocks, and wherein at least one of the data blocks is referenced by different regions of an identical file or different files.
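
A minimal read-through sketch of such a deduplicated cache is shown below. A dictionary stands in for the persistent storage device, SHA-1 is an arbitrary choice of fingerprint function, and the class and attribute names are hypothetical.

```python
import hashlib


class DedupCache:
    """Read-through cache that keeps one copy of each unique block (illustrative)."""

    def __init__(self, storage):
        self.storage = storage  # (file, block_no) -> bytes; stands in for persistent storage
        self.blocks = {}        # fingerprint -> bytes, one copy per unique block
        self.refs = {}          # (file, block_no) -> fingerprint

    def read(self, file, block_no):
        key = (file, block_no)
        fp = self.refs.get(key)
        if fp is not None:
            return self.blocks[fp]        # cache hit
        data = self.storage[key]          # cache miss: read persistent storage
        fp = hashlib.sha1(data).hexdigest()
        self.blocks.setdefault(fp, data)  # store the block only if its content is new
        self.refs[key] = fp
        return data
```

In this sketch, two regions of one file, or regions of two different files, end up referencing the same cached block whenever their contents match.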

    Preferential selection of candidates for delta compression
    15.
    Granted invention patent
    Legal status: Active

    Publication No.: US09268783B1

    Publication date: 2016-02-23

    Application No.: US14203359

    Filing date: 2014-03-10

    CPC classification number: G06F17/30153 H03M7/3091

    Abstract: A computer-implemented method and system for improving efficiency in a delta compression process in a data storage system selects a data chunk to delta compress and selects a set of candidate data chunks using a first selection mechanism. Throughput or resource utilization is monitored. A change is made to a second selection mechanism that increases similarity of the set of candidates with the selected data chunk to improve compression in response to determining high resource availability or a high throughput level. A change is made to a third selection mechanism that increases throughput of the delta compression process in response to determining low resource availability or low throughput.
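
The switching policy in this abstract can be sketched as a small decision function. The monitored metric (an idle-resource fraction), the two thresholds, and the mechanism labels are all assumptions made for illustration; the abstract does not specify them.

```python
def select_mechanism(idle_fraction, high_water=0.6, low_water=0.2):
    """Pick a candidate-selection mechanism from monitored resource availability.

    idle_fraction: share of resources currently free, between 0.0 and 1.0
                   (hypothetical metric; thresholds are illustrative).
    """
    if idle_fraction >= high_water:
        # Plenty of headroom: search harder for more similar candidates.
        return "thorough"
    if idle_fraction <= low_water:
        # System under pressure: favor a cheaper search and higher throughput.
        return "fast"
    return "default"  # stay with the first selection mechanism
```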

    Preferential selection of candidates for delta compression
    16.
    Granted invention patent
    Legal status: Active

    Publication No.: US09262434B1

    Publication date: 2016-02-16

    Application No.: US14540895

    Filing date: 2014-11-13

    CPC classification number: G06F17/30162 H03M7/3091

    Abstract: A computer-implemented method and system for improving efficiency in a delta compression process in a data storage system selects a data chunk to delta compress and generates a sketch for the selected data chunk. The method and system search for a set of candidate data chunks with a matching sketch and rank the set of candidate data chunks by degree of sketch matching. Ties among the candidate data chunks are broken using location status data for each candidate, and the selected data chunk is delta compressed with a selected candidate data chunk. The delta-compressed selected data chunk is then stored in a data storage system.
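
Ranking by sketch match and breaking ties by location can be sketched as below. The sketch is modeled as a tuple of feature values compared position by position, and the location status is reduced to a single "already in cache" flag; both are simplifying assumptions, and the function name is hypothetical.

```python
def pick_candidate(target_sketch, candidates):
    """Return the id of the best delta-compression base, or None.

    candidates: list of (chunk_id, sketch, in_cache) tuples, where a sketch
    is a tuple of feature values (an assumption made for illustration).
    """
    def score(candidate):
        _, sketch, in_cache = candidate
        matches = sum(1 for a, b in zip(target_sketch, sketch) if a == b)
        # Primary key: degree of sketch match. Tie-break: prefer cached chunks.
        return (matches, in_cache)

    matching = [c for c in candidates if score(c)[0] > 0]
    if not matching:
        return None
    return max(matching, key=score)[0]
```

Preferring a candidate that is already in fast storage costs nothing in compression quality when match counts are equal, and it avoids an extra read of the base chunk.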

    Indexing architecture for deduplicated cache system of a storage system
    17.
    Granted invention patent
    Legal status: Active

    Publication No.: US08935446B1

    Publication date: 2015-01-13

    Application No.: US14038668

    Filing date: 2013-09-26

    CPC classification number: G06F12/0871

    Abstract: A computer-implemented method for indexing content stored in a cache memory device is disclosed. The method starts with, in response to receiving a first request for caching a first file extent associated with a first file in a cache memory device, generating a first fingerprint based on content of the first file extent. The method then continues with searching a fingerprint index based on the first fingerprint to determine whether the first file extent has been stored in the cache memory device. In response to determining that a fingerprint entry matching the first fingerprint is found, the method then continues with associating a first identifier identifying the first file extent and the first file with a storage location of the cache memory device obtained from the matching fingerprint entry, without storing the first file extent in the cache memory device.
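
The fingerprint-then-lookup sequence in this abstract can be sketched as a single function. SHA-256 is an arbitrary fingerprint choice, a Python list stands in for the cache memory device, and the function and parameter names are hypothetical.

```python
import hashlib


def cache_extent(fp_index, extent_map, cache, file_id, extent_id, data):
    """Cache a file extent, storing its bytes only if the content is new.

    fp_index:   fingerprint -> cache location
    extent_map: (file_id, extent_id) -> cache location
    cache:      list of stored extents; the list position is the location
    """
    fingerprint = hashlib.sha256(data).digest()
    location = fp_index.get(fingerprint)
    if location is None:
        # No matching fingerprint entry: store the extent and index it.
        location = len(cache)
        cache.append(data)
        fp_index[fingerprint] = location
    # Either way, associate this (file, extent) identifier with the location.
    extent_map[(file_id, extent_id)] = location
    return location
```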

    Multiple location index
    18.
    Granted invention patent

    Publication No.: US10922228B1

    Publication date: 2021-02-16

    Application No.: US14674911

    Filing date: 2015-03-31

    Abstract: Systems and methods for accessing data stored in multiple locations. A cache and a storage system are associated with an index. Entries in the index identify locations of data in both the cache and the storage system. When an index lookup occurs and an entry in the index identifies at least two locations for the data, the locations are ordered based on at least one factor and the data stored in the optimal location as determined from the at least one factor is returned.
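
A multiple-location lookup of this kind can be sketched as follows. The single ordering factor used here is a per-tier access cost, which is an assumption; the abstract only says "at least one factor". The tier names and data layout are likewise hypothetical.

```python
def lookup(index, key, stores, cost):
    """Return the data for `key` from its cheapest known location, or None.

    index:  key -> list of (tier, address) locations
    stores: tier -> {address: data}
    cost:   tier -> relative access cost (the assumed ordering factor)
    """
    locations = index.get(key, [])
    if not locations:
        return None
    # Order the candidate locations by the chosen factor; the first is optimal.
    ordered = sorted(locations, key=lambda loc: cost[loc[0]])
    tier, address = ordered[0]
    return stores[tier][address]
```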
