Content aligned block-based deduplication
    93.
    发明授权
    Content aligned block-based deduplication 有权
    内容对齐的基于块的重复数据删除

    公开(公告)号:US09110602B2

    公开(公告)日:2015-08-18

    申请号:US13750105

    申请日:2013-01-25

    Abstract: A content alignment system according to certain embodiments aligns a sliding window at the beginning of a data segment. The content alignment system performs a block alignment function on the data within the sliding window. A deduplication block is established if the output of the block alignment function meets a predetermined criteria. At least part of a gap is established if the output of the block alignment function does not meet the predetermined criteria. The predetermined criteria is changed if a threshold number of outputs fail to meet the predetermined criteria.

    Abstract translation: 根据某些实施例的内容对齐系统对准在数据段的开始处的滑动窗口。 内容对齐系统对滑动窗口内的数据执行块对齐功能。 如果块对齐功能的输出满足预定标准,则建立重复数据消除块。 如果块对准功能的输出不满足预定标准,则建立间隙的至少一部分。 如果阈值数量的输出不符合预定标准,则改变预定标准。

    BLOCK-LEVEL SINGLE INSTANCING
    94.
    发明申请
    BLOCK-LEVEL SINGLE INSTANCING 审中-公开
    块级单一实现

    公开(公告)号:US20150199242A1

    公开(公告)日:2015-07-16

    申请号:US14668450

    申请日:2015-03-25

    Abstract: Described in detail herein are systems and methods for single instancing blocks of data in a data storage system. For example, the data storage system may include multiple computing devices (e.g., client computing devices) that store primary data. The data storage system may also include a secondary storage computing device, a single instance database, and one or more storage devices that store copies of the primary data (e.g., secondary copies, tertiary copies, etc.). The secondary storage computing device receives blocks of data from the computing devices and accesses the single instance database to determine whether the blocks of data are unique (meaning that no instances of the blocks of data are stored on the storage devices). If a block of data is unique, the single instance database stores it on a storage device. If not, the secondary storage computing device can avoid storing the block of data on the storage devices.

    Abstract translation: 在此详细描述的是用于数据存储系统中单个实例数据块的系统和方法。 例如,数据存储系统可以包括存储主数据的多个计算设备(例如,客户端计算设备)。 数据存储系统还可以包括辅助存储计算设备,单个实例数据库以及存储主要数据的副本(例如,次要副本,三级副本等)的一个或多个存储设备。 辅助存储计算设备从计算设备接收数据块并访问单个实例数据库以确定数据块是否是唯一的(意味着数据块的实例不存储在存储设备上)。 如果一个数据块是唯一的,那么单个实例数据库将其存储在存储设备上。 如果不是,则辅助存储计算设备可以避免将数据块存储在存储设备上。

    INFORMATION MANAGEMENT OF DATA ASSOCIATED WITH MULTIPLE CLOUD SERVICES
    96.
    发明申请
    INFORMATION MANAGEMENT OF DATA ASSOCIATED WITH MULTIPLE CLOUD SERVICES 有权
    与多云服务相关的数据信息管理

    公开(公告)号:US20150113055A1

    公开(公告)日:2015-04-23

    申请号:US14589179

    申请日:2015-01-05

    Abstract: A method and system for providing information management of data from hosted services receives information management policies for a hosted account of a hosted service, requests data associated with the hosted account from the hosted service, receives data associated with the hosted account from the hosted service, and provides a preview version of the received data to a computing device. In some examples, the system indexes the received data to associate the received data with a user of an information management system, and/or provides index information related to the received data to the computing device.

    Abstract translation: 用于从托管服务提供数据的信息管理的方法和系统接收托管服务的托管帐户的信息管理策略,从托管服务请求与托管帐户相关联的数据,从托管服务接收与托管帐户相关联的数据, 并将所接收的数据的预览版本提供给计算设备。 在一些示例中,系统对接收到的数据进行索引以将接收到的数据与信息管理系统的用户相关联,和/或将与接收到的数据相关的索引信息提供给计算设备。

    PARTIAL FILE RESTORE IN A DATA STORAGE SYSTEM
    98.
    发明申请
    PARTIAL FILE RESTORE IN A DATA STORAGE SYSTEM 有权
    数据存储系统中的部分文件恢复

    公开(公告)号:US20140201153A1

    公开(公告)日:2014-07-17

    申请号:US14152843

    申请日:2014-01-10

    Abstract: The data storage system according to certain aspects can implement partial file restore, where only a portion of the secondary copy of a file is restored. Such portion may be designated by one or more application offsets for the file. The system may provide an in-chunk index that includes mapping information between the application offsets and the secondary copy offsets. Chunks may refer to logical data units in which secondary copies are stored, and the in-chunk index for a chunk may be stored in secondary storage with the chunk. Because the mapping information may not be provided at a fixed interval, the system can search through application offsets in the in-chunk index to locate the secondary copy offset corresponding to the portion application offset(s). In this manner, the system may restore the designated portion of the secondary copy in a fast and efficient manner by using the in-chunk index.

    Abstract translation: 根据某些方面的数据存储系统可以实现部分文件恢复,其中只有文件的副本副本的一部分被恢复。 这样的部分可以由文件的一个或多个应用偏移来指定。 系统可以提供包括应用偏移和次要复印偏移之间的映射信息的块内索引。 块可以指存储次级副本的逻辑数据单元,并且块的块内索引可以与块一起存储在辅助存储器中。 由于可能不能以固定的时间间隔提供映射信息,所以系统可以搜索块内索引中的应用偏移量来定位与部分应用偏移相对应的次要复制偏移量。 以这种方式,系统可以通过使用组块索引以快速且有效的方式恢复辅助副本的指定部分。

    STORING A VARIABLE NUMBER OF INSTANCES OF DATA OBJECTS
    99.
    发明申请
    STORING A VARIABLE NUMBER OF INSTANCES OF DATA OBJECTS 有权
    存储可变数据数据对象的数据

    公开(公告)号:US20130218842A1

    公开(公告)日:2013-08-22

    申请号:US13847359

    申请日:2013-03-19

    CPC classification number: G06F17/30283 G06F11/2094 G06F17/30575

    Abstract: Described are systems and methods for storing a variable number of instances of data objects (e.g., 1, 2, 3, or up to N−1 instances, where N is the number of instances of the data object included in primary data) in secondary storage across a data storage network. In some examples, a system for storing a variable number of instances of data objects includes, one or more computing devices storing a set of data objects and multiple storage devices distinct from the one or more computing devices. Each of the multiple storage devices is configured to store at least a single instance of a data object. The system also includes a database configured to store information associated with the data objects. This information includes substantially unique identifiers for the data objects and, for each of the data objects, a number of instances of the data object stored on the multiple storage devices.

    Abstract translation: 描述了用于存储可变数目的数据对象(例如,1,2,3或至多N-1个实例的系统和方法,其中N是主数据中包括的数据对象的实例的数量)在次级 存储在数据存储网络中。 在一些示例中,用于存储可变数量的数据对象的实例的系统包括存储一组数据对象的一个​​或多个计算设备和与一个或多个计算设备不同的多个存储设备。 多个存储设备中的每一个被配置为存储数据对象的至少一个实例。 该系统还包括配置为存储与数据对象相关联的信息的数据库。 该信息包括用于数据对象的基本上唯一的标识符,并且对于每个数据对象,存储多个存储设备上存储的数据对象的多个实例。

    Efficient deduplication database validation

    公开(公告)号:US11175996B2

    公开(公告)日:2021-11-16

    申请号:US16743879

    申请日:2020-01-15

    Abstract: According to certain aspects, a method can include receiving an indication that a restoration of a deduplication database using a secondary copy of a file associated with a secondary copy job is complete; retrieving a first data fingerprint from a data storage database, wherein the first data fingerprint is associated with the secondary copy job used to restore the deduplication database; retrieving a second data fingerprint from a deduplication database media agent, wherein the second data fingerprint is associated with the secondary copy job used to restore the deduplication database; comparing the first data fingerprint with the second data fingerprint to determine whether the first data fingerprint and the second data fingerprint match; and transmitting an instruction to the deduplication database media agent to rebuild the restored deduplication database in response to a determination that the first data fingerprint and the second data fingerprint do not match.

Patent Agency Ranking