PERFORMING DATA STORAGE OPERATIONS WITH A CLOUD ENVIRONMENT, INCLUDING CONTAINERIZED DEDUPLICATION, DATA PRUNING, AND DATA TRANSFER
    Invention application (status: in force)

    Publication No.: US20130238572A1

    Publication date: 2013-09-12

    Application No.: US13850903

    Filing date: 2013-03-26

    Abstract: Various systems and methods may be used for performing data storage operations, including content-indexing, containerized deduplication, and policy-driven storage, within a cloud environment. The systems support a variety of clients and cloud storage sites that may connect to the system in a cloud environment that requires data transfer over wide area networks, such as the Internet, which may have appreciable latency and/or packet loss, using various network protocols, including HTTP and FTP. Methods for content indexing data stored within a cloud environment may facilitate later searching, including collaborative searching. Methods for performing containerized deduplication may reduce the strain on a system namespace, effectuate cost savings, etc. Methods may identify suitable storage locations, including suitable cloud storage sites, for data files subject to a storage policy. Further, the systems and methods may be used for providing a cloud gateway and a scalable data object store within a cloud environment.


    Cloud-based air-gapped data storage management system

    Publication No.: US12130708B2

    Publication date: 2024-10-29

    Application No.: US17120555

    Filing date: 2020-12-14

    Abstract: An illustrative cloud-based air-gapped data storage management (destination) system obtains authorized access to other (source) systems' backup copies, replicates those copies within the destination system, parses supplemental metadata included in the source backup copies, and integrates the replica copies into the destination system as though natively created there. Replica copies are integrated as backup copies without first restoring the source backup copies to a native data format. The source system lacks knowledge of or connectivity with the destination system, thus maintaining an “air gap” between the systems. The destination system preferably operates in a cloud computing environment. The destination system uses supplemental metadata from the replica copies to re-create or mimic the source's computing environment and to restore backed up data from the replica copies. The destination system also operates as an autonomous analytics engine, applying value-added services to backed up data pulled from source system(s).

    Deduplication database without reference counting

    Publication No.: US12007967B2

    Publication date: 2024-06-11

    Application No.: US17725451

    Filing date: 2022-04-20

    CPC classification number: G06F16/215 G06F16/2237 G06F16/2282 G06F16/278

    Abstract: A deduplicated storage system is provided according to certain embodiments that uses one or more mechanisms to update the deduplication database and remove records corresponding to data blocks that have been or will be erased from the secondary copies, without using or tracking reference counting values. Some embodiments described herein use a secondary table to identify the corresponding records from the primary table that can be removed and/or moved to another table for storing “zero-reference” data blocks. In other embodiments, the system will then traverse the “zero-reference” table and remove those primary data blocks from secondary storage devices.
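
Illustrative sketch (not taken from the patent): one way the table-based pruning described in this abstract could look. The class name `DedupDatabase`, the table layouts, and the `blk-` location scheme are all hypothetical. The secondary table is assumed to hold (job, signature) pairs, so whether a block is still in use is answered by a membership check rather than by a stored counter.

```python
class DedupDatabase:
    """Toy deduplication database that keeps no reference counts."""

    def __init__(self):
        self.primary = {}       # signature -> block location (live blocks)
        self.secondary = set()  # (job_id, signature) pairs: which job uses which block
        self.zero_ref = {}      # signature -> location of blocks no job uses any more
        self.storage = {}       # location -> bytes (stand-in for secondary storage)

    def add_block(self, job_id, signature, data):
        """Record that job_id uses the block; store the data only if it is new."""
        if signature in self.zero_ref:
            # The block was awaiting deletion but is needed again: revive it.
            self.primary[signature] = self.zero_ref.pop(signature)
        if signature not in self.primary:
            location = "blk-" + signature
            self.storage[location] = data
            self.primary[signature] = location
        self.secondary.add((job_id, signature))

    def prune_job(self, job_id):
        """Drop a job's rows, then move blocks nobody uses to the zero-reference table."""
        pruned = {sig for (job, sig) in self.secondary if job == job_id}
        self.secondary = {(job, sig) for (job, sig) in self.secondary if job != job_id}
        still_used = {sig for (_, sig) in self.secondary}
        for sig in pruned - still_used:
            self.zero_ref[sig] = self.primary.pop(sig)

    def sweep(self):
        """Traverse the zero-reference table and erase those blocks from storage."""
        for sig, location in list(self.zero_ref.items()):
            del self.storage[location]
            del self.zero_ref[sig]


db = DedupDatabase()
db.add_block("job1", "a", b"x")
db.add_block("job1", "b", b"y")
db.add_block("job2", "b", b"y")   # shared with job1, stored once
db.prune_job("job1")              # "a" becomes zero-reference, "b" stays live
db.sweep()                        # only now is block "a" erased
```

Splitting pruning from sweeping means a block that reappears in a later backup can be revived from the zero-reference table before it is physically erased.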

    Restore using deduplicated secondary copy data

    Publication No.: US11829251B2

    Publication date: 2023-11-28

    Application No.: US17149438

    Filing date: 2021-01-14

    CPC classification number: G06F11/1453 G06F11/1451 G06F11/1464 G06F11/1469

    Abstract: Disclosed methods and systems leverage resources in a storage management system to restore a selected backup to a production site. The backup is partitioned into blocks with associated signatures. The production site may have blocks that have not changed from when the backup occurred, so those blocks do not need to be restored. Block signatures from the production site are compared with block signatures from the incremental backup to identify blocks that need to be restored. Efficiency may be achieved by synchronizing the replacement blocks from a more easily accessible location where available before synchronizing from less accessible locations. In some embodiments, a user may specify the location of the site with the replacement blocks.
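
Illustrative sketch (not taken from the patent): the signature comparison in this abstract can be pictured as follows. It assumes fixed positions for blocks, SHA-256 as the signature function, and source locations modelled as plain dictionaries ordered from most to least accessible. The function names are hypothetical.

```python
import hashlib


def signature(block):
    """Signature of one block (SHA-256 is an assumption, not the patent's choice)."""
    return hashlib.sha256(block).hexdigest()


def blocks_to_restore(production_blocks, backup_signatures):
    """Return the indices whose production block is missing or differs from the backup."""
    needed = []
    for i, wanted in enumerate(backup_signatures):
        current = production_blocks[i] if i < len(production_blocks) else None
        if current is None or signature(current) != wanted:
            needed.append(i)
    return needed


def restore(production_blocks, backup_signatures, locations):
    """Rebuild the backed-up state, fetching only the blocks that changed.

    `locations` is a list of dicts (signature -> block), most accessible first.
    """
    count = len(backup_signatures)
    restored = list(production_blocks[:count])
    restored += [None] * (count - len(restored))
    for i in blocks_to_restore(production_blocks, backup_signatures):
        wanted = backup_signatures[i]
        for source in locations:          # try the nearest source first
            if wanted in source:
                restored[i] = source[wanted]
                break
        else:
            raise KeyError("no location holds block " + wanted)
    return restored


backup = [b"aa", b"bb", b"cc"]
backup_sigs = [signature(b) for b in backup]
production = [b"aa", b"XX", b"cc"]           # only the middle block changed
local_copy = {}                               # nearby source, holds nothing useful here
remote_copy = {signature(b"bb"): b"bb"}       # less accessible source
result = restore(production, backup_sigs, [local_copy, remote_copy])
```

Here only one of three blocks is transferred, and the remote source is consulted only because the nearer one does not hold that block.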

    DATA STORAGE SYSTEM WITH RAPID RESTORE CAPABILITY

    Publication No.: US20230328151A1

    Publication date: 2023-10-12

    Application No.: US18133450

    Filing date: 2023-04-11

    CPC classification number: H04L67/5683 H04L67/565 H04L67/1097

    Abstract: An improved information management system that implements a staging area or cache to temporarily store primary data in a native format before the primary data is converted into secondary copies in a secondary format is described herein. For example, the improved information management system can include various media agents that each include one or more high speed drives. When a client computing device provides primary data for conversion into secondary copies, the primary data can initially be stored in the native format in the high speed drive(s). If the client computing device then submits a request for the primary data, the media agent can simply retrieve the primary data from the high speed drive(s) and transmit the primary data to the client computing device. Because the primary data is already in the native format, no conversion operations are performed by the media agent, thereby reducing the restore delay.
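
Illustrative sketch (not taken from the patent): a toy media agent with a staging area. The class and method names are hypothetical, an in-memory dictionary stands in for the high speed drive, and zlib compression stands in for whatever the secondary format actually is.

```python
import zlib


class MediaAgent:
    """Toy media agent that stages native-format data before converting it."""

    def __init__(self):
        self.staging = {}    # name -> native-format bytes (stand-in for the high speed drive)
        self.secondary = {}  # name -> secondary-format bytes (compressed, as an assumption)

    def receive(self, name, data):
        """Primary data arrives from a client and is staged in its native format."""
        self.staging[name] = data

    def convert(self, name):
        """Create the secondary copy; the staged native copy is kept for now."""
        self.secondary[name] = zlib.compress(self.staging[name])

    def evict(self, name):
        """Free the staging area once the native copy is no longer worth keeping."""
        self.staging.pop(name, None)

    def restore(self, name):
        """Return (data, source). Staged data needs no conversion on the way back."""
        if name in self.staging:
            return self.staging[name], "staging"
        return zlib.decompress(self.secondary[name]), "secondary"


agent = MediaAgent()
agent.receive("report.doc", b"quarterly numbers")
agent.convert("report.doc")
fast = agent.restore("report.doc")   # served from staging, no conversion
agent.evict("report.doc")
slow = agent.restore("report.doc")   # converted back from the secondary copy
```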

    Block-level single instancing
    Granted invention patent

    Publication No.: US11709739B2

    Publication date: 2023-07-25

    Application No.: US17884482

    Filing date: 2022-08-09

    Abstract: Described in detail herein are systems and methods for single instancing blocks of data in a data storage system. For example, the data storage system may include multiple computing devices (e.g., client computing devices) that store primary data. The data storage system may also include a secondary storage computing device, a single instance database, and one or more storage devices that store copies of the primary data (e.g., secondary copies, tertiary copies, etc.). The secondary storage computing device receives blocks of data from the computing devices and accesses the single instance database to determine whether the blocks of data are unique (meaning that no instances of the blocks of data are stored on the storage devices). If a block of data is unique, the single instance database stores it on a storage device. If not, the secondary storage computing device can avoid storing the block of data on the storage devices.
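
Illustrative sketch (not taken from the patent): block-level single instancing reduced to its core check. The names are hypothetical, a dictionary stands in for the single instance database, a list stands in for the storage devices, and the 4-byte block size is chosen only to keep the example small.

```python
import hashlib


class SingleInstanceStore:
    """Toy store that keeps each unique block exactly once."""

    def __init__(self):
        self.index = {}   # block digest -> position in self.blocks (the "database")
        self.blocks = []  # stored blocks (stand-in for the storage devices)

    def put(self, block):
        """Store the block if it is unique; return (position, was_new)."""
        digest = hashlib.sha256(block).hexdigest()
        if digest in self.index:
            return self.index[digest], False   # already stored, skip the write
        self.blocks.append(block)
        self.index[digest] = len(self.blocks) - 1
        return self.index[digest], True


def backup(store, data, block_size=4):
    """Split data into blocks and return the list of block positions."""
    refs = []
    for start in range(0, len(data), block_size):
        position, _ = store.put(data[start:start + block_size])
        refs.append(position)
    return refs


def restore(store, refs):
    """Reassemble the original data from block positions."""
    return b"".join(store.blocks[position] for position in refs)


store = SingleInstanceStore()
refs = backup(store, b"AAAABBBBAAAA")   # first and third blocks are identical
```

In this run three blocks are presented but only two are written, and the returned positions are enough to rebuild the original bytes.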

    Data storage system with rapid restore capability

    Publication No.: US11659064B2

    Publication date: 2023-05-23

    Application No.: US17498212

    Filing date: 2021-10-11

    CPC classification number: H04L67/5683 H04L67/565 H04L67/1097

    Abstract: An improved information management system that implements a staging area or cache to temporarily store primary data in a native format before the primary data is converted into secondary copies in a secondary format is described herein. For example, the improved information management system can include various media agents that each include one or more high speed drives. When a client computing device provides primary data for conversion into secondary copies, the primary data can initially be stored in the native format in the high speed drive(s). If the client computing device then submits a request for the primary data, the media agent can simply retrieve the primary data from the high speed drive(s) and transmit the primary data to the client computing device. Because the primary data is already in the native format, no conversion operations are performed by the media agent, thereby reducing the restore delay.
