COLD TIERING MICROSERVICE FOR DEDUPLICATED DATA

    公开(公告)号:US20240362123A1

    公开(公告)日:2024-10-31

    申请号:US18767293

    申请日:2024-07-09

    IPC分类号: G06F11/14 G06F16/215

    摘要: One example method includes identifying objects that each include one or more segments to be transferred from a source storage tier to a target storage tier, determining a total amount of data to be transferred, using a tiering controller to create worker nodes operable to transfer the segments to the target storage tier, where a number of worker nodes created is based on the amount of data, transferring, from the source storage tier to the target storage tier, only those segments of the objects not already present in the target storage tier, and the transferring of the segments is performed by the worker nodes, and for each of the objects, placing metadata associated with that object in a bucket.

    CONTENT INDEXING OF FILES IN VIRTUAL DISK BLOCK-LEVEL BACKUP COPIES

    公开(公告)号:US20240362121A1

    公开(公告)日:2024-10-31

    申请号:US18762887

    申请日:2024-07-03

    IPC分类号: G06F11/14 G06F9/455 G06F16/13

    摘要: A streamlined approach analyzes block-level backups of VM virtual disks and creates both coarse and fine indexes of backed up VM data files in the block-level backups. The indexes (collectively the “content index”) enable granular searching by filename, by file attributes (metadata), and/or by file contents, and further enable granular live browsing of backed up VM files. Thus, by using the illustrative data storage management system, ordinary block-level backups of virtual disks are “opened to view” through indexing. Any block-level copies can be indexed according to the illustrative embodiments, including file system block-level copies. The indexing occurs offline in an illustrative data storage management system, after VM virtual disks are backed up into block-level backup copies, and therefore the indexing does not cut into the source VM's performance. The disclosed approach is widely applicable to VMs executing in cloud computing environments and/or in non-cloud data centers. The illustrative content indexing is accomplished without restoring the VM data files being indexed to a staging location.

    CLOUD COMPUTING SANDBOX BACKUP
    3.
    发明公开

    公开(公告)号:US20240362120A1

    公开(公告)日:2024-10-31

    申请号:US18307501

    申请日:2023-04-26

    发明人: Gadi Luc Vered

    IPC分类号: G06F11/14 G06F16/11

    摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for backing up environments. One of the methods includes maintaining, for a cloud computing environment, first data that indicates one or more previously active sandbox environments; determining second data that indicates one or more most recently active sandbox environments; determining, using the second data, a newly added sandbox environment; determining, using a first identifier for the newly added sandbox environment and a second identifier for a prior sandbox environment from the one or more previously active sandbox environments, whether the newly added sandbox environment is likely a refresh of the prior sandbox environment; and performing one or more actions for the newly added sandbox environment using a result of the determination whether the newly added sandbox environment is likely a refresh of the prior sandbox environment.

    DIRECTORY RESTORE FROM REMOTE OBJECT STORE
    4.
    发明公开

    公开(公告)号:US20240362118A1

    公开(公告)日:2024-10-31

    申请号:US18655446

    申请日:2024-05-06

    申请人: NetApp, Inc.

    IPC分类号: G06F11/14 G06F16/11

    摘要: Techniques are provided for restoring a directory from a snapshot of a volume backed up to an object store. The snapshot may be backed up from a node to the object store, such as a cloud computing environment. A user may want to restore the directory within the volume without having to restore the entire volume, which otherwise would waste computing resources, storage, network bandwidth, and time. Accordingly, the techniques provided herein are capable of restoring just the directory from the snapshot that is stored within the object store. Because snapshot data of the snapshot may be stored across multiple objects within the object store, certain objects are identified as comprising snapshot data (backup data) of the directory and content items within the directory. In this way, the snapshot data of the directory is restored from these objects to a restore directory at a restore target.

    Back-reference data structure for a deduplication storage system

    公开(公告)号:US12130707B2

    公开(公告)日:2024-10-29

    申请号:US18185202

    申请日:2023-03-16

    IPC分类号: G06F16/215 G06F11/14

    CPC分类号: G06F11/1453 G06F2201/84

    摘要: Example implementations relate to deduplication operations in a storage system. An example includes generating a housekeeping work map to delete a backup item stored in a deduplication storage system; selecting a first work entry of the housekeeping work map, where the first work entry identifies a first container index and a first manifest; in response to a selection of the first work entry, loading the first container index into the memory, the first container index comprising a back-reference data structure; identifying, in the back-reference data structure, a back-reference entry indexed to the first manifest; determining, using the back-reference entry indexed to the first manifest, a first set of data units included in the first manifest and that are indexed in the first container index; and decrementing, in the first container index, a set of reference counts for the determined first set of data units.