-
公开(公告)号:US20240061749A1
公开(公告)日:2024-02-22
申请号:US18501603
申请日:2023-11-03
申请人: Rubrik, Inc.
发明人: Vijay Karthik , Abdullah Reza
CPC分类号: G06F11/1451 , G06F16/125 , G06F16/128
摘要: A method of consolidating snapshots includes receiving a request to consolidate a first snapshot with a second snapshot into a third snapshot, the first and second snapshots stored in separate backup files, each backup file organized as a directory where data parts of the first and second snapshots can be hard linked to locations outside of the backup file, comparing the data parts of the first and second snapshots to determine if any second snapshot data part fully overlaps with any first snapshot data part, responsive to determining that a second snapshot data part fully overlaps with a first snapshot data part, hard linking the determined second snapshot data part into the third snapshot, and storing the third snapshot in the backup file.
-
公开(公告)号:US20210342297A1
公开(公告)日:2021-11-04
申请号:US16862470
申请日:2020-04-29
申请人: Rubrik, Inc.
IPC分类号: G06F16/11 , G06F16/174 , G06F16/14 , G06F16/13
摘要: A lightweight deduplication system can perform resource efficient data deduplication using an extent index and a content index. The extent index can store full fingerprints of data segments to be deduplicated and the content index can store shortened versions of the full fingerprints. The system can alternate between the extent and content indexes, and cache portions of the indices to perform lightweight data deduplication. Further, the system can be configured with an efficient heuristic approach for selecting content index data lookups for chains of volumes for deduplication, such as a long chain of snapshots.
-
公开(公告)号:US20240241798A1
公开(公告)日:2024-07-18
申请号:US18427404
申请日:2024-01-30
申请人: Rubrik, Inc.
发明人: Abdullah Reza , Vijay Karthik
CPC分类号: G06F11/1469 , G06F11/1464 , G06F16/128 , G06F2201/84
摘要: A method for recovering files from a filesystem stored across sparse files in a cloud environment is described. According to the method, a data management system may receive a request to read the files. The data management system may identify one or more target address ranges corresponding to the files indicated via the request. The data management system may read index information for the sparse files in the cloud environment. The index information may indicate respective address ranges for data blocks within the sparse files. The data management system may identify one or more data blocks within one or more sparse files as corresponding to address ranges that overlap with the one or more target address ranges based on the index information. The data management system may transmit, to the cloud environment, one or more read requests for the identified one or more data blocks.
-
公开(公告)号:US20240134758A1
公开(公告)日:2024-04-25
申请号:US18400836
申请日:2023-12-29
申请人: Rubrik, Inc.
CPC分类号: G06F11/1469 , G06F11/1464 , G06F16/128 , G06F2201/84
摘要: In some examples, a data management and storage (DMS) platform, comprises peer DMS nodes in a node cluster, a distributed data store comprising local and cloud storage, and at least one processor configured to perform operations in a method of creating a local consolidated patch file from a patch file chain stored in the cloud storage. The operations include, in a first dry-run phase, creating a logical patch file image of data blocks in one or more cloud patch files stored in the cloud storage; in a second data-transfer phase, downloading at least some of the data blocks from the cloud patch files identified by the logical patch file image, the second data-transfer phase comprising a coalescing operation to construct a set of coalesced reads of the data blocks; and creating and storing, in the local storage, the local consolidated patch file using the downloaded data blocks.
-
公开(公告)号:US20230168968A1
公开(公告)日:2023-06-01
申请号:US17536601
申请日:2021-11-29
申请人: Rubrik, Inc.
发明人: Vijay Karthik , Abdullah Reza
CPC分类号: G06F11/1451 , G06F16/128 , G06F16/125
摘要: A method of consolidating snapshots includes receiving a request to consolidate a first snapshot with a second snapshot into a third snapshot, the first and second snapshots stored in separate backup files, each backup file organized as a directory where data parts of the first and second snapshots can be hard linked to locations outside of the backup file, comparing the data parts of the first and second snapshots to determine if any second snapshot data part fully overlaps with any first snapshot data part, responsive to determining that a second snapshot data part fully overlaps with a first snapshot data part, hard linking the determined second snapshot data part into the third snapshot, and storing the third snapshot in the backup file.
-
公开(公告)号:US20240232022A1
公开(公告)日:2024-07-11
申请号:US18094891
申请日:2023-01-09
申请人: Rubrik, Inc.
发明人: Deepti Kochar , Abdullah Reza , Prasenjit Sarkar , Prabhu Mohan , Arjun Sinha , Yanzhe Wang
IPC分类号: G06F11/14
CPC分类号: G06F11/1464 , G06F2201/84
摘要: Methods, systems, and devices for data management are described. A data management system may receive a request to generate backup data for a set of data files from the one or more databases. The data management system may then generate, in response to the request, a file including a set of partitions including respective groups of shard files that correspond to respective groups of data files from among the set of data files. In some examples, a respective group of shard files within a partition of the set of partitions may include a first shard file representative of metadata for the partition and one or more additional shard files representative of the respective group of data files for the partition. The data management system may then distribute the respective groups of shard files to a set of nodes within the distributed backup system.
-
公开(公告)号:US11886226B2
公开(公告)日:2024-01-30
申请号:US17536601
申请日:2021-11-29
申请人: Rubrik, Inc.
发明人: Vijay Karthik , Abdullah Reza
CPC分类号: G06F11/1451 , G06F16/125 , G06F16/128
摘要: A method of consolidating snapshots includes receiving a request to consolidate a first snapshot with a second snapshot into a third snapshot, the first and second snapshots stored in separate backup files, each backup file organized as a directory where data parts of the first and second snapshots can be hard linked to locations outside of the backup file, comparing the data parts of the first and second snapshots to determine if any second snapshot data part fully overlaps with any first snapshot data part, responsive to determining that a second snapshot data part fully overlaps with a first snapshot data part, hard linking the determined second snapshot data part into the third snapshot, and storing the third snapshot in the backup file.
-
公开(公告)号:US11860817B2
公开(公告)日:2024-01-02
申请号:US17379613
申请日:2021-07-19
申请人: Rubrik, Inc.
发明人: Abdullah Reza , Vijay Karthik , Nitin Rathor , Vaibhav Gosain , Anshul Gupta
CPC分类号: G06F16/116 , G06F16/128 , G06F16/13 , G06F16/1815
摘要: In some examples, a data management system generates snapshots in a distributed file system based on a protocol or a user triggered event, The data management system identifies a snappable file in a distributed file system and a first data block in the snappable file, the first data block including data and attribute data. The system scans an index file to access the attribute data of the first data block and initiates construction of a patch file based on the accessed attribute data. The system repeats the scanning of the index file to access attribute data of at least a further second data block, the second data block including data and attribute data, and completes construction of the patch file based on the accessed attribute data of the first and second data blocks. The system generates conversion simulation information by collecting attribute data for all the data blocks of the constructed patch file, and writes the simulation information to a patch file image.
-
公开(公告)号:US20230350766A1
公开(公告)日:2023-11-02
申请号:US18344659
申请日:2023-06-29
申请人: Rubrik, Inc.
IPC分类号: G06F11/14 , H04L67/06 , G06F16/11 , G06F16/182
CPC分类号: G06F11/1464 , G06F11/1469 , H04L67/06 , G06F16/128 , G06F16/1827 , G06F11/1451
摘要: In some examples, a data management and storage (DMS) platform comprises peer DMS nodes in a node cluster, a distributed data store comprising local and cloud storage, and at least one processor configured to perform operations in a method of creating a local consolidated patch file from a patch file chain stored in the cloud storage. Example operations comprise, in a first dry-run phase, creating a patch file image of data blocks in one or more cloud patch files stored in the cloud storage; in a second data-transfer phase, downloading at least some of the data blocks from the cloud patch files identified by the patch file image; and creating and storing, in the local storage, the local consolidated patch file using the downloaded data blocks.
-
公开(公告)号:US20230350764A1
公开(公告)日:2023-11-02
申请号:US17732118
申请日:2022-04-28
申请人: Rubrik, Inc.
发明人: Vijay Karthik , Abdullah Reza
IPC分类号: G06F11/14
CPC分类号: G06F11/1464 , G06F2201/84
摘要: A storage system may store one or more snapshots of a computing system to support backup and restoration of data stored at the computing system. The storage system may identify an expiration of a first snapshot indicating a first set of physical storage locations to which first data of the computing system was stored as part of a first backup procedure. The storage system may identify a first subset physical storage locations of the first set as storing a first portion of the first data that is superseded by second data associated with a second snapshot. Based on identifying the first subset, the storage system may delete the first portion of the first data from the first subset of physical storage locations and retain a second portion of the first data at a second subset of physical storage locations.
-
-
-
-
-
-
-
-
-