METHODS AND SYSTEMS FOR RESTORING A DATA CONTAINER ARCHIVED AT AN OBJECT-BASED STORAGE

    公开(公告)号:US20180203865A1

    公开(公告)日:2018-07-19

    申请号:US15408736

    申请日:2017-01-18

    Applicant: NETAPP, INC.

    Abstract: Methods and systems for an object based storage are provided. As an example, a method for generating a metadata object for an archive data container having a plurality of data containers is disclosed. The method includes generating a first metadata signature for the archive data container using an archive data container identifier, a number of data containers within the archive data container, and placement information of each data container within the archive data container; assigning a plurality of blocks for storing data for the plurality of data containers at an object based storage to an intermediate logical object; updating a payload signature with placement information of the plurality of blocks within the intermediate logical object; and placing the first metadata signature and the updated payload signature within the metadata object, wherein the metadata object is used to retrieve location information for a specific data container within the archive data container.

    Predictive probabilistic deduplication of storage

    公开(公告)号:US09940337B2

    公开(公告)日:2018-04-10

    申请号:US14726597

    申请日:2015-05-31

    Applicant: VMware, Inc.

    Abstract: Examples perform predictive probabilistic deduplication of storage, such as virtualized or physical disks. Incoming input/output (I/O) commands include data, which is written to storage and tracked in a key-value store. The key-value store includes a hash of the data as the key, and a reference counter and the address of the data as the value. When a certain percentage of sampled incoming data is found to be duplicate, it is predicted that the I/O commands have become not unique (e.g., duplicate). Based on the prediction, subsequent incoming data is not written to storage, and instead the reference counter associated with the hash of the data is incremented. In this manner, predictions on the uniqueness of future data is made based on previous data, and extraneous writes and deletions from the chunk store are avoided.

    Hybrid image backup of a source storage

    公开(公告)号:US09886351B2

    公开(公告)日:2018-02-06

    申请号:US15284310

    申请日:2016-10-03

    Abstract: Creating hybrid image backups of a source storage. In one example embodiment, a method for creating one or more hybrid image backups of a source storage may include seeding a deduplication system by storing, in the deduplication system, blocks included in one or more files of one or more operating systems, blocks included in one or more files of one or more software applications, or both, and by storing a hash of each of the seeded blocks in a hash data structure; copying the hash data structure into a solid state drive (SSD); placing the SSD in a read-only state; comparing hashes of allocated blocks stored in the source storage at a first point in time to hashes in the hash data structure to determine whether each of the allocated blocks is in a first set of blocks that are stored in the deduplication system or in a second set of blocks that are not stored in the deduplication system; associating locations of the blocks in the first set of blocks stored in the source storage with locations of the corresponding blocks stored in the deduplication system; storing the blocks in the second set of blocks in an image backup separate from the deduplication system; associating locations of the blocks in the second set of blocks stored in the source storage with locations of the corresponding blocks stored in the image backup; and defining a full hybrid image backup of the source storage at the first point in time as a combination of the first set of blocks stored in the deduplication system and the second set of blocks stored in the image backup.

    Content-addressable data storage
    8.
    发明授权

    公开(公告)号:US09830345B1

    公开(公告)日:2017-11-28

    申请号:US15275681

    申请日:2016-09-26

    Applicant: SEMMLE LIMITED

    Inventor: Arthur Baars

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for storing data in a version control system. One of the methods includes maintaining, in a data store, a tree-structured index of files in which each leaf node stores an entry for each file in a plurality of files for a snapshot that includes a unique file identifier for the respective file; receiving a request for a particular file; generating a hash of a particular file path for the particular file; identifying, using the hash of the particular file path, a leaf node in the tree-structured index that includes an entry for the particular file; identifying, in the leaf node, an entry for the particular file path; obtaining, from the entry, the unique file identifier for the particular file in the data store; and using the unique file identifier for the particular file in response to the request.

    Snapshot manifest for static analysis results

    公开(公告)号:US09830335B1

    公开(公告)日:2017-11-28

    申请号:US15435672

    申请日:2017-02-17

    Applicant: SEMMLE LIMITED

    Inventor: Julian Tibble

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using a snapshot manifest when storing files in a data store. One of the methods includes computing first static analysis results for a current snapshot; requesting, from a data storage system, a manifest that includes a plurality of unique identifiers that each identify a corresponding second object of second static analysis results for a previous snapshot; determining, for each first object of the first static analysis results, a unique identifier for the first object; determining, for each first object of the first static analysis results, whether the unique identifier for the first object occurs in the manifest; generating a batch of first objects having unique identifiers that do not occur in the manifest; and providing, to the data storage system, the batch of first objects having unique identifiers that do not occur in the manifest.

    Script compilation
    10.
    发明授权

    公开(公告)号:US09811321B1

    公开(公告)日:2017-11-07

    申请号:US14861705

    申请日:2015-09-22

    Abstract: Systems and methods are described for utilizing script chunks in client-side browsing applications. A script chunking service divides scripts into smaller portions, and generates precompiled chunks of code that correspond to the portions. The service may then request compilation of only those portions of a script that do not have a corresponding precompiled chunk. Portions may be determined based on length, script keywords, or other criteria, and chunks may be identified by applying a hashing function to the portions. Scripts may be filtered to identify differing portions of source code that compile to functionally identical instructions, and the chunking service may use supplemental hashes, hashes of hashes, and complier state information to facilitate identification and compilation of precompiled chunks.

Patent Agency Ranking