-
1.
公开(公告)号:US20180336236A1
公开(公告)日:2018-11-22
申请号:US16048060
申请日:2018-07-27
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Ronald J. Barber , Viktor Leis , Guy M. Lohman , Vijayshankar Raman , Richard S. Sidle
IPC: G06F17/30
CPC classification number: G06F17/30359 , G06F17/30097 , G06F17/30194
Abstract: A method includes performing, by a data structure processor, concurrent read and write operations into a hierarchical data structure that includes a mutable tier including extendible hashing, a hash table, and an immutable tier including a concise hash table (CHT) bitmap. Writers acquire latches on the hierarchical data structure elements that the latches modify. The hierarchical data structure elements are directly accessed by readers without acquiring latches. A modify operation is executed by a writer for one or more levels of the hierarchical data structure. When removed portions of the hierarchical data structure are no longer referenced, tracking is performed by use of a combination of a global state value and a copied local state value.
-
公开(公告)号:US20180309638A1
公开(公告)日:2018-10-25
申请号:US16019271
申请日:2018-06-26
Applicant: International Business Machines Corporation
Inventor: Bryan S. Langston , Mohamed Mohamed , Yang Song
IPC: H04L12/24 , H04L12/26 , G06F17/30 , H04L29/08 , H04L12/751
CPC classification number: H04L41/12 , G06F17/30097 , G06F17/30949 , H04L41/5009 , H04L43/02 , H04L43/04 , H04L43/0817 , H04L45/02 , H04L67/10 , H04L67/26
Abstract: A method includes generating, by a processor, an index data structure including allocation of nodes that each represent a computing element of multiple computing elements. In response to determining a change in status of any of the nodes, the method propagates changes in status of any computing element between associated node levels of the index data structure using an application programming interface (API). An updated status of one or more of the nodes is provided based on the change in status.
-
3.
公开(公告)号:US20180203865A1
公开(公告)日:2018-07-19
申请号:US15408736
申请日:2017-01-18
Applicant: NETAPP, INC.
CPC classification number: G06F17/30073 , G06F11/1469 , G06F17/30097 , G06F17/30501 , H04L9/3247 , H04L63/0428
Abstract: Methods and systems for an object based storage are provided. As an example, a method for generating a metadata object for an archive data container having a plurality of data containers is disclosed. The method includes generating a first metadata signature for the archive data container using an archive data container identifier, a number of data containers within the archive data container, and placement information of each data container within the archive data container; assigning a plurality of blocks for storing data for the plurality of data containers at an object based storage to an intermediate logical object; updating a payload signature with placement information of the plurality of blocks within the intermediate logical object; and placing the first metadata signature and the updated payload signature within the metadata object, wherein the metadata object is used to retrieve location information for a specific data container within the archive data container.
-
公开(公告)号:US09940337B2
公开(公告)日:2018-04-10
申请号:US14726597
申请日:2015-05-31
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Tian Luo
CPC classification number: G06F17/30156 , G06F3/0608 , G06F3/0641 , G06F11/1453 , G06F17/30097 , G06F17/30159 , G06F17/3033 , G06N7/005
Abstract: Examples perform predictive probabilistic deduplication of storage, such as virtualized or physical disks. Incoming input/output (I/O) commands include data, which is written to storage and tracked in a key-value store. The key-value store includes a hash of the data as the key, and a reference counter and the address of the data as the value. When a certain percentage of sampled incoming data is found to be duplicate, it is predicted that the I/O commands have become not unique (e.g., duplicate). Based on the prediction, subsequent incoming data is not written to storage, and instead the reference counter associated with the hash of the data is incremented. In this manner, predictions on the uniqueness of future data is made based on previous data, and extraneous writes and deletions from the chunk store are avoided.
-
公开(公告)号:US09934237B1
公开(公告)日:2018-04-03
申请号:US14963224
申请日:2015-12-08
Applicant: EMC Corporation
Inventor: Philip Shilane , Grant Wallace
CPC classification number: G06F17/30156 , G06F3/0641 , G06F11/1448 , G06F11/1453 , G06F11/2094 , G06F11/2097 , G06F17/30097 , G06F17/3012 , G06F17/30159 , G06F17/30174
Abstract: A target storage system receives a representative fingerprint and fingerprint representations from a source storage system. Each fingerprint representation contains only a portion of a corresponding fingerprint and the representative fingerprint is a full fingerprint. The fingerprints of the data chunks are missing at the target storage system are identified based on the fingerprint representative and the fingerprint representations. A bitmap is transmitted to the source storage system, each bit indicating whether one of the fingerprints is missing. One or more fingerprints are received from the source storage system that are missing at the target storage system based on the bitmask. One or more missing data chunks are identified based on at least the one or more fingerprints received from the source storage system. The missing data chunks are then received from the source storage system to be stored at the target storage system.
-
公开(公告)号:US09906831B2
公开(公告)日:2018-02-27
申请号:US15138678
申请日:2016-04-26
Applicant: Sorenson Media, Inc.
Inventor: Juikun Chen
CPC classification number: H04N21/4345 , G06F17/3002 , G06F17/30038 , G06F17/30097 , G06F17/30109 , G06F17/30256 , G06F17/3053 , G06K9/6215 , H04N21/44008 , H04N21/4722 , H04N21/6582 , H04N21/8358 , H04N21/84 , H04N2201/3233
Abstract: A server device includes memory to store: a database of frame fingerprints ordered sequentially by media item number and frame number for media items, and a hash table containing (key, value) entries each including a hashed frame fingerprint and index number(s) to the database. A processor coupled to the memory is to, for an input set of hashed frame fingerprints, select a most recent hashed frame fingerprint. The processor performs a lookup with the most recent hashed frame fingerprint to identify a plurality of index numbers from a matching entry. The processor retrieves a target plurality of frame fingerprints from the database using each of the plurality of index numbers as a starting point; calculates a rank distance between the input set and each respective target plurality of frame fingerprints; and selects the target plurality of frame fingerprints for which the rank distance is shortest, to identify a media item.
-
公开(公告)号:US09886351B2
公开(公告)日:2018-02-06
申请号:US15284310
申请日:2016-10-03
Applicant: STORAGECRAFT TECHNOLOGY CORPORATION
Inventor: Nathan S. Bushman , Scott Barnes , Thomas Russell Shreeve
CPC classification number: G06F11/1453 , G06F3/067 , G06F11/00 , G06F11/1451 , G06F17/30067 , G06F17/30097 , G06F17/30159
Abstract: Creating hybrid image backups of a source storage. In one example embodiment, a method for creating one or more hybrid image backups of a source storage may include seeding a deduplication system by storing, in the deduplication system, blocks included in one or more files of one or more operating systems, blocks included in one or more files of one or more software applications, or both, and by storing a hash of each of the seeded blocks in a hash data structure; copying the hash data structure into a solid state drive (SSD); placing the SSD in a read-only state; comparing hashes of allocated blocks stored in the source storage at a first point in time to hashes in the hash data structure to determine whether each of the allocated blocks is in a first set of blocks that are stored in the deduplication system or in a second set of blocks that are not stored in the deduplication system; associating locations of the blocks in the first set of blocks stored in the source storage with locations of the corresponding blocks stored in the deduplication system; storing the blocks in the second set of blocks in an image backup separate from the deduplication system; associating locations of the blocks in the second set of blocks stored in the source storage with locations of the corresponding blocks stored in the image backup; and defining a full hybrid image backup of the source storage at the first point in time as a combination of the first set of blocks stored in the deduplication system and the second set of blocks stored in the image backup.
-
公开(公告)号:US09830345B1
公开(公告)日:2017-11-28
申请号:US15275681
申请日:2016-09-26
Applicant: SEMMLE LIMITED
Inventor: Arthur Baars
CPC classification number: G06F17/3033 , G06F11/1451 , G06F17/30088 , G06F17/30097 , G06F17/30327 , G06F17/30424 , G06F17/30589 , G06F2201/84
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for storing data in a version control system. One of the methods includes maintaining, in a data store, a tree-structured index of files in which each leaf node stores an entry for each file in a plurality of files for a snapshot that includes a unique file identifier for the respective file; receiving a request for a particular file; generating a hash of a particular file path for the particular file; identifying, using the hash of the particular file path, a leaf node in the tree-structured index that includes an entry for the particular file; identifying, in the leaf node, an entry for the particular file path; obtaining, from the entry, the unique file identifier for the particular file in the data store; and using the unique file identifier for the particular file in response to the request.
-
公开(公告)号:US09830335B1
公开(公告)日:2017-11-28
申请号:US15435672
申请日:2017-02-17
Applicant: SEMMLE LIMITED
Inventor: Julian Tibble
CPC classification number: G06F17/3023 , G06F11/1451 , G06F17/30088 , G06F17/30097 , G06F17/3033 , G06F17/30575 , G06F2201/84
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using a snapshot manifest when storing files in a data store. One of the methods includes computing first static analysis results for a current snapshot; requesting, from a data storage system, a manifest that includes a plurality of unique identifiers that each identify a corresponding second object of second static analysis results for a previous snapshot; determining, for each first object of the first static analysis results, a unique identifier for the first object; determining, for each first object of the first static analysis results, whether the unique identifier for the first object occurs in the manifest; generating a batch of first objects having unique identifiers that do not occur in the manifest; and providing, to the data storage system, the batch of first objects having unique identifiers that do not occur in the manifest.
-
公开(公告)号:US09811321B1
公开(公告)日:2017-11-07
申请号:US14861705
申请日:2015-09-22
Applicant: Amazon Technologies, Inc.
Inventor: Jari Juhani Karppanen
CPC classification number: G06F8/41 , G06F9/4552 , G06F9/45529 , G06F17/30097 , H04L67/06 , H04L67/34
Abstract: Systems and methods are described for utilizing script chunks in client-side browsing applications. A script chunking service divides scripts into smaller portions, and generates precompiled chunks of code that correspond to the portions. The service may then request compilation of only those portions of a script that do not have a corresponding precompiled chunk. Portions may be determined based on length, script keywords, or other criteria, and chunks may be identified by applying a hashing function to the portions. Scripts may be filtered to identify differing portions of source code that compile to functionally identical instructions, and the chunking service may use supplemental hashes, hashes of hashes, and complier state information to facilitate identification and compilation of precompiled chunks.
-
-
-
-
-
-
-
-
-