Invention Application
US20130054544A1 Content Aware Chunking for Achieving an Improved Chunk Size Distribution
有权
用于实现改进的块大小分布的内容意识分块
- Patent Title: Content Aware Chunking for Achieving an Improved Chunk Size Distribution
- Patent Title (中): 用于实现改进的块大小分布的内容意识分块
-
Application No.: US13222198Application Date: 2011-08-31
-
Publication No.: US20130054544A1Publication Date: 2013-02-28
- Inventor: Jin Li , Sudipta Sengupta , Sanjeev Mehrotra , Ran Kalach , Paul Adrian Oltean
- Applicant: Jin Li , Sudipta Sengupta , Sanjeev Mehrotra , Ran Kalach , Paul Adrian Oltean
- Applicant Address: US WA Redmond
- Assignee: MICROSOFT CORPORATION
- Current Assignee: MICROSOFT CORPORATION
- Current Assignee Address: US WA Redmond
- Main IPC: G06F7/00
- IPC: G06F7/00

Abstract:
The subject disclosure is directed towards partitioning a file into chunks that satisfy a chunk size restriction, such as maximum and minimum chunk sizes, using a sliding window. For file positions within the chunk size restriction, a signature representative of a window fingerprint is compared with a target pattern, with a chunk boundary candidate identified if matched. Other signatures and patterns are then checked to determine a highest ranking signature (corresponding to a lowest numbered Rule) to associate with that chunk boundary candidate, or set an actual boundary if the highest ranked signature is matched. If the maximum chunk size is reached without matching the highest ranked signature, the chunking mechanism regresses to set the boundary based on the candidate with the next highest ranked signature (if no candidates, the boundary is set at the maximum). Also described is setting chunk boundaries based upon pattern detection (e.g., runs of zeros).
Public/Granted literature
- US08918375B2 Content aware chunking for achieving an improved chunk size distribution Public/Granted day:2014-12-23
Information query