Granted invention patent
US08983952B1 System and method for partitioning backup data streams in a deduplication based storage system (in force)

Abstract:
A system and method for partitioning a data stream into a plurality of segments of varying sizes. A data stream manager partitions a received data stream into segments which are then conveyed to a deduplication engine for processing. The data stream received by the data stream manager includes metadata corresponding to the data stream. Based upon the metadata, which may include an indication as to a type of data included in the data stream, the data stream is partitioned into segments for further processing. A size of a segment used for partitioning given data is based at least in part on a type of data being partitioned. The variable segment sizes may be chosen to balance between maximizing the deduplication ratio and minimizing both the segment count and the size of the fingerprint index.
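The type-dependent partitioning described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the names `StreamMetadata`, `SEGMENT_SIZE_BY_TYPE`, `DEFAULT_SEGMENT_SIZE`, and `partition_stream`, the data type labels, and the segment sizes are all hypothetical, and SHA-256 stands in for whatever fingerprint function a real deduplication engine would use.

```python
import hashlib
from dataclasses import dataclass
from typing import Iterator, List, Tuple

# Hypothetical mapping from data type to segment size in bytes.
# The types and sizes are illustrative assumptions, not values from the patent.
SEGMENT_SIZE_BY_TYPE = {
    "database": 8 * 1024,      # smaller segments: more chances to find duplicates
    "filesystem": 128 * 1024,  # larger segments: fewer fingerprints to index
}
DEFAULT_SEGMENT_SIZE = 64 * 1024


@dataclass
class StreamMetadata:
    """Assumed shape of the metadata that accompanies a data stream."""
    data_type: str


def partition_stream(data: bytes, metadata: StreamMetadata) -> Iterator[bytes]:
    """Split data into segments whose size depends on the declared data type."""
    size = SEGMENT_SIZE_BY_TYPE.get(metadata.data_type, DEFAULT_SEGMENT_SIZE)
    for offset in range(0, len(data), size):
        yield data[offset:offset + size]


def fingerprint_segments(segments: Iterator[bytes]) -> List[Tuple[str, int]]:
    """Compute a (fingerprint, length) pair per segment, as a deduplication
    engine might before looking each fingerprint up in its index."""
    return [(hashlib.sha256(seg).hexdigest(), len(seg)) for seg in segments]


if __name__ == "__main__":
    payload = b"x" * (20 * 1024)
    for data_type in ("database", "filesystem"):
        segments = partition_stream(payload, StreamMetadata(data_type))
        index_entries = fingerprint_segments(segments)
        print(data_type, [length for _, length in index_entries])
```

With the same 20 KiB payload, the "database" setting yields three segments and therefore three index entries, while the "filesystem" setting yields one. This is the trade-off the abstract names: smaller segments tend to raise the deduplication ratio, at the cost of a higher segment count and a larger fingerprint index.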