Data compression method and storage system

    公开(公告)号:US10055134B2

    公开(公告)日:2018-08-21

    申请号:US15146076

    申请日:2016-05-04

    发明人: Lizhen Li

    IPC分类号: H03M7/30 G06F3/06 G06F17/30

    摘要: Collecting, by a metadata controller (MDC) in a storage system, first information indicating a current working status of each computing node in the storage system, and second information indicating a current working status of each storage node in the storage system, determining, by the MDC based on the first information, a computing node that is one of the computing nodes and whose current working status is not a busy state as a current active computing node, determining, by the MDC based on the second information, a storage node that is one of the storage nodes and whose current working status is not a busy state as a current active storage node, and compressing, by a distributed software header (DSH) in the current active computing node or a memory manager (MM) in the current active storage node, a piece of data when the storage system needs to compress the data.

    MANAGING DATA RECORDS
    5.
    发明申请

    公开(公告)号:US20170068697A1

    公开(公告)日:2017-03-09

    申请号:US15363373

    申请日:2016-11-29

    IPC分类号: G06F17/30 H03M7/30

    摘要: Data records may be managed in a relational database by monitoring, a record length for a first data record in a page of memory, an amount of free space in the page, and a page length. In response to receiving an operator command to replace the first data record with a second data record, a database management system may determine whether an estimated record length of a compressed second data record is outside of the amount of free space in the page. In response to determining the estimated record length of a compressed second data record is outside of the amount of free space in the page, the database management system may determine whether an estimated length of a compressed page is outside of the page length. In response to determining the estimated length of a compressed page is within the page length, the page may be compressed.

    METHODS AND DEVICES FOR SOURCE-CODING AND DECODING OF DATA INVOLVING SYMBOL COMPRESSION
    6.
    发明申请
    METHODS AND DEVICES FOR SOURCE-CODING AND DECODING OF DATA INVOLVING SYMBOL COMPRESSION 有权
    用于源代码和解码涉及符号压缩的数据的方法和设备

    公开(公告)号:US20170063392A1

    公开(公告)日:2017-03-02

    申请号:US15119365

    申请日:2015-02-20

    摘要: A method of encoding input data in an encoder to generate corresponding encoded data includes splitting and/or transforming the input data into data chunks, analyzing symbols present in the input data and compressing the symbols as a function of occurrence of the symbols in the data chunks; generating code tables, frequency tables, and/or length of code word tables for the symbols present in the data chunks; computing sets of indices relating the symbols in each data chunk and/or the compressed symbols to entries in the code tables, the frequency tables, and/or the length of code word tables; and assembling the sets of indices, together with the frequency tables, the code tables, and/or information indicative of such tables, for generating the encoded data. An encoder that utilizes the method, together with a corresponding decoder, wherein the encoder and the decoder in combination form a codec.

    摘要翻译: 一种在编码器中编码输入数据以生成相应的编码数据的方法包括将输入数据分解和/或变换为数据块,分析存在于输入数据中的符号,并根据数据块中符号的出现来压缩符号 ; 为存在于数据块中的符号生成代码表,频率表和/或代码字表的长度; 将每个数据块中的符号和/或压缩符号相关联的索引的计算集合与代码表中的条目,频率表和/或代码字表的长度相关联; 以及组合索引集合以及频率表,代码表和/或表示这些表的信息,用于生成编码数据。 利用该方法的编码器与相应的解码器一起,其中编码器和解码器组合形成编解码器。

    DATA COMPRESSION AND DECOMPRESSION METHOD
    7.
    发明申请
    DATA COMPRESSION AND DECOMPRESSION METHOD 有权
    数据压缩和分解方法

    公开(公告)号:US20170041023A1

    公开(公告)日:2017-02-09

    申请号:US14833146

    申请日:2015-08-24

    发明人: Tuncay ULUDAG

    IPC分类号: H03M7/30

    摘要: A data compression and decompression algorithm performing the function of data compression and decompression by using the steps of: dividing a main data stream into sub data streams, calculating frequency of occurrence of sub data streams in the main data stream, repeating the process of calculating frequency of occurrence by changing the number of digits in sub data stream and by changing the starting digit position in main data stream, assigning codes to sub data streams based on their frequency occurrence values, calculating group dimension index for each group, selecting the group with the lowest group dimension index and placing codes of the group with the lowest group dimension index in a multi dimensional space wherein vector placement is utilized to eliminate the need to use digits that are common to neighboring codes therefore providing an additional compression.

    摘要翻译: 一种数据压缩和解压缩算法,其通过以下步骤执行数据压缩和解压缩的功能:将主数据流划分为子数据流,计算主数据流中的子数据流的出现频率,重复计算频率的处理 通过改变子数据流中的位数,并通过改变主数据流中的起始位数,根据其频率出现值分配代码到子数据流,计算每组的组维度索引, 最小组维度索引和具有最小组维度索引的组的放置代码在多维空间中,其中使用向量布局来消除对相邻代码共同使用的数字的需要,从而提供额外的压缩。

    MANAGING DATA RECORDS
    8.
    发明申请
    MANAGING DATA RECORDS 有权
    管理数据记录

    公开(公告)号:US20160315633A1

    公开(公告)日:2016-10-27

    申请号:US15139477

    申请日:2016-04-27

    IPC分类号: H03M7/30 G06F17/30

    摘要: Data records may be managed in a relational database by monitoring, a record length for a first data record in a page of memory, an amount of free space in the page, and a page length. In response to receiving an operator command to replace the first data record with a second data record, a database management system may determine whether an estimated record length of a compressed second data record is outside of the amount of free space in the page. In response to determining the estimated record length of a compressed second data record is outside of the amount of free space in the page, the database management system may determine whether an estimated length of a compressed page is outside of the page length. In response to determining the estimated length of a compressed page is within the page length, the page may be compressed.

    摘要翻译: 数据记录可以通过监视关系数据库中的一个内存中的第一个数据记录的记录长度,页面中的可用空间量以及页面长度来管理。 响应于接收到用第二数据记录替换第一数据记录的操作员命令,数据库管理系统可以确定压缩的第二数据记录的估计记录长度是否在页面中的可用空间量之外。 响应于确定压缩的第二数据记录的估计记录长度超出页面中的可用空间量,数据库管理系统可以确定压缩页面的估计长度是否在页面长度之外。 响应于确定压缩页面的估计长度在页面长度内,可以压缩该页面。