System and methods for bandwidth-efficient data encoding

    Publication number: US12189580B2

    Publication date: 2025-01-07

    Application number: US18449706

    Application date: 2023-08-15

    Abstract: A system and methods for bandwidth-efficient data encoding comprising a sequence analyzer configured to: analyze a received sequence dataset to determine a sequence dataset file type, scan the sequence dataset to maintain a count of unique characters contained therein, identify positions where the unique character count increases by a power of two, deconstruct the sequence dataset into a plurality of sourceblocks at the identified positions, and encode the plurality of sourceblocks using a data deconstruction engine and library management module to assign each sourceblock a reference code.
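A minimal sketch of the splitting step described in this abstract, under one possible reading: a boundary is placed wherever the running count of unique characters reaches a power of two (2, 4, 8, ...). The function names, the use of a plain string as the sequence dataset, and the sequential integer reference codes are illustrative assumptions, not the patented implementation.

```python
def split_at_power_of_two_growth(sequence: str) -> list[str]:
    """Split `sequence` where the count of unique characters seen so far
    first reaches a power of two greater than one (assumed interpretation)."""
    seen = set()
    boundaries = []
    for i, ch in enumerate(sequence):
        if ch not in seen:
            seen.add(ch)
            n = len(seen)
            # n is a power of two when exactly one bit is set.
            if n > 1 and (n & (n - 1)) == 0:
                boundaries.append(i)
    blocks = []
    start = 0
    for b in boundaries:
        blocks.append(sequence[start:b])
        start = b
    blocks.append(sequence[start:])
    return blocks


def assign_reference_codes(blocks: list[str]) -> tuple[list[int], dict[str, int]]:
    """Give each distinct sourceblock a sequential integer code (hypothetical scheme)."""
    codebook: dict[str, int] = {}
    for block in blocks:
        codebook.setdefault(block, len(codebook))
    return [codebook[b] for b in blocks], codebook


blocks = split_at_power_of_two_growth("AACGTTNAC")
codes, codebook = assign_reference_codes(blocks)
```

For the input above, the unique-character count reaches 2 at index 2 and 4 at index 4, so the sequence splits into three sourceblocks.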

    System and method for codebook management based on data source grouping

    Publication number: US12147667B2

    Publication date: 2024-11-19

    Application number: US18593931

    Application date: 2024-03-03

    Abstract: A system and method for codebook management is disclosed. Training datasets are obtained from various data sources. A similarity score is generated for each training dataset with reference to the other training datasets. In response to detecting a similarity score above a predetermined threshold for one or more of the other training datasets, a combined codebook is created based on training datasets that have a similarity score above a predetermined threshold. Based on the similarity score, multiple data sources are combined into a group, and the combined codebook is used for the data sources within the group. A mismatch performance metric can be computed for the combined codebook, and a revised combined codebook can be regenerated in response to the mismatch performance metric being above a predetermined threshold.
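The grouping step in this abstract can be sketched as follows. The abstract does not say how the similarity score is computed, so this example assumes a Jaccard similarity over fixed-size blocks and merges data sources with a small union-find. The block size, threshold value, and all function names are hypothetical.

```python
from itertools import combinations


def block_set(data: bytes, size: int = 2) -> set[bytes]:
    """Distinct non-overlapping blocks of `size` bytes (assumed feature set)."""
    return {data[i:i + size] for i in range(0, len(data) - size + 1, size)}


def jaccard(a: set[bytes], b: set[bytes]) -> float:
    """Jaccard similarity, used here as a stand-in for the similarity score."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def group_sources(datasets: dict[str, bytes], threshold: float) -> list[list[str]]:
    """Group data sources whose pairwise score exceeds `threshold`."""
    parent = {name: name for name in datasets}

    def find(x: str) -> str:
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    features = {name: block_set(data) for name, data in datasets.items()}
    for a, b in combinations(datasets, 2):
        if jaccard(features[a], features[b]) > threshold:
            parent[find(a)] = find(b)

    groups: dict[str, list[str]] = {}
    for name in datasets:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(g) for g in groups.values())


training = {"s1": b"abababab", "s2": b"abababcd", "s3": b"xyzwxyzw"}
groups = group_sources(training, threshold=0.4)
```

Each resulting group would then share one combined codebook built from its members' training data. The mismatch metric and codebook regeneration are not covered by this sketch.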

    SYSTEM AND METHOD FOR CODEBOOK MANAGEMENT BASED ON DATA SOURCE GROUPING

    Publication number: US20240377949A1

    Publication date: 2024-11-14

    Application number: US18773999

    Application date: 2024-07-16

    Abstract: A system and method for codebook management is disclosed. Training datasets are obtained from various data sources. A similarity score is generated for each training dataset with reference to the other training datasets. In response to detecting a similarity score above a predetermined threshold for one or more of the other training datasets, a combined codebook is created based on training datasets that have a similarity score above a predetermined threshold. Based on the similarity score, multiple data sources are combined into a group, and the combined codebook is used for the data sources within the group. A mismatch performance metric can be computed for the combined codebook, and a revised combined codebook can be regenerated in response to the mismatch performance metric being above a predetermined threshold.

    SYSTEM AND METHOD FOR DATA COMPACTION AND ENCRYPTION OF ANONYMIZED DATA RECORDS

    Publication number: US20240329837A1

    Publication date: 2024-10-03

    Application number: US18737962

    Application date: 2024-06-08

    Abstract: A system and method for data compaction and encryption of anonymized data records. A dataset may be pre-processed by dividing it into a plurality of sourceblocks at all reasonable sourceblock lengths and then counting how many times each sourceblock occurs in the dataset, resulting in a tally record of tokens and their count values. This tally record may then be anonymized and transmitted to a data deconstruction engine which, combined with a library manager, creates a codebook and performs optimization techniques on the codebook. The received anonymized tally record may be parsed into individual tokens by identifying the tokens with the highest count value. The tokens may then be sent, in descending order of count value, to the library manager, where each token may be assigned a codeword. A half-baked codebook is then created using the tokens and each token's unique codeword, before the half-baked codebook is sent to a system user.
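A rough sketch of the tally-and-assign flow in this abstract, under stated assumptions: "all reasonable sourceblock lengths" is taken to be a small caller-supplied set of lengths, blocks are cut without overlap, and codewords are sequential integers handed out in descending order of count. The anonymization and encryption steps are omitted, and all names are hypothetical.

```python
from collections import Counter


def tally_sourceblocks(data: bytes, lengths: tuple[int, ...] = (2, 3, 4)) -> Counter:
    """Count how often each fixed-length block occurs, for every length given."""
    tally: Counter = Counter()
    for n in lengths:
        for i in range(0, len(data) - n + 1, n):
            tally[data[i:i + n]] += 1
    return tally


def assign_codewords(tally: Counter) -> dict[bytes, int]:
    """Assign codewords in descending order of count; ties broken by token value."""
    ordered = sorted(tally.items(), key=lambda kv: (-kv[1], kv[0]))
    return {token: code for code, (token, _count) in enumerate(ordered)}


tally = tally_sourceblocks(b"abababab", lengths=(2, 4))
codebook = assign_codewords(tally)
```

Giving the most frequent tokens the smallest codewords is the usual motivation for processing them in descending count order, since smaller codes can be represented in fewer bits.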

    DATA COMPRESSION WITH PROTOCOL ADAPTATION

    Publication number: US20240243754A1

    Publication date: 2024-07-18

    Application number: US18423287

    Application date: 2024-01-25

    CPC classification number: H03M7/3059 G06N20/00 H03M7/6005

    Abstract: A system and method for data compression with protocol adaptation, that utilizes a codebook generator which leverages one or more machine/deep learning algorithms trained on at least a plurality of protocol policies in order to generate a protocol appendix and codebook, wherein original data is encoded by an encoder according to the codebook and sent to a decoder, but instead of just decoding the data according to the codebook to reconstruct the original data, data manipulation rules such as mapping and transformation are applied at the decoding stage to transform the decoded data into protocol formatted data.

    System and method for data compression with protocol adaptation

    Publication number: US11967974B2

    Publication date: 2024-04-23

    Application number: US18501987

    Application date: 2023-11-04

    CPC classification number: H03M7/3059 G06N20/00 H03M7/6005

    Abstract: A system and method for data compression with protocol adaptation, that utilizes a codebook generator which leverages one or more machine/deep learning algorithms trained on at least a plurality of protocol policies in order to generate a protocol appendix and codebook, wherein original data is encoded by an encoder according to the codebook and sent to a decoder, but instead of just decoding the data according to the codebook to reconstruct the original data, data manipulation rules such as mapping and transformation are applied at the decoding stage to transform the decoded data into protocol formatted data.

    SYSTEM AND METHOD FOR MANIPULATION OF COMPACTED DATA FILES

    Publication number: US20240086372A1

    Publication date: 2024-03-14

    Application number: US18516924

    Application date: 2023-11-21

    CPC classification number: G06F16/1752 G06F3/0608 G06F3/0641 G06F3/067

    Abstract: A system and method for manipulation of compacted data files, utilizing a reference codebook, a random-access engine, a data deconstruction engine, and a data reconstruction engine. The system may receive a data query pertaining to a data read or data write request, wherein the data file to be read from or written to is a compacted data file. A random-access engine may facilitate data manipulation processes by accessing a reference codebook associated with the compacted data file, a frequency table used to construct the reference codebook, and data query details. A data read request is supported by random-access search capabilities that may enable the locating and decoding of the bits corresponding to data query details. The random-access engine also facilitates data write processes: it may encode the data to be written, insert the encoded data into a compacted data file, and update the codebook as needed.

    SYSTEM AND METHODS FOR SECURE DEDUPLICATION OF COMPACTED DATA

    Publication number: US20230401173A1

    Publication date: 2023-12-14

    Application number: US18450402

    Application date: 2023-08-16

    CPC classification number: G06F16/1752 G06F3/067 G06F3/0641 G06F3/0608

    Abstract: A system and methods for secure deduplication of compacted data comprising a data deconstruction engine, a data reconstruction engine, a library manager, a reference codebook, and a codeword storage which performs simultaneous compaction and deduplication of data sets. A data set may be comprised of one or more sourcepackets which may be optimally deconstructed into a plurality of sourceblocks and wherein each sourceblock may be compared against a reference codebook that contains key-value pairs of a sourceblock and its associated reference code in order to determine if a received sourceblock is a duplicate of data already stored within the reference codebook. Non-duplicate sourceblocks can have a reference code algorithmically created and stored in the reference codebook, thereby ensuring that when a duplicate sourceblock is received, it will not be stored as duplicated data.
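The codebook lookup described in this abstract can be illustrated with a short sketch. It assumes fixed-size sourceblocks and sequential integer reference codes; the abstract only says codes are "algorithmically created", so that choice, the class name, and the method names are hypothetical. The security aspects are not modelled.

```python
class ReferenceCodebook:
    """Key-value store mapping each sourceblock to a reference code."""

    def __init__(self, block_size: int = 4) -> None:
        self.block_size = block_size
        self._code_for: dict[bytes, int] = {}   # sourceblock -> reference code
        self._block_for: list[bytes] = []       # reference code -> sourceblock

    def __len__(self) -> int:
        return len(self._block_for)

    def encode(self, data: bytes) -> list[int]:
        """Deconstruct `data` into sourceblocks and return their reference codes.
        A block already in the codebook is not stored again."""
        codes = []
        for i in range(0, len(data), self.block_size):
            block = data[i:i + self.block_size]
            code = self._code_for.get(block)
            if code is None:
                code = len(self._block_for)
                self._code_for[block] = code
                self._block_for.append(block)
            codes.append(code)
        return codes

    def decode(self, codes: list[int]) -> bytes:
        """Reconstruct the original data from its reference codes."""
        return b"".join(self._block_for[c] for c in codes)


codebook = ReferenceCodebook(block_size=4)
codes = codebook.encode(b"ABCDABCDWXYZ")
restored = codebook.decode(codes)
```

Because the repeated block `ABCD` maps to the same code both times, the codebook stores it once, which is how compaction and deduplication happen in the same pass.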
