Adaptation for entropy coding of blocks of image data

    公开(公告)号:US11477492B2

    公开(公告)日:2022-10-18

    申请号:US15668745

    申请日:2017-08-04

    Applicant: GOOGLE LLC

    Inventor: Ruijie Xu Dake He

    Abstract: Systems and methods are disclosed for entropy coding of blocks of image data. For example, methods may include partitioning a block of video data into a plurality of groups of elements; decoding, using an entropy decoder, data from an encoded bitstream to obtain elements of a first group from the plurality of groups of elements; determining a category based on the elements of the first group; based on the category, selecting a context for an element from a second group from the plurality of groups of elements; and decoding, using the entropy decoder using the context, data from the encoded bitstream to obtain the element of the second group from the plurality of groups of elements.

    Using Rate Distortion Cost as a Loss Function for Deep Learning

    公开(公告)号:US20220201316A1

    公开(公告)日:2022-06-23

    申请号:US17601639

    申请日:2019-03-21

    Applicant: Google LLC

    Abstract: An apparatus for encoding an image block includes a processor that presents, to a machine-learning model, the image block, obtains the partition decision for encoding the image block from the model, and encodes the image block using the partition decision. The model is trained to output a partition decision for encoding the image block by using training data for a plurality of training blocks as input, the training data including for a training block, partition decisions for encoding the training block, and, for each partition decision, a rate-distortion value resulting from encoding the training block using the partition decision. The model is trained using a loss function combining a partition loss function based upon a relationship between the partition decisions and respective predicted partitions, and a rate-distortion cost loss function based upon a relationship between the rate-distortion values and respective predicted rate-distortion values.

    Asymmetric probability model update and entropy coding precision

    公开(公告)号:US11218737B2

    公开(公告)日:2022-01-04

    申请号:US16042261

    申请日:2018-07-23

    Applicant: GOOGLE LLC

    Abstract: Asymmetric probability model updating and entropy coding includes using different numbers of bits for storing probabilities of a probability model and for entropy coding symbols using that probability model. The probabilities of a probability model are updated according to values of syntax elements decoded from a bitstream. The probabilities are associated with possible values of the syntax elements and are stored using a first bit precision. Based on the updated probabilities, a second bit precision to use to entropy decode the syntax elements is determined. The second bit precision is less than the first bit precision. The syntax elements are then entropy decoded using the second bit precision, such as to produce quantized transform coefficients, which may be further processed and output to an output video stream. Using the first bit precision to entropy decode the syntax elements results in a lower compression throughput than using the second bit precision.

    SAME FRAME MOTION ESTIMATION AND COMPENSATION

    公开(公告)号:US20210021859A1

    公开(公告)日:2021-01-21

    申请号:US17060483

    申请日:2020-10-01

    Applicant: GOOGLE LLC

    Inventor: Aki Kuusela Dake He

    Abstract: Motion estimation or compensation functionality of a hardware component is used to encode or decode key frames and other video frames. The hardware component includes a memory, which may, for example, be a local static random access memory or an external dynamic random access memory. Upon a block of a frame being encoded or decoded, data associated with that block is stored in the memory. That data can then be processed by motion estimation or motion compensation for use in encoding or decoding one or more later blocks within the same frame. The data may, for example, be stored in the memory after operations for reconstruction and loop filtering have been performed. The data stored in the memory may effectively be processed using traditional inter-prediction operations, such as to identify similar video objects within blocks of the same frame.

    Using multiple probability models for entropy coding in video compression

    公开(公告)号:US10887595B2

    公开(公告)日:2021-01-05

    申请号:US16835379

    申请日:2020-03-31

    Applicant: GOOGLE LLC

    Inventor: Dake He

    Abstract: Entropy encoding and decoding a sequence of symbols using probability mixing is disclosed. A method includes for at least a symbol, at a position of the symbols, determining a mixed probability, by: approximating a first conditional probability for coding the symbol, the first conditional probability being a conditional probability of the symbol given a sub-sequence of the sequence having a first value; approximating a second conditional probability for coding the symbol, the second conditional probability being a conditional probability of the symbol given the sub-sequence having a second value; and determining, using the first conditional probability and the second conditional probability, the mixed probability for coding the symbol; and coding the symbol using the mixed probability.

    ADAPTATION OF SCAN ORDER FOR ENTROPY CODING
    77.
    发明申请

    公开(公告)号:US20200351502A1

    公开(公告)日:2020-11-05

    申请号:US16930903

    申请日:2020-07-16

    Applicant: GOOGLE LLC

    Inventor: Dake He

    Abstract: An apparatus includes a processor that is configured to decode, as a first group, coefficients of a first row and a first column of a transform block using a first zig-zag scan order; and decode at least a subset of remaining coefficients of the transform block using a second scan order. Executable instructions in a non-transitory computer-readable storage medium include instructions to code, as a first group, coefficients of a first row and a first column of the transform block using a first scan order, where the transform block includes the coefficients of the first row, coefficients of the first column, and remaining coefficients; partition the remaining coefficients into a remaining first row and a remaining first column and other coefficients; code the remaining first row and the remaining first column using a second scan order; and code the other coefficients using a third scan order.

    Efficient Use of Quantization Parameters in Machine-Learning Models for Video Coding

    公开(公告)号:US20200275101A1

    公开(公告)日:2020-08-27

    申请号:US16868729

    申请日:2020-05-07

    Applicant: GOOGLE LLC

    Abstract: Encoding an image block using a quantization parameter includes presenting, to an encoder that includes a machine-learning model, the image block and a value derived from the quantization parameter, where the value is a result of a non-linear function using the quantization parameter as input, where the non-linear function relates to a second function used to calculate, using the quantization parameter, a Lagrange multiplier that is used in a rate-distortion calculation, and where the machine-learning model is trained to output mode decision parameters for encoding the image block; obtaining the mode decision parameters from the encoder; and encoding, in a compressed bitstream, the image block using the mode decision parameters.

    PROBABILITY MAPPING FOR ENTROPY CODING
    79.
    发明申请

    公开(公告)号:US20200213599A1

    公开(公告)日:2020-07-02

    申请号:US16812539

    申请日:2020-03-09

    Applicant: GOOGLE LLC

    Abstract: A method for coding a frame of a video stream includes selecting a first initial probability distribution for coding at least a first portion of the frame; updating, to obtain an updated first initial probability distribution and using backward adaptivity, the first initial probability distribution while coding the first portion of the frame; mapping the updated first initial probability distribution to a second initial probability distribution; and coding a second portion of the frame using the second initial probability distribution as an initial probability distribution. The first values of the first initial probability distribution are described using M bits, wherein M is a first positive integer. Second values of the updated first initial probability distribution are described using N bits, where N is a second positive integer that is greater than M. Third values of the second initial probability distribution are described using M bits.

    Context-adaptive scanning
    80.
    发明授权

    公开(公告)号:US10701238B1

    公开(公告)日:2020-06-30

    申请号:US16408126

    申请日:2019-05-09

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for context-adaptive scanning of digital components. In one aspect, a method comprises: selecting a given digital component from among a plurality of digital components based on a current scanning priority of the given digital component; scanning the given digital component, comprising determining a current state of the given digital component; determining a current context of the given digital component based on one or more of: (i) the current state of the given digital component, or (ii) a current scan index of the given digital component that specifies a number of times the given digital component has been scanned; determining an updated scanning priority of the given digital component based on the current context of the given digital component; and re-scanning the given digital component according to the updated scanning priority.

Patent Agency Ranking