-
公开(公告)号:US11477492B2
公开(公告)日:2022-10-18
申请号:US15668745
申请日:2017-08-04
Applicant: GOOGLE LLC
IPC: H04N19/44 , H04N19/91 , H04N19/60 , H04N19/124 , H04N19/176 , H04N19/129 , H04N19/159
Abstract: Systems and methods are disclosed for entropy coding of blocks of image data. For example, methods may include partitioning a block of video data into a plurality of groups of elements; decoding, using an entropy decoder, data from an encoded bitstream to obtain elements of a first group from the plurality of groups of elements; determining a category based on the elements of the first group; based on the category, selecting a context for an element from a second group from the plurality of groups of elements; and decoding, using the entropy decoder using the context, data from the encoded bitstream to obtain the element of the second group from the plurality of groups of elements.
-
公开(公告)号:US20220201316A1
公开(公告)日:2022-06-23
申请号:US17601639
申请日:2019-03-21
Applicant: Google LLC
Inventor: Claudionor Coelho , Aki Kuusela , Joseph Young , Shan Li , Dake He
IPC: H04N19/147 , H04N19/176 , H04N19/96 , G06T9/00
Abstract: An apparatus for encoding an image block includes a processor that presents, to a machine-learning model, the image block, obtains the partition decision for encoding the image block from the model, and encodes the image block using the partition decision. The model is trained to output a partition decision for encoding the image block by using training data for a plurality of training blocks as input, the training data including for a training block, partition decisions for encoding the training block, and, for each partition decision, a rate-distortion value resulting from encoding the training block using the partition decision. The model is trained using a loss function combining a partition loss function based upon a relationship between the partition decisions and respective predicted partitions, and a rate-distortion cost loss function based upon a relationship between the rate-distortion values and respective predicted rate-distortion values.
-
公开(公告)号:US11218737B2
公开(公告)日:2022-01-04
申请号:US16042261
申请日:2018-07-23
Applicant: GOOGLE LLC
Inventor: Jingning Han , Aki Kuusela , Stanislav Vitvitskyy , Yaowu Xu , Ching-Han Chiang , Dake He
Abstract: Asymmetric probability model updating and entropy coding includes using different numbers of bits for storing probabilities of a probability model and for entropy coding symbols using that probability model. The probabilities of a probability model are updated according to values of syntax elements decoded from a bitstream. The probabilities are associated with possible values of the syntax elements and are stored using a first bit precision. Based on the updated probabilities, a second bit precision to use to entropy decode the syntax elements is determined. The second bit precision is less than the first bit precision. The syntax elements are then entropy decoded using the second bit precision, such as to produce quantized transform coefficients, which may be further processed and output to an output video stream. Using the first bit precision to entropy decode the syntax elements results in a lower compression throughput than using the second bit precision.
-
公开(公告)号:US20210099720A1
公开(公告)日:2021-04-01
申请号:US17121820
申请日:2020-12-15
Applicant: GOOGLE LLC
Inventor: Joseph Young , Dake He
IPC: H04N19/18 , H04N19/176 , H04N19/13 , H04N19/129 , H04N19/91 , H04N19/93 , H04N19/147 , H04N19/184 , H04N19/124 , H04N19/159 , H04N19/44
Abstract: An apparatus for decoding a current block includes a processor that is configured to obtain a transform class of a transform type used for decoding a transform block of the current block; select, based on the transform class, a template for coding a value related to a transform coefficient at a row and a column of the transform block; obtain, using the template, an index of a probability distribution in a table of probability distributions; and decode, from a compressed bitstream, the value using the probability distribution.
-
公开(公告)号:US20210021859A1
公开(公告)日:2021-01-21
申请号:US17060483
申请日:2020-10-01
Applicant: GOOGLE LLC
Inventor: Aki Kuusela , Dake He
IPC: H04N19/513 , H04N19/70 , H04N19/14 , H04N19/423 , H04N19/433 , H04N19/593 , H04N19/176 , H04N19/107
Abstract: Motion estimation or compensation functionality of a hardware component is used to encode or decode key frames and other video frames. The hardware component includes a memory, which may, for example, be a local static random access memory or an external dynamic random access memory. Upon a block of a frame being encoded or decoded, data associated with that block is stored in the memory. That data can then be processed by motion estimation or motion compensation for use in encoding or decoding one or more later blocks within the same frame. The data may, for example, be stored in the memory after operations for reconstruction and loop filtering have been performed. The data stored in the memory may effectively be processed using traditional inter-prediction operations, such as to identify similar video objects within blocks of the same frame.
-
公开(公告)号:US10887595B2
公开(公告)日:2021-01-05
申请号:US16835379
申请日:2020-03-31
Applicant: GOOGLE LLC
Inventor: Dake He
IPC: H04N7/18 , H04N19/13 , H04N19/18 , H04N19/159 , H04N19/196 , H04N19/124 , H04N19/176 , H04N19/91 , H04N19/96 , H04N19/129
Abstract: Entropy encoding and decoding a sequence of symbols using probability mixing is disclosed. A method includes for at least a symbol, at a position of the symbols, determining a mixed probability, by: approximating a first conditional probability for coding the symbol, the first conditional probability being a conditional probability of the symbol given a sub-sequence of the sequence having a first value; approximating a second conditional probability for coding the symbol, the second conditional probability being a conditional probability of the symbol given the sub-sequence having a second value; and determining, using the first conditional probability and the second conditional probability, the mixed probability for coding the symbol; and coding the symbol using the mixed probability.
-
公开(公告)号:US20200351502A1
公开(公告)日:2020-11-05
申请号:US16930903
申请日:2020-07-16
Applicant: GOOGLE LLC
Inventor: Dake He
IPC: H04N19/129 , H04N19/18 , H04N19/172 , H04N19/176 , H04N19/182 , H04N19/61
Abstract: An apparatus includes a processor that is configured to decode, as a first group, coefficients of a first row and a first column of a transform block using a first zig-zag scan order; and decode at least a subset of remaining coefficients of the transform block using a second scan order. Executable instructions in a non-transitory computer-readable storage medium include instructions to code, as a first group, coefficients of a first row and a first column of the transform block using a first scan order, where the transform block includes the coefficients of the first row, coefficients of the first column, and remaining coefficients; partition the remaining coefficients into a remaining first row and a remaining first column and other coefficients; code the remaining first row and the remaining first column using a second scan order; and code the other coefficients using a third scan order.
-
公开(公告)号:US20200275101A1
公开(公告)日:2020-08-27
申请号:US16868729
申请日:2020-05-07
Applicant: GOOGLE LLC
Inventor: Claudionor Coelho , Dake He , Aki Kuusela , Shan Li
IPC: H04N19/124 , H04N19/164 , H04N19/176 , H04N19/96
Abstract: Encoding an image block using a quantization parameter includes presenting, to an encoder that includes a machine-learning model, the image block and a value derived from the quantization parameter, where the value is a result of a non-linear function using the quantization parameter as input, where the non-linear function relates to a second function used to calculate, using the quantization parameter, a Lagrange multiplier that is used in a rate-distortion calculation, and where the machine-learning model is trained to output mode decision parameters for encoding the image block; obtaining the mode decision parameters from the encoder; and encoding, in a compressed bitstream, the image block using the mode decision parameters.
-
公开(公告)号:US20200213599A1
公开(公告)日:2020-07-02
申请号:US16812539
申请日:2020-03-09
Applicant: GOOGLE LLC
Inventor: Joseph Young , Dake He
IPC: H04N19/18 , H04N19/176 , H04N19/13 , H04N19/129 , H04N19/91 , H04N19/93 , H04N19/147 , H04N19/184 , H04N19/124 , H04N19/159 , H04N19/44
Abstract: A method for coding a frame of a video stream includes selecting a first initial probability distribution for coding at least a first portion of the frame; updating, to obtain an updated first initial probability distribution and using backward adaptivity, the first initial probability distribution while coding the first portion of the frame; mapping the updated first initial probability distribution to a second initial probability distribution; and coding a second portion of the frame using the second initial probability distribution as an initial probability distribution. The first values of the first initial probability distribution are described using M bits, wherein M is a first positive integer. Second values of the updated first initial probability distribution are described using N bits, where N is a second positive integer that is greater than M. Third values of the second initial probability distribution are described using M bits.
-
公开(公告)号:US10701238B1
公开(公告)日:2020-06-30
申请号:US16408126
申请日:2019-05-09
Applicant: Google LLC
Inventor: Oleg Golubitsky , Dake He
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for context-adaptive scanning of digital components. In one aspect, a method comprises: selecting a given digital component from among a plurality of digital components based on a current scanning priority of the given digital component; scanning the given digital component, comprising determining a current state of the given digital component; determining a current context of the given digital component based on one or more of: (i) the current state of the given digital component, or (ii) a current scan index of the given digital component that specifies a number of times the given digital component has been scanned; determining an updated scanning priority of the given digital component based on the current context of the given digital component; and re-scanning the given digital component according to the updated scanning priority.
-
-
-
-
-
-
-
-
-