Block-wise entropy coding method in neural image compression

    公开(公告)号:US12225205B2

    公开(公告)日:2025-02-11

    申请号:US17825575

    申请日:2022-05-26

    Abstract: Systems and methods for block-wise entropy coding methods in neural image compression is provided. A method includes: receiving a bitstream that includes an image; partitioning the image into a plurality of blocks; compressing each of the plurality of blocks by a neural network-based encoder; obtaining compressed features by obtaining a compressed feature for each block from among the plurality of blocks in the image; processing the compressed features by an entropy encoder to generate a first compressed bitstream; obtaining a plurality of reshaped compressed features by concatenating the compressed features; processing the plurality of reshaped compressed features by the entropy encoder to generate a second compressed bitstream; and encoding the bitstream including the image based on the second compressed bitstream.

    Method and Apparatus for Codec Performance Comparison

    公开(公告)号:US20250024042A1

    公开(公告)日:2025-01-16

    申请号:US18621713

    申请日:2024-03-29

    Abstract: Methods, apparatus, and computer readable storage medium evaluating codec performance. One method includes obtaining m anchor data points each generated based on a respective anchor encoded video bitstream; obtaining n test data points each generated based on a respective encoded test video bitstream, n being an integer; fitting the m anchor data points with an anchor curve, the anchor curve being based on an anchor polynomial, wherein the anchor polynomial is monotonic in an x-axis range; fitting the n test data points with a test curve, the anchor curve being based on a test polynomial, wherein the test polynomial is monotonic in the x-axis range; and evaluating the test codec performance based on the anchor curve and the test curve, to obtain an evaluation result.

    Affine motion model derivation method

    公开(公告)号:US12200222B2

    公开(公告)日:2025-01-14

    申请号:US18521810

    申请日:2023-11-28

    Abstract: Systems and methods are described for video coding using affine motion prediction. In an example method, motion vector gradients are determined from respective motion vectors of a plurality of neighboring sub-blocks neighboring a current block. An estimate of at least one affine parameter for the current block is determined based on the motion vector gradients. An affine motion model is determined based at least in part on the estimated affine parameter(s), and a prediction of the current block is generated using the affine motion model. The estimated parameter(s) may be used in the affine motion model itself. Alternatively, the estimated parameter(s) may be used in a prediction of the affine motion model. In some embodiments, only neighboring sub-blocks above and/or to the left of the current block are used in estimating the affine parameter(s).

    Error Mitigation For Sublayers In Video Coding

    公开(公告)号:US20250016349A1

    公开(公告)日:2025-01-09

    申请号:US18777052

    申请日:2024-07-18

    Inventor: Ye-Kui Wang

    Abstract: A video coding mechanism is disclosed. The mechanism includes encoding a bitstream comprising a video parameter set (VPS) and one or more sublayers. A buffering period (BP) supplemental enhancement information (SEI) message comprising a BP maximum sublayers minus one (bp_max_sublayers_minus1) is also encoded into the bitstream. The bp_max_sublayers_minus1 is set to a value in a range of zero to a maximum number of sublayers indicated in the VPS. A hypothetical reference decoder (HRD) is initialized based on the BP SEI message. A set of bitstream conformance tests are performed on the sublayers. The bitstream is stored for communication toward a decoder.

    Method and apparatus for real-time SAO parameter estimation

    公开(公告)号:US12170780B2

    公开(公告)日:2024-12-17

    申请号:US18369315

    申请日:2023-09-18

    Abstract: The disclosure provides a sample adaptive offset (SAO) encoder. The SAO encoder includes a statistics collection (SC) block and a rate distortion optimization (RDO) block coupled to the SC block. The SC block receives a set of deblocked pixels and a set of original pixels. The SC block categorizes each deblocked pixel of the set of deblocked pixels in at least one of a plurality of band and edge categories. The SC block estimates an error in each category as difference between a deblocked pixel of the set of deblocked pixels and corresponding original pixel of the set of original pixels. The RDO block determines a set of candidate offsets associated with each category and selects a candidate offset with a minimum RD cost. The minimum RD cost is used by a SAO type block and a decision block to generate final offsets for the SAO encoder.

    IMAGE PROCESSING APPARATUS AND METHOD

    公开(公告)号:US20240380888A1

    公开(公告)日:2024-11-14

    申请号:US18784695

    申请日:2024-07-25

    Abstract: An upper limit value of the number of bins allocated to a processing target subblock by distributing the number of bins among nonzero subblocks is set, a syntax element value regarding the processing target subblock is derived by using coefficient data derived from image data so that the number of bins does not exceed the upper limit value, and the syntax element value derived is encoded and coded data is generated. The present disclosure can be applied, for example, to an image processing apparatus, an image encode apparatus, an image decode apparatus, a transmitting apparatus, a receiving apparatus, a transmitting/receiving apparatus, an information processing apparatus, an imaging apparatus, a reproducing apparatus, an electronic device, an image processing method, an information processing method, and the like.

    Method and apparatus for decoding a video signal

    公开(公告)号:US12120352B2

    公开(公告)日:2024-10-15

    申请号:US17208100

    申请日:2021-03-22

    Abstract: The present invention relates to a method for decoding a video signal, comprising the steps of: acquiring a transform size flag of the current macroblock from a video signal; checking the number of non-zero transform coefficients at each pixel position in a first transform block which corresponds to the transform size flag; changing a scan order of the first transform block by prioritizing the position of the pixel having the greatest number of the non-zero transform coefficients in the first transform block; determining the number of the non-zero transform coefficients at each pixel position in a second transform block, and setting the changed scan order of the first transform block as an initialized scan order of the second transform block; adding the number of the non-zero transform coefficients at each pixel position in the first transform block and the number of the non-zero transform coefficients at each pixel position in the second transform block, and changing the scan order of the second transform block by prioritizing the position of the pixel having the greatest number of the non-zero transform coefficients; and decoding the transform coefficients arranged in the scan order changed in the previous step, wherein the first transform block and the second transform block have sizes corresponding to the transform size flag, and are contained in the current macroblock.

Patent Agency Ranking