Method and apparatus for video coding

    公开(公告)号:US11948090B2

    公开(公告)日:2024-04-02

    申请号:US17096126

    申请日:2020-11-12

    摘要: In the present disclosure, a method for compressing a feature map is provided, where the feature map is generated by passing a first input through a deep neural network (DNN). A respective optimal index order and a respective optimal unifying method are determined for each of super-blocks that are partitioned from the feature map. A selective structured unification (SSU) layer is subsequently determined based on the respective optimal index order and the respective optimal unifying method for each of the super-blocks. The SSU layer is added to the DNN to form an updated DNN, and is configured to perform unification operations on the feature map. Further, a first estimated output is determined, where the first estimated output is generated by passing the first input through the updated DNN.

    Content-adaptive online training with image substitution in neural image compression

    公开(公告)号:US11849118B2

    公开(公告)日:2023-12-19

    申请号:US17730020

    申请日:2022-04-26

    摘要: Aspects of the disclosure provide a method and an apparatus for video encoding. The apparatus includes processing circuitry configured to perform an iterative update of sample values of a plurality of samples in an initial input image. The iterative update includes generating a coded representation of a final input image based on the final input image by an encoding neural network (NN) and at least one training module. The final input image has been updated from the initial input image by a number of iterations of the iterative update. The iterative update includes generating a reconstructed image of the final input image based on the coded representation of the final input image by a decoding NN. One of a rate-distortion loss for the final input image or the number of iterations of the iterative update satisfies a pre-determined condition. An encoded image corresponding to the final input image is generated.

    Method and apparatus for escape reorder mode using a codebook index for neural network model compression

    公开(公告)号:US11594008B2

    公开(公告)日:2023-02-28

    申请号:US17085212

    申请日:2020-10-30

    摘要: A method of an escape reorder mode for neural network model compression, is performed by at least one processor, and includes determining whether a frequency count of a codebook index included in a predicted codebook is less than a predetermined value, the codebook index corresponding to a neural network. The method further includes, based on the frequency count of the codebook index being determined to be greater than the predetermined value, maintaining the codebook index, and based on the frequency count of the codebook index being determined to be less than the predetermined value, assigning the codebook index to be an escape index of 0 or a predetermined number. The method further includes encoding the codebook index, and transmitting the encoded codebook index.

    Method and apparatus for multi-scale neural image compression with intra-prediction residuals

    公开(公告)号:US11582470B2

    公开(公告)日:2023-02-14

    申请号:US17333319

    申请日:2021-05-28

    摘要: A method of multi-scale neural image compression with intra-prediction residuals is performed by at least one processor and includes downsampling an input image, generating a current predicted image, based on a previously-recovered predicted image, and generating a prediction residual based on a difference between the downsampled input image and the generated current predicted image. The method further includes encoding the generated prediction residual, decoding the encoded prediction residual, and generating a currently-recovered predicted image based on an addition of the current predicted image and the decoded prediction residual. The method further includes upsampling the currently-recovered predicted image, generating a scale residual based on a difference between the input image and the upsampled currently-recovered predicted image, and encoding the scale residual.

    METHOD AND APPARATUS FOR ADAPTIVE NEURAL IMAGE COMPRESSION WITH RATE CONTROL BY META-LEARNING

    公开(公告)号:US20220230362A1

    公开(公告)日:2022-07-21

    申请号:US17365371

    申请日:2021-07-01

    IPC分类号: G06T9/00 G06N3/08

    摘要: A method of adaptive neural image compression with rate control by meta-learning includes receiving an input image and a hyperparameter; and encoding the received input image, based on the received hyperparameter, using an encoding neural network, to generate a compressed representation. The encoding includes performing a first shared encoding on the received input image, using a first shared encoding layer having first shared encoding parameters, performing a first adaptive encoding on the received input image, using a first adaptive encoding layer having first adaptive encoding parameters, combining the first shared encoded input image and the first adaptive encoded input image, to generate a first combined output, and performing a second shared encoding on the first combined output, using a second shared encoding layer having second shared encoding parameters.

    MULTI-TASK NEURAL NETWORK BY MICRO-STRUCTURED PARAMETER SHARING FOR MULTI-QUALITY LOOP FILTER

    公开(公告)号:US20220222505A1

    公开(公告)日:2022-07-14

    申请号:US17500339

    申请日:2021-10-13

    IPC分类号: G06N3/04 G06N3/08

    摘要: Video processing with a multi-quality loop filter using a multi-task neural network is performed by at least one processor and includes generating a first set of masked weight parameters, based on an input and a plurality of quantization parameter values with a corresponding first set of masks and first plurality of weight parameters, for a first set of shared neural network layers, selecting a second set of task specific neural network layers for the plurality of quantization parameter values with a second plurality of weight parameters, based on the plurality of quantization parameter values, computing an inference output, based on the first set of masked weight parameters and the second plurality of weight parameters, and outputting the computed inference output as an enhanced result.