Front-end architecture for neural network based video coding

    公开(公告)号:US12231666B2

    公开(公告)日:2025-02-18

    申请号:US17643383

    申请日:2021-12-08

    Abstract: Techniques are described herein for processing video data using a neural network system. For instance, a process can include generating, by a first convolutional layer of an encoder sub-network of the neural network system, output values associated with a luminance channel of a frame. The process can include generating, by a second convolutional layer of the encoder sub-network, output values associated with at least one chrominance channel of the frame. The process can include generating, by a third convolutional layer based on the output values associated with the luminance channel of the frame and the output values associated with the at least one chrominance channel of the frame, a combined representation of the frame. The process can further include generating encoded video data based on the combined representation of the frame.

    QUANTIZATION OFFSETS FOR DEPENDENT QUANTIZATION IN VIDEO CODING

    公开(公告)号:US20240414339A1

    公开(公告)日:2024-12-12

    申请号:US18734509

    申请日:2024-06-05

    Abstract: A method of processing video data includes determining a quantization level for a coefficient of a current block from a plurality of quantization levels; determining an offset value based on the quantization level, wherein the offset value is a first offset value based on the quantization level being a first quantization level or a second, different offset value based on the quantization level being a second quantization level; determining a quantization parameter or an inverse-quantization parameter for the coefficient based on the determined offset value; and as part of encoding or decoding the current block, performing one of quantization or inverse-quantization for the coefficient based on the determined quantization parameter or the determined inverse-quantization parameter.

    Machine learning based flow determination for video coding

    公开(公告)号:US12003734B2

    公开(公告)日:2024-06-04

    申请号:US17676510

    申请日:2022-02-21

    CPC classification number: H04N19/139 H04N19/172 H04N19/186 H04N23/632

    Abstract: Systems and techniques are described herein for processing video data. In some aspects, a method can include obtain, by a machine learning system, input video data. The input video data includes one or more luminance components for a current frame. The method can include determining, by the machine learning system, motion information for the luminance component(s) of the current frame and motion information for one or more chrominance components of the current frame using the luminance component(s) for the current frame. In some cases, the method can include determining the motion information for the luminance component(s) based on the luma component(s) of the current frame and at least one reconstructed luma component of a previous frame. In some cases, the method can further include determining the motion information for the chrominance component(s) of the current frame using the motion information determined for the luminance component(s) of the current frame.

    Processing video data picture size change request and notification messages

    公开(公告)号:US11924464B2

    公开(公告)日:2024-03-05

    申请号:US17819703

    申请日:2022-08-15

    CPC classification number: H04N19/59 H04N19/105 H04N19/593

    Abstract: An example device for requesting a reduced resolution for video data includes a memory configured to store video data; and one or more processors implemented in circuitry and configured to: decode a first sequence of pictures of a bitstream, the first sequence of pictures having a first resolution; in response to determining that the device is to enter a power saving mode, send a message requesting a reduced resolution relative to the first resolution for a second sequence of pictures, the second sequence of pictures being subsequent to the first sequence of pictures in coding order; and decode the second sequence of pictures of the video data of the bitstream, the second sequence of pictures having the reduced resolution. The reduced resolution may be reduced spatial resolution, reduced temporal resolution (frame rate), or both.

    Decoded picture buffer (DPB) operations and access unit delimiter (AUD)

    公开(公告)号:US11917174B2

    公开(公告)日:2024-02-27

    申请号:US17338468

    申请日:2021-06-03

    CPC classification number: H04N19/44 H04N19/30 H04N19/70

    Abstract: Systems, methods, and computer-readable storage media are provided for decoded picture buffer (DPB) operations and rewriting access unit delimiters (AUDs) after bitstream extractions. An example method can include storing one or more pictures associated with an access unit (AU) in a decoded picture buffer (DPB), the AU including a first plurality of pictures, the first plurality of pictures corresponding to a plurality of video coding layers; after each picture of a second plurality of pictures associated with the AU is removed from a coded picture buffer (CPB), removing at least one picture of the one or more pictures from the DPB; and storing, in the DPB, each picture of the second plurality of pictures removed from the CPB.

Patent Agency Ranking