Front-end architecture for neural network based video coding

    Publication number: US12231666B2

    Publication date: 2025-02-18

    Application number: US17643383

    Filing date: 2021-12-08

    Abstract: Techniques are described herein for processing video data using a neural network system. For instance, a process can include generating, by a first convolutional layer of an encoder sub-network of the neural network system, output values associated with a luminance channel of a frame. The process can include generating, by a second convolutional layer of the encoder sub-network, output values associated with at least one chrominance channel of the frame. The process can include generating, by a third convolutional layer based on the output values associated with the luminance channel of the frame and the output values associated with the at least one chrominance channel of the frame, a combined representation of the frame. The process can further include generating encoded video data based on the combined representation of the frame.
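The three-layer front end in this abstract can be illustrated with a toy, pure-Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the 4:2:0 layout (luma at twice the chroma resolution), the stride-2 luma layer, the fixed kernel values, the 1x1 combining layer, and the names `conv2d`, `combine_1x1`, and `encode_front_end`. A real system would use learned weights and many feature channels.

```python
def conv2d(plane, kernel, stride=1):
    """Valid 2-D cross-correlation of one sample plane with one kernel."""
    kh, kw = len(kernel), len(kernel[0])
    out = []
    for y in range(0, len(plane) - kh + 1, stride):
        row = []
        for x in range(0, len(plane[0]) - kw + 1, stride):
            acc = 0.0
            for j in range(kh):
                for i in range(kw):
                    acc += plane[y + j][x + i] * kernel[j][i]
            row.append(acc)
        out.append(row)
    return out


def combine_1x1(feature_maps, weights):
    """1x1 convolution across channels: a weighted sum at each position."""
    h, w = len(feature_maps[0]), len(feature_maps[0][0])
    return [
        [sum(wt * fm[y][x] for wt, fm in zip(weights, feature_maps))
         for x in range(w)]
        for y in range(h)
    ]


def encode_front_end(luma, cb, cr):
    # First layer (luma branch): stride 2 is an assumption that brings the
    # luma features down to the chroma resolution of a 4:2:0 frame.
    luma_feat = conv2d(luma, [[0.25, 0.25], [0.25, 0.25]], stride=2)
    # Second layer (chroma branch): the same hypothetical 1x1 kernel is
    # applied to each chroma plane.
    cb_feat = conv2d(cb, [[1.0]])
    cr_feat = conv2d(cr, [[1.0]])
    # Third layer: merge both branches into one combined representation.
    return combine_1x1([luma_feat, cb_feat, cr_feat], [0.5, 0.25, 0.25])
```

Calling `encode_front_end` with a 4x4 luma plane and two 2x2 chroma planes returns a single 2x2 map, which stands in for the combined representation that the later encoder stages would consume.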

    Derived intra prediction modes and most probable modes in video coding

    Publication number: US12192469B2

    Publication date: 2025-01-07

    Application number: US17804972

    Filing date: 2022-06-01

    Abstract: A method of encoding or decoding video data comprises: for each respective intra prediction mode of a plurality of intra prediction modes in a most-probable mode (MPM) list: generating, based on reference samples for a template region and using the respective intra prediction mode, prediction samples for the template region; and determining a cost for the respective intra prediction mode; determining a first intra prediction mode and a second intra prediction mode in the MPM list having lowest costs; determining a preliminary prediction block for the first intra prediction mode and a preliminary prediction block for the second intra prediction mode; generating a prediction block based on a fusion of the preliminary prediction blocks weighted according to a weight for the first intra prediction mode and a weight for the second intra prediction mode.
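The selection-and-fusion flow in this abstract can be sketched as follows. The abstract does not say how the cost or the weights are computed, so this sketch assumes a sum of absolute differences (SAD) against the reconstructed template and weights inversely proportional to cost. The callables `predict_template` and `predict_block` are hypothetical stand-ins for a codec's intra predictor, and samples are flattened to 1-D lists for brevity.

```python
def sad(a, b):
    """Sum of absolute differences between two equal-length sample lists."""
    return sum(abs(x - y) for x, y in zip(a, b))


def fuse_two_best(mpm_list, predict_template, predict_block, template_recon):
    # Cost of each MPM candidate: how well it predicts the template region
    # whose reconstructed samples are already known.
    costs = {m: sad(predict_template(m), template_recon) for m in mpm_list}

    # The two modes with the lowest template costs.
    first, second = sorted(mpm_list, key=lambda m: costs[m])[:2]
    c1, c2 = costs[first], costs[second]

    # Assumed weighting rule: the lower-cost mode gets the larger weight.
    total = c1 + c2
    w1 = 0.5 if total == 0 else c2 / total
    w2 = 1.0 - w1

    # Preliminary prediction blocks, then their weighted fusion.
    p1, p2 = predict_block(first), predict_block(second)
    fused = [int(w1 * a + w2 * b + 0.5) for a, b in zip(p1, p2)]
    return first, second, fused
```

Because the costs come from samples that encoder and decoder both already have, both sides can run the same selection without any extra signalling for the two chosen modes.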

    Model-based motion vector difference derivation and template matching prediction for video coding

    Publication number: US12177475B2

    Publication date: 2024-12-24

    Application number: US17586492

    Filing date: 2022-01-27

    Abstract: An example device for decoding video data includes a memory configured to store video data; and one or more processors configured to: decode data representing an initial motion vector for a current block of the video data, the initial motion vector having integer-motion vector difference (MVD) precision; determine a search range around a reference area identified by the initial motion vector in a reference picture; perform a template matching search process in the search range to identify a best matching region; determine error values for neighboring pixels to the best matching region; use the error values for the neighboring pixels to perform a model-based fractional-pixel motion vector refinement to derive motion vector difference values; apply at least one of the motion vector difference values to the initial motion vector to determine a refined motion vector for the current block; and decode the current block using the refined motion vector.
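One common way to realize a model-based fractional refinement is to fit a parabola through the matching errors at the best integer position and its immediate neighbors, then take the parabola's minimum as the sub-pixel offset. The abstract does not name the model, so the parabolic fit below is an assumption, as are the function names, the `errors` dictionary layout, and the clamp to half a pixel.

```python
def parabolic_offset(e_minus, e_center, e_plus):
    """Sub-pixel position of the minimum of a parabola through three errors.

    The errors are sampled at positions -1, 0 and +1 along one axis.
    """
    denom = 2.0 * (e_minus + e_plus - 2.0 * e_center)
    if denom <= 0:
        # Flat or non-convex error surface: keep the integer position.
        return 0.0
    offset = (e_minus - e_plus) / denom
    # Assumed safeguard: never move more than half a pixel.
    return max(-0.5, min(0.5, offset))


def refine_mv(initial_mv, best_int_offset, errors):
    """Apply the integer search result plus the modelled fractional offset.

    `errors` holds the matching error at the best integer position
    ("center") and at its four neighbors.
    """
    dx = parabolic_offset(errors["left"], errors["center"], errors["right"])
    dy = parabolic_offset(errors["up"], errors["center"], errors["down"])
    return (initial_mv[0] + best_int_offset[0] + dx,
            initial_mv[1] + best_int_offset[1] + dy)
```

With errors of 26, 2 and 10 at horizontal positions -1, 0 and +1, the fitted minimum lies a quarter pixel to the right of the integer match, so the refinement reaches fractional precision without evaluating any interpolated positions.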

Quantization offsets for dependent quantization in video coding

    Publication number: US20240414339A1

    Publication date: 2024-12-12

    Application number: US18734509

    Filing date: 2024-06-05

    Abstract: A method of processing video data includes determining a quantization level for a coefficient of a current block from a plurality of quantization levels; determining an offset value based on the quantization level, wherein the offset value is a first offset value based on the quantization level being a first quantization level or a second, different offset value based on the quantization level being a second quantization level; determining a quantization parameter or an inverse-quantization parameter for the coefficient based on the determined offset value; and as part of encoding or decoding the current block, performing one of quantization or inverse-quantization for the coefficient based on the determined quantization parameter or the determined inverse-quantization parameter.
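A minimal sketch of level-dependent offsets on the inverse-quantization side is shown below. The abstract does not say how the offset enters the parameter, so this sketch assumes the offset is added to a base quantization parameter (QP) before the step size is derived. The offset table `LEVEL_QP_OFFSETS`, its values, and the function names are hypothetical; the step-size relation, where the step doubles every 6 QP units, is the one used in HEVC-style codecs.

```python
def q_step(qp):
    """Quantizer step size for a given QP (doubles every 6 QP units)."""
    return 2.0 ** ((qp - 4) / 6.0)


# Hypothetical table: QP offset applied when the magnitude of the
# quantization level equals the key. The values are illustrative only.
LEVEL_QP_OFFSETS = {1: -6, 2: 0}


def dequantize(level, base_qp, offsets=LEVEL_QP_OFFSETS, default_offset=0):
    """Reconstruct a coefficient using a level-dependent QP offset."""
    magnitude = abs(level)
    # The offset depends on which quantization level was chosen.
    qp = base_qp + offsets.get(magnitude, default_offset)
    sign = -1 if level < 0 else 1
    return sign * magnitude * q_step(qp)
```

With `base_qp=16`, a level of 2 is reconstructed with the unmodified step size of 4.0, while a level of 1 uses the offset QP of 10 and therefore a step size of 2.0.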

Adaptive video filter
    Type: Invention publication

    Publication number: US20240357095A1

    Publication date: 2024-10-24

    Application number: US18605416

    Filing date: 2024-03-14

    CPC classification number: H04N19/117 H04N19/176 H04N19/82

    Abstract: An example device includes one or more memories and one or more processors coupled to the one or more memories. The one or more processors are configured to determine a first value associated with a first window, the first window including a target block of video data. The one or more processors are configured to determine a respective difference between each sample value within a second window and the first value, the second window including the target block. The one or more processors are configured to determine a second value based on the respective differences. The one or more processors are configured to determine a Laplacian activity value of the target block. The one or more processors are configured to determine a class index based on the second value and the Laplacian activity value and decode the target block based on the class index.
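The classification steps in this abstract can be sketched as below. Several choices are assumptions that the abstract leaves open: the first value is taken as the mean of a window one sample larger than the block on each side, the second window is the block itself, the second value is the sum of absolute differences from that mean, and both measures are bucketed with hypothetical thresholds before being combined into a class index. The Laplacian term follows the usual 1-D second-difference form.

```python
def sample(frame, x, y):
    """Read frame[y][x], clamping coordinates at the frame borders."""
    h, w = len(frame), len(frame[0])
    return frame[max(0, min(h - 1, y))][max(0, min(w - 1, x))]


def window(frame, x0, y0, w, h, margin=0):
    """Samples of the w x h block at (x0, y0), grown by `margin` per side."""
    return [sample(frame, x, y)
            for y in range(y0 - margin, y0 + h + margin)
            for x in range(x0 - margin, x0 + w + margin)]


def laplacian_activity(frame, x0, y0, w, h):
    """Sum of horizontal and vertical 1-D Laplacians over the block."""
    total = 0
    for y in range(y0, y0 + h):
        for x in range(x0, x0 + w):
            c = 2 * sample(frame, x, y)
            total += abs(c - sample(frame, x - 1, y) - sample(frame, x + 1, y))
            total += abs(c - sample(frame, x, y - 1) - sample(frame, x, y + 1))
    return total


def bucket(value, thresholds):
    """Number of thresholds that `value` meets or exceeds."""
    return sum(1 for t in thresholds if value >= t)


def classify_block(frame, x0, y0, w, h, dev_thresholds, act_thresholds):
    first_window = window(frame, x0, y0, w, h, margin=1)
    first_value = sum(first_window) / len(first_window)   # assumed: mean
    second_window = window(frame, x0, y0, w, h)
    second_value = sum(abs(s - first_value) for s in second_window)
    activity = laplacian_activity(frame, x0, y0, w, h)
    # Combine the two bucketed measures into a single class index.
    return (bucket(second_value, dev_thresholds) * (len(act_thresholds) + 1)
            + bucket(activity, act_thresholds))
```

A decoder following this pattern would use the returned index to select one filter from a set, so that flat blocks and busy blocks are filtered differently.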
