METHOD AND APPARATUS FOR ENCODING OR DECODING A PICTURE USING A NEURAL NETWORK COMPRISING SUB-NETWORKS

    公开(公告)号:US20240013446A1

    公开(公告)日:2024-01-11

    申请号:US18338105

    申请日:2023-06-20

    CPC classification number: G06T9/002 G06T3/4046

    Abstract: A method for encoding a picture and decoding a bitstream that represents a picture using a neural network (NN) that comprises a plurality of sub-networks is provided. The method includes applying, before processing an input with the at least one sub-network comprising at least two downsampling layers, a rescaling to the input, wherein the rescaling comprises changing the size S1 in the at least one dimension to be S1 so that S1 is an integer multiple of a combined downsampling ratio Rk of the at least one sub-network, after the rescaling, processing the input by the at least one sub-network comprising at least two downsampling layers and providing an output with the size S2, wherein S2 is smaller than S1, and providing, after processing the picture using the NN, a bitstream as output.

Patent Agency Ranking