SIGNALING ZERO COEFFICIENTS FOR DISPLACEMENT CODING

    Publication No.: US20240355002A1

    Publication Date: 2024-10-24

    Application No.: US18644022

    Filing Date: 2024-04-23

    IPC Classification: G06T9/00 G06T17/20

    CPC Classification: G06T9/001 G06T17/20

    Abstract: An apparatus for mesh decoding is provided. The apparatus includes processing circuitry. The processing circuitry is configured to receive a bitstream that includes displacement information and base mesh information of a mesh in a current mesh frame. The displacement information indicates a plurality of displacements associated with a base mesh and the mesh. The base mesh includes a subset of a plurality of vertices of the mesh. The processing circuitry is configured to determine a last non-zero quantized wavelet coefficient of a plurality of non-zero quantized wavelet coefficients in a coefficient array associated with the plurality of displacements based on index information included in the bitstream. The processing circuitry is configured to reconstruct the mesh based on the plurality of displacements indicated by the plurality of non-zero quantized wavelet coefficients.
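
The idea of locating the last non-zero coefficient from index information can be illustrated with a minimal sketch. It assumes the index is the position of the last non-zero entry in a flat coefficient array and that every entry after it is zero; the function names and the payload layout are hypothetical and are not taken from the application.

```python
from typing import List, Tuple


def encode_last_index(coeffs: List[int]) -> Tuple[int, List[int]]:
    """Return (index of last non-zero coefficient, coefficients up to it).

    An index of -1 is used here (an assumption) to mean "all zero".
    """
    last = -1
    for i, c in enumerate(coeffs):
        if c != 0:
            last = i
    return last, coeffs[: last + 1]


def decode_last_index(last: int, payload: List[int], total: int) -> List[int]:
    """Rebuild the full array: payload first, then implied trailing zeros."""
    if last >= total or len(payload) != last + 1:
        raise ValueError("inconsistent index information")
    return list(payload) + [0] * (total - last - 1)


# Hypothetical quantized wavelet coefficients for one displacement component.
quantized = [5, 0, -2, 0, 1, 0, 0, 0]
last, payload = encode_last_index(quantized)
restored = decode_last_index(last, payload, len(quantized))
```

In this sketch the three trailing zeros are never transmitted; the decoder infers them from the signaled index and the known array length.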

    ADAPTIVE INTEGRATING DUPLICATED VERTICES IN MESH MOTION VECTOR CODING

    Publication No.: US20240282010A1

    Publication Date: 2024-08-22

    Application No.: US18444065

    Filing Date: 2024-02-16

    IPC Classification: G06T9/00

    CPC Classification: G06T9/001

    Abstract: A method and apparatus comprising computer code configured to cause a processor or processors to obtain encoded volumetric data of at least one three-dimensional (3D) visual content, the encoded volumetric data representing a mesh sequence of a plurality of meshes of the 3D visual content; determine a syntax obtained with the encoded volumetric data, the syntax signaling whether to predict vertices of the plurality of meshes as a group; and decode the encoded volumetric data by predicting the vertices as the group based on the syntax. At least two of the vertices of the group are not edge connected, and the syntax is based on an adaptive decision to predict the vertices of the plurality of meshes as the group.

    IBC MERGE MODE WITH A BLOCK VECTOR DIFFERENCE

    Publication No.: US20240236351A1

    Publication Date: 2024-07-11

    Application No.: US18409445

    Filing Date: 2024-01-10

    Abstract: Aspects of the disclosure include methods and an apparatus for video coding. The apparatus includes processing circuitry that receives coded information of a current block in a current picture. The current block is predicted based on a reference block in the current picture indicated by a block vector (BV) to be determined based on a BV predictor (BVP) and a BV difference (BVD). The processing circuitry determines a BVD list including BVD candidates based at least on BVD offsets from the BVP, determines the BVD from the BVD candidates in the BVD list, and reconstructs the current block using the BVD. For each adjacent pair of the BVD offsets, an initial interval size indicates a difference between the adjacent pair of the BVD offsets. Each of the initial interval sizes is different from other initial interval sizes corresponding to other adjacent pairs of the BVD offsets.
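
The constraint on interval sizes can be made concrete with a short sketch. The offset values, the four axis-aligned directions, and all function names below are assumptions chosen for illustration; the abstract only states that the intervals between adjacent offsets all differ and that the BV is obtained from the BVP and a BVD.

```python
from typing import List, Sequence, Tuple

Vec = Tuple[int, int]


def interval_sizes(offsets: Sequence[int]) -> List[int]:
    """Differences between each adjacent pair of offsets."""
    return [b - a for a, b in zip(offsets, offsets[1:])]


def intervals_all_distinct(offsets: Sequence[int]) -> bool:
    """True when no two adjacent pairs share the same interval size."""
    sizes = interval_sizes(offsets)
    return len(set(sizes)) == len(sizes)


def build_bvd_list(
    offsets: Sequence[int],
    directions: Sequence[Vec] = ((1, 0), (-1, 0), (0, 1), (0, -1)),
) -> List[Vec]:
    """One BVD candidate per (offset, direction) pair (assumed layout)."""
    return [(dx * o, dy * o) for o in offsets for dx, dy in directions]


def apply_bvd(bvp: Vec, bvd: Vec) -> Vec:
    """BV = BVP + BVD."""
    return (bvp[0] + bvd[0], bvp[1] + bvd[1])


offsets = [1, 2, 4, 8, 16]           # hypothetical offsets, intervals 1, 2, 4, 8
candidates = build_bvd_list(offsets)
bv = apply_bvd((-16, 0), candidates[9])  # candidates[9] is (-4, 0)
```

A uniformly spaced offset set such as `[1, 2, 3]` would fail `intervals_all_distinct`, because both adjacent pairs have interval size 1.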

    FLIPPING MODE FOR CHROMA AND INTRA TEMPLATE MATCHING

    Publication No.: US20240236324A9

    Publication Date: 2024-07-11

    Application No.: US18382922

    Filing Date: 2023-10-23

    Abstract: A video bitstream comprising coding information of a current block in a current picture is received. The coding information indicates that the current block is coded by a flip mode in which locations of samples of the current block are adjusted within the current block. A reference block is determined from a plurality of candidate reference blocks in a reconstructed region of the current picture for the current block based on template matching (TM) costs. The TM costs indicate differences between a template of the current block and respective templates of the plurality of candidate reference blocks. A reconstruction block of the current block is determined based on the determined reference block. The current block is reconstructed by adjusting locations of samples of the reconstruction block within the reconstruction block based on the flip mode.
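
The two steps named in the abstract, choosing a reference by template matching cost and then flipping the reconstruction block, can be sketched as follows. The sum of absolute differences as the cost measure, the "horizontal" and "vertical" mode names, and the function names are assumptions for illustration, not details from the application.

```python
from typing import List, Sequence

Block = List[List[int]]


def sad(a: Sequence[int], b: Sequence[int]) -> int:
    """Sum of absolute differences between two flattened templates."""
    return sum(abs(x - y) for x, y in zip(a, b))


def best_candidate(cur_template: Sequence[int],
                   cand_templates: Sequence[Sequence[int]]) -> int:
    """Index of the candidate whose template has the lowest TM cost."""
    costs = [sad(cur_template, t) for t in cand_templates]
    return costs.index(min(costs))


def flip_block(block: Block, mode: str) -> Block:
    """Move samples within the block according to the (assumed) flip mode."""
    if mode == "horizontal":
        return [row[::-1] for row in block]      # mirror left <-> right
    if mode == "vertical":
        return [row[:] for row in block[::-1]]   # mirror top <-> bottom
    raise ValueError(f"unknown flip mode: {mode}")


# Hypothetical data: three candidate templates and a 2x2 reconstruction block.
chosen = best_candidate([10, 10, 10], [[0, 0, 0], [9, 10, 12], [20, 20, 20]])
flipped = flip_block([[1, 2], [3, 4]], "horizontal")
```

Applying the same flip twice returns the original block, which is why a decoder can undo an encoder-side flip with the identical operation.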

    FLIPPING MODE FOR CHROMA AND INTRA TEMPLATE MATCHING

    Publication No.: US20240137515A1

    Publication Date: 2024-04-25

    Application No.: US18382922

    Filing Date: 2023-10-22

    Abstract: A video bitstream comprising coding information of a current block in a current picture is received. The coding information indicates that the current block is coded by a flip mode in which locations of samples of the current block are adjusted within the current block. A reference block is determined from a plurality of candidate reference blocks in a reconstructed region of the current picture for the current block based on template matching (TM) costs. The TM costs indicate differences between a template of the current block and respective templates of the plurality of candidate reference blocks. A reconstruction block of the current block is determined based on the determined reference block. The current block is reconstructed by adjusting locations of samples of the reconstruction block within the reconstruction block based on the flip mode.

    MOTION VECTOR DERIVATION OF SUBBLOCK-BASED TEMPLATE-MATCHING FOR SUBBLOCK BASED MOTION VECTOR PREDICTOR

    Publication No.: US20240129479A1

    Publication Date: 2024-04-18

    Application No.: US18241084

    Filing Date: 2023-08-31

    Abstract: A video bitstream is received. The video bitstream includes a current block comprising a plurality of subblocks and a template region of the current block comprising a plurality of template subblocks adjacent to at least one of a top side and a left side of the current block. A motion vector (MV) located in a center position of the current block is determined. The MV is determined based on at least one MV of the plurality of subblocks of the current block. An MV for each of the plurality of template subblocks is determined based on the MV located in the center position of the current block and a respective MV of a corresponding subblock of the plurality of subblocks that is adjacent to the respective template subblock. The current block is reconstructed based on the determined MVs for the plurality of template subblocks.

    REGION OF INTEREST CODING FOR VCM

    Publication No.: US20240121408A1

    Publication Date: 2024-04-11

    Application No.: US18477189

    Filing Date: 2023-09-28

    Abstract: A technique for encoding video for machine vision and human/machine hybrid vision includes receiving image data. The technique may also include detecting a plurality of bounding boxes associated with a plurality of objects of interest in a frame of the image data and detecting a frame-level bounding box for the frame based on coordinates of the plurality of bounding boxes. Then, the technique may include encoding the frame-level bounding box using a first bitrate.
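
One plausible reading of "a frame-level bounding box based on coordinates of the plurality of bounding boxes" is the smallest rectangle that encloses every object box. The sketch below assumes that reading and an `(x0, y0, x1, y1)` corner format; both the function name and the box format are hypothetical.

```python
from typing import Iterable, Tuple

Box = Tuple[int, int, int, int]  # assumed format: (x0, y0, x1, y1)


def frame_level_bbox(boxes: Iterable[Box]) -> Box:
    """Smallest axis-aligned box enclosing all object boxes (an assumption)."""
    boxes = list(boxes)
    if not boxes:
        raise ValueError("at least one bounding box is required")
    return (
        min(b[0] for b in boxes),
        min(b[1] for b in boxes),
        max(b[2] for b in boxes),
        max(b[3] for b in boxes),
    )


# Two hypothetical detections in one frame.
roi = frame_level_bbox([(10, 20, 50, 60), (40, 5, 90, 30)])
```

An encoder following the abstract could then spend the first bitrate on the region `roi`; how the remainder of the frame is coded is not stated in the abstract.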

    DIRECTIONAL NEAREST NEIGHBOR PREDICTION MODE

    Publication No.: US20240121404A1

    Publication Date: 2024-04-11

    Application No.: US18377277

    Filing Date: 2023-10-05

    Abstract: Aspects of the disclosure include methods and apparatuses for video coding. One of the apparatuses includes processing circuitry that receives a bitstream of a current block in a current picture. The current block is coded with a directional nearest neighbor prediction (DNNP) mode. The processing circuitry selects a prediction value for a sample in the current block from a top-left value, a top value, or a left value based on one or more difference values between respective paired values of (i) the top-left value associated with a top-left reference sample that is a top-left neighbor of the current block, (ii) the top value associated with a top reference sample that is a top neighbor of the sample in the current block, and (iii) the left value associated with a left reference sample that is a left neighbor of the sample in the current block. The processing circuitry reconstructs the current block using the selected prediction value for the sample in the current block.
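
The abstract does not give the exact selection rule, so the following is only a sketch of how pairwise differences could drive the choice among the three values. The rule used here (copy from the left when the top-left and top values are closest, copy from the top when the top-left and left values are closest, fall back to the top-left value on a tie) is an assumption, as is the function name.

```python
def select_dnnp_prediction(top_left: int, top: int, left: int) -> int:
    """Pick one of the three reference values using pairwise differences.

    Assumed rule, not taken from the application:
      - small |top_left - top|  -> values are stable horizontally -> use left
      - small |top_left - left| -> values are stable vertically   -> use top
      - tie                     -> use top_left
    """
    d_horizontal = abs(top_left - top)
    d_vertical = abs(top_left - left)
    if d_horizontal < d_vertical:
        return left
    if d_vertical < d_horizontal:
        return top
    return top_left


# Flat top row (100, 100) next to a darker left sample: predict from the left.
pred = select_dnnp_prediction(top_left=100, top=100, left=50)
```

The selected value is always one of the reference values itself rather than a blend of them.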

    DISPLACEMENT CODING FOR MESH COMPRESSION

    Publication No.: US20240089499A1

    Publication Date: 2024-03-14

    Application No.: US18314986

    Filing Date: 2023-05-10

    Abstract: A method and apparatus comprising computer code configured to cause a processor or processors to obtain volumetric data of at least one three-dimensional (3D) visual content; derive a mesh from a frame of the volumetric data, the mesh including a plurality of base mesh vertices; determine a displacement of at least one vertex that is not one of the base mesh vertices, based on a series of projections from at least one of the plurality of base mesh vertices that neighbors the at least one vertex; predict the at least one vertex based at least on the determined displacement; and encode the volumetric data based on the predicted at least one vertex.