SIGNALING OF PICTURE-IN-PICTURE IN MEDIA FILES

    公开(公告)号:US20250071332A1

    公开(公告)日:2025-02-27

    申请号:US18942138

    申请日:2024-11-08

    Applicant: Bytedance Inc.

    Inventor: Ye-Kui Wang

    Abstract: A mechanism for processing video data is disclosed. A type for a value taken by a region identifier (ID) in a region identifier (ID) type field (region_id_type) is determined. The type for the value taken by the region ID is coded in the region ID type field using four or less bits. A conversion between the visual media data and a media data file is performed based on the type for the value taken by the region ID in the region ID type field.

    DETERMINATION METHOD FOR CHROMA INTRA PREDICTION MODE AND IMAGE ENCODING DEVICE

    公开(公告)号:US20250071270A1

    公开(公告)日:2025-02-27

    申请号:US18791861

    申请日:2024-08-01

    Abstract: A determination method for a chroma intra prediction mode includes: performing a simple RDO calculation on a luminance value of each pixel of an image block to obtain multiple luminance candidate modes; determining multiple chroma candidate modes according to one of the luminance candidate modes and a chroma simple RDO result calculated by performing a simple RDO calculation on a chroma value of each pixel of the image block; performing a full RDO calculation of the multiple luminance candidate modes on the luminance value of each pixel in the image block to select a luminance target mode; performing a full RDO calculation of the multiple chroma candidate modes on the chroma value of each pixel of the image block to a obtain chroma full RDO result; and determining a chroma target mode according to the luminance target mode and the chroma full RDO result.

    Signaling of gradual decoding refresh and reference picture lists

    公开(公告)号:US12231653B2

    公开(公告)日:2025-02-18

    申请号:US18517328

    申请日:2023-11-22

    Applicant: Bytedance Inc.

    Abstract: Examples of video encoding methods and apparatus and video decoding methods and apparatus are described. An example method of video processing includes performing a conversion between a current picture of a video and a bitstream of the video according to a rule. The rule specifies that responsive to a picture being referred to by an inter-layer reference picture (ILRP) entry in a reference picture list of a slice of the current picture, the picture is allowed to have a gradual decoding refresh (GDR) type and a syntax element specifying a recovery point of the picture in an output order is 0.

    Inter prediction method, encoder, decoder and storage medium

    公开(公告)号:US12231649B2

    公开(公告)日:2025-02-18

    申请号:US17958420

    申请日:2022-10-02

    Abstract: Disclosed are an inter prediction method, encoder/decoder, and storage medium. The method includes: determining a prediction mode parameter of a current block; when the parameter indicates that a GPM is used for determining an inter prediction value of the current block, determining an angle and a distance corresponding to a dividing line in the current block, setting an angle index value and a distance index value to index serial numbers corresponding to the angle and the distance in a preset mapping table respectively; determining a value of shifting direction indicator of the current block, which is used for indicating shifting directions of different dividing lines of the current block at the angle, by using a preset model based on size information and the angle index value of the current block; performing inter prediction on the current block based on the value of shifting direction indicator and the distance index value.

    METHODS AND DEVICES FOR PROGRESSIVE ENCODING AND DECODING OF MULTIPLANE IMAGES

    公开(公告)号:US20250056016A1

    公开(公告)日:2025-02-13

    申请号:US18719969

    申请日:2022-12-07

    Abstract: Methods and devices for encoding, decoding and transmitting a three-dimensional scene initially represented as a multiplane image (MPI) are provided. Each layer of the MPI is split into patches based on the transparency component. Patches of a layer are grouped in a tile. The greater the depth of the layer, the greater the identifying number of the tile. When several tiles are packed in an atlas image, the same monotonic (i.e. ascending or descending) function according to depth applies to atlas numbers. At the decoding side, the current viewport to render is initially cleared and each decoded tile is sequentially blended over from the nearest one to the furthest due to the numbering of the set of atlases and tiles. Pixels of a patch under rendering are projected onto pixels of the viewport image according to the depth of the tile comprising the patch and metadata indicating the position of the patch in the layer of the MPI the patch has been clustered from.

    Coding/decoding video with reference boundary correction

    公开(公告)号:US12225229B2

    公开(公告)日:2025-02-11

    申请号:US17552397

    申请日:2021-12-16

    Abstract: A picture coding device includes: a block vector candidate derivation unit that derives block vector candidates of a coding target block in a coding target picture from coding information stored in a coding information storage memory; a selector that selects a selected block vector from the block vector candidates; a storage that stores coded pictures of a predetermined number of intra block copy standard blocks immediately before the coding target block; and a reference region boundary correction unit that removes a coded picture of one intra block copy standard block in the storage from a referenceable region after completion of a coding process of the coding target block, and determines whether an upper left position and a lower right position of a reference block indicated by the selected block vector are both included in the referenceable region.

    FILTER SHAPE SWITCHING
    10.
    发明申请

    公开(公告)号:US20250047907A1

    公开(公告)日:2025-02-06

    申请号:US18922219

    申请日:2024-10-21

    Abstract: Aspects of the disclosure provide methods and apparatuses for video encoding/decoding. An example method of video decoding includes receiving a video bitstream and reconstructing a first sample of the video bitstream using a non-linear mapping-based filter with a first filter shape configuration of a plurality of filter shape configurations. The plurality of filter shape configurations includes at least two filter shape configurations that are based on a same geometric shape and include a same number of filter taps, and the filter taps of the at least two filter shape configurations are located at different positions. The method further includes selecting a second filter shape configuration of the plurality of filter shape configurations, and reconstructing a second sample in the video based on the non-linear mapping-based filter with the second filter shape configuration.

Patent Agency Ranking