-
公开(公告)号:US20240357179A1
公开(公告)日:2024-10-24
申请号:US18624636
申请日:2024-04-02
Applicant: Alibaba Innovation Private Limited
Inventor: Jie CHEN , Yan YE , Shurun WANG
Abstract: Methods and apparatuses are provided for processing video data by using an object mask information (OMI) supplemental enhancement information (SEI) message. An exemplary encoding method includes: receiving a video sequence; and encoding one or more pictures of the video sequence to generate a bitstream, comprising: encoding an auxiliary picture indicating a mask of an object in a primary picture, the mask of the object being represented by a sample value of the auxiliary picture; and generating a supplemental enhancement information (SEI) message indicating an attribute of the mask of the object.
-
2.
公开(公告)号:US20240357118A1
公开(公告)日:2024-10-24
申请号:US18618551
申请日:2024-03-27
Applicant: Alibaba Innovation Private Limited
Inventor: Shurun WANG , Yan YE
IPC: H04N19/132 , H04N19/172 , H04N19/186 , H04N19/436
CPC classification number: H04N19/132 , H04N19/172 , H04N19/186 , H04N19/436
Abstract: A method of encoding a video sequence into a bitstream. The method includes receiving a video sequence; performing a plurality of convolutions on an input image data of the video sequence in YUV format; wherein performing the plurality of convolutions includes performing a first stage convolution on the input image data, wherein the first stage convolution comprises a first convolution and a second convolution that are provided in parallel; performing a second stage convolution on a channel-wise concatenation result of an output of the first convolution and an output of the second convolution; performing a third stage convolution on an output of the second stage convolution; and obtaining an output image data based on an output of the third stage convolution; and encoding the output image data for generating the bitstream.
-
3.
公开(公告)号:US20240340415A1
公开(公告)日:2024-10-10
申请号:US18625679
申请日:2024-04-03
Applicant: Alibaba Innovation Private Limited
Inventor: Shengyang XU , Jianhua CHEN , Yan YE
IPC: H04N19/117 , H04N19/137 , H04N19/156 , H04N19/172 , H04N19/176 , H04N19/177 , H04N19/80
CPC classification number: H04N19/117 , H04N19/137 , H04N19/156 , H04N19/172 , H04N19/176 , H04N19/177 , H04N19/80
Abstract: A time-domain filtering method is provided, including: determining a to-be-filtered image frame in a group of pictures; extracting a relative motion feature of the to-be-filtered image frame, where the relative motion feature represents relative motion complexity between image contents of the to-be-filtered image frame and image contents of the remaining image frames in the group of pictures; and determining a target filtering magnitude corresponding to the relative motion feature, and performing time-domain filtering on the to-be-filtered image frame by using the target filtering magnitude.
-
公开(公告)号:US20240357122A1
公开(公告)日:2024-10-24
申请号:US18628086
申请日:2024-04-05
Applicant: Alibaba Innovation Private Limited
Inventor: Jie CHEN , Ru-Ling LIAO , Xinwei LI , Yan YE
IPC: H04N19/137 , H04N19/176 , H04N19/192
CPC classification number: H04N19/137 , H04N19/176 , H04N19/192
Abstract: Methods and apparatuses are provided for optical flow-based motion refinement. An exemplary method includes: dividing a coding block into a first set of subblocks and a second set of subblocks; performing a first pass of optical flow-based motion vector refinement on the first set of subblocks; and performing a second pass of optical flow-based motion vector refinement on the second set of subblocks.
-
公开(公告)号:US20240348782A1
公开(公告)日:2024-10-17
申请号:US18628723
申请日:2024-04-06
Applicant: Alibaba Innovation Private Limited
Inventor: Ru-Ling LIAO , Yan YE , Jie CHEN , Xinwei LI
IPC: H04N19/13 , H04N19/174 , H04N19/91
CPC classification number: H04N19/13 , H04N19/174 , H04N19/91
Abstract: Methods and apparatuses are provided for initializing a set of context model probability for a slice in context-based adaptive binary arithmetic coding (CABAC). An exemplary video decoding method includes: selecting, from a plurality of predefined sets of probability parameters, a first set of probability parameters for initiating one or more context models for a B-slice; and performing entropy decoding of the B-slice based on the one or more context models and the first set of probability parameters, wherein the selecting is based on a coding condition of the B-slice or a signal in the bitstream.
-
公开(公告)号:US20240340456A1
公开(公告)日:2024-10-10
申请号:US18622621
申请日:2024-03-29
Applicant: Alibaba Innovation Private Limited
Inventor: Bolin CHEN , Jie CHEN , Yan YE , Shiqi WANG
IPC: H04N19/70 , H04N19/136 , H04N19/157 , H04N19/172 , H04N19/42 , H04N19/463 , H04N19/80
CPC classification number: H04N19/70 , H04N19/136 , H04N19/157 , H04N19/172 , H04N19/42 , H04N19/463 , H04N19/80
Abstract: Methods and apparatuses are provided for processing video data by using generative face video supplemental enhancement information (SEI) messages. An exemplary method for generating a face picture includes: receiving a bitstream; decoding coded information of the bitstream to obtain a base picture and a supplemental enhancement information (SEI) message; determining whether the SEI message applies to a neural network for generating a face picture; in response to the SEI message applies to the neural network for generating the face picture, determining a mode and a corresponding face information parameter used to code the face picture based on the SEI message; and generating the face picture based on the base picture and the face information parameter by the neural network.
-
公开(公告)号:US20250024024A1
公开(公告)日:2025-01-16
申请号:US18902031
申请日:2024-09-30
Applicant: Alibaba Innovation Private Limited
Inventor: Ru-Ling LIAO , Jie CHEN , Yan YE , Xinwei LI
IPC: H04N19/105 , H04N19/139 , H04N19/172
Abstract: A VVC-standard encoder and a VVC-standard decoder implement improvements over VVC and ECM in a number of regards: a temporal motion vector prediction candidate selection method utilizing relocation of a collocated CTU; a temporal motion vector prediction candidate selection method utilizing expanded selection range; a temporal motion vector prediction candidate selection method utilizing unconditional derivation of a scaled motion vector; a temporal motion vector prediction candidate selection method utilizing omission of scaling uni-predicted motion vectors to bi-predicted motion vectors; a temporal motion vector prediction candidate selection method utilizing multiple options in setting a reference picture index; a temporal motion vector prediction candidate selection method utilizing scaling factor offsetting; a merge candidate list building method omitting a temporal motion vector prediction candidate; and a picture reconstruction method utilizing motion information refinement.
-
8.
公开(公告)号:US20240348816A1
公开(公告)日:2024-10-17
申请号:US18628002
申请日:2024-04-05
Applicant: Alibaba Innovation Private Limited
Inventor: Jie CHEN , Yan YE , Bolin CHEN
IPC: H04N19/463 , H04N19/136 , H04N19/169 , H04N19/172 , H04N19/184
CPC classification number: H04N19/463 , H04N19/136 , H04N19/172 , H04N19/184 , H04N19/188
Abstract: A method of decoding a bitstream to get one or more pictures for a video stream includes: receiving a bitstream; and decoding the bitstream to get the one or more pictures. The decoding includes: decoding a picture unit comprising one or more supplemental enhancement information (SEI) messages; and generating the one or more pictures based on a key picture and the one or more SEI messages, respectively.
-
-
-
-
-
-
-