Invention Publication
- Patent Title: METHODS AND NON-TRANSITORY COMPUTER READABLE STORAGE MEDIUM FOR SPATIAL RESAMPLING TOWARDS MACHINE VISION
-
Application No.: US18618551Application Date: 2024-03-27
-
Publication No.: US20240357118A1Publication Date: 2024-10-24
- Inventor: Shurun WANG , Yan YE
- Applicant: Alibaba Innovation Private Limited
- Applicant Address: SG Singapore
- Assignee: Alibaba Innovation Private Limited
- Current Assignee: Alibaba Innovation Private Limited
- Current Assignee Address: SG Singapore
- Main IPC: H04N19/132
- IPC: H04N19/132 ; H04N19/172 ; H04N19/186 ; H04N19/436

Abstract:
A method of encoding a video sequence into a bitstream. The method includes receiving a video sequence; performing a plurality of convolutions on an input image data of the video sequence in YUV format; wherein performing the plurality of convolutions includes performing a first stage convolution on the input image data, wherein the first stage convolution comprises a first convolution and a second convolution that are provided in parallel; performing a second stage convolution on a channel-wise concatenation result of an output of the first convolution and an output of the second convolution; performing a third stage convolution on an output of the second stage convolution; and obtaining an output image data based on an output of the third stage convolution; and encoding the output image data for generating the bitstream.
Information query