Machine learning based rate-distortion optimizer for video compression

Invention Grant

US11496746B2 Machine learning based rate-distortion optimizer for video compression 有权

Please log in to see more content

Patent Title: Machine learning based rate-distortion optimizer for video compression
Application No.: US17165680

Application Date: 2021-02-02
Publication No.: US11496746B2

Publication Date: 2022-11-08
Inventor: Mahant Siddaramanna , Naveen Srinivasamurthy , Apoorva Nagarajan , Prasant Shekhar Singh , Pawan Kumar Baheti , Narendranath Malayath
Applicant: QUALCOMM Incorporated
Applicant Address: US CA San Diego
Assignee: QUALCOMM Incorporated
Current Assignee: QUALCOMM Incorporated
Current Assignee Address: US CA San Diego
Agency: Polsinelli LLP
Main IPC: H04N19/147
IPC: H04N19/147 ; H04N19/159 ; H04N19/19 ; G06N3/04 ; H04N19/176

Machine learning based rate-distortion optimizer for video compression

Abstract:

Systems and techniques are described for data encoding using a machine learning approach to generate a distortion prediction {circumflex over (D)} and a predicted bit rate {circumflex over (R)}, and to use {circumflex over (D)} and {circumflex over (R)} to perform rate-distortion optimization (RDO). For example, a video encoder can generate the distortion prediction {circumflex over (D)} and the bit rate residual prediction based on outputs of the one or more neural networks in response to the one or more neural networks receiving a residual portion of a block of a video frame as input. The video encoder can determine bit rate metadata prediction based on metadata associated with a mode of compression, and determine {circumflex over (R)} to be the sum of and . The video encoder can determine a rate-distortion cost prediction Ĵ as a function of {circumflex over (D)} and {circumflex over (R)}, and can determine a prediction mode for compressing the block based on Ĵ.

Public/Granted literature

US20220256169A1 MACHINE LEARNING BASED RATE-DISTORTION OPTIMIZER FOR VIDEO COMPRESSION Public/Granted day:2022-08-11

Information query

Espacenet

IPC分类:

H	电学
H04	电通信技术
H04N	图像通信，如电视
H04N19/00	用于数字视频信号编码，解码，压缩或解压缩的方法或装置
H04N19/10	.使用自适应编码
H04N19/134	..以影响或控制自适应编码的元素，参数或标准为特征的
H04N19/146	...在编码器输出的数据率或编码量
H04N19/147	....根据率失真准则（用于运动估计的率失真准则入H04N 19/567）