Perceptual audio coding as sequential decision-making problems

发明授权

US10580424B2 Perceptual audio coding as sequential decision-making problems 有权

请登陆查看更多内容

专利标题： Perceptual audio coding as sequential decision-making problems
申请号： US16050966

申请日： 2018-07-31
公开(公告)号： US10580424B2

公开(公告)日： 2020-03-03
发明人: Taher Shahbazi Mirzahasanloo
申请人： QUALCOMM Incorporated
申请人地址： US CA San Diego
专利权人： Qualcomm Incorporated
当前专利权人： Qualcomm Incorporated
当前专利权人地址： US CA San Diego
代理机构： Shumaker & Sieffert, P.A.
主分类号： G10L19/00
IPC分类号： G10L19/00 ; G10L19/038 ; G10L19/22 ; H04R3/00 ; H04W84/12 ; G06N20/00

摘要：

In general, techniques are described by which to perform perceptual audio coding as sequential decision making problems. A source device comprising a memory and a processor may be configured to perform the techniques. The memory may store at least a portion of the audio data. The processor may apply a filter to the audio data to obtain subbands of the audio data. The processor may adapt a controller according to a machine learning algorithm, the controller configured to determine bit distributions across the subbands of the audio data. The processor may specify, based on the bit distributions and in a bitstream representative of the audio data, one or more indications representative of the subbands of the audio data, and output the bitstream via a wireless connection in accordance with a wireless communication protocol.

公开/授权文献

US20190371348A1 PERCEPTUAL AUDIO CODING AS SEQUENTIAL DECISION-MAKING PROBLEMS 公开/授权日：2019-12-05

信息查询

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L19/00	用于冗余度下降情形（例如在声码器中）的语音或音频信号分析-合成技术；语音或音频信号编码或解码，采用源滤波器模型或心理声学分析（乐器中的入G10H）