- 专利标题: Perceptual audio coding as sequential decision-making problems
-
申请号: US16050966申请日: 2018-07-31
-
公开(公告)号: US10580424B2公开(公告)日: 2020-03-03
- 发明人: Taher Shahbazi Mirzahasanloo
- 申请人: QUALCOMM Incorporated
- 申请人地址: US CA San Diego
- 专利权人: Qualcomm Incorporated
- 当前专利权人: Qualcomm Incorporated
- 当前专利权人地址: US CA San Diego
- 代理机构: Shumaker & Sieffert, P.A.
- 主分类号: G10L19/00
- IPC分类号: G10L19/00 ; G10L19/038 ; G10L19/22 ; H04R3/00 ; H04W84/12 ; G06N20/00
摘要:
In general, techniques are described by which to perform perceptual audio coding as sequential decision making problems. A source device comprising a memory and a processor may be configured to perform the techniques. The memory may store at least a portion of the audio data. The processor may apply a filter to the audio data to obtain subbands of the audio data. The processor may adapt a controller according to a machine learning algorithm, the controller configured to determine bit distributions across the subbands of the audio data. The processor may specify, based on the bit distributions and in a bitstream representative of the audio data, one or more indications representative of the subbands of the audio data, and output the bitstream via a wireless connection in accordance with a wireless communication protocol.
公开/授权文献
信息查询