- 专利标题: Deep neural network based speech enhancement
-
申请号: US16442279申请日: 2019-06-14
-
公开(公告)号: US11756564B2公开(公告)日: 2023-09-12
- 发明人: Ganesh Sivaraman , Elie Khoury
- 申请人: PINDROP SECURITY, INC.
- 申请人地址: US GA Atlanta
- 专利权人: PINDROP SECURITY, INC.
- 当前专利权人: PINDROP SECURITY, INC.
- 当前专利权人地址: US GA Atlanta
- 代理机构: FOLEY & LARDNER LLP
- 主分类号: G10L21/0232
- IPC分类号: G10L21/0232 ; G10L25/30 ; G06N3/048
摘要:
A computer may segment a noisy audio signal into audio frames and execute a deep neural network (DNN) to estimate an instantaneous function of clean speech spectrum and noisy audio spectrum in the audio frame. This instantaneous function may correspond to a ratio of an a-priori signal to noise ratio (SNR) and an a-posteriori SNR of the audio frame. The computer may add estimated instantaneous function to the original noisy audio frame to output an enhanced speech audio frame.
公开/授权文献
- US20190385630A1 DEEP NEURAL NETWORK BASED SPEECH ENHANCEMENT 公开/授权日:2019-12-19
信息查询