- 专利标题: Deep learning driven multi-channel filtering for speech enhancement
-
申请号: US15830955申请日: 2017-12-04
-
公开(公告)号: US10546593B2公开(公告)日: 2020-01-28
- 发明人: Jason Wung , Mehrez Souden , Ramin Pishehvar , Joshua D. Atkins
- 申请人: Apple Inc.
- 申请人地址: US CA Cupertino
- 专利权人: APPLE INC.
- 当前专利权人: APPLE INC.
- 当前专利权人地址: US CA Cupertino
- 代理机构: Womble Bond Dickinson (US) LLP
- 主分类号: G10L21/00
- IPC分类号: G10L21/00 ; G10L19/00 ; G10L21/02 ; G10L15/02 ; G10L21/0232 ; G10L25/30 ; H04R1/40 ; G10L25/03 ; G10L21/0208
摘要:
A number of features are extracted from a current frame of a multi-channel speech pickup and from side information that is a linear echo estimate, a diffuse signal component, or a noise estimate of the multi-channel speech pickup. A DNN-based speech presence probability is produced for the current frame, where the SPP value is produced in response to the extracted features being input to the DNN. The DNN-based SPP value is applied to configure a multi-channel filter whose input is the multi-channel speech pickup and whose output is a single audio signal. In one aspect, the system is designed to run online, at low enough latency for real time applications such voice trigger detection. Other aspects are also described and claimed.
公开/授权文献
信息查询