- 专利标题: Microphone array based deep learning for time-domain speech signal extraction
-
申请号: US17100802申请日: 2020-11-20
-
公开(公告)号: US11508388B1公开(公告)日: 2022-11-22
- 发明人: Mehrez Souden , Symeon Delikaris Manias , Joshua D. Atkins , Ante Jukic , Ramin Pishehvar
- 申请人: Apple Inc.
- 申请人地址: US CA Cupertino
- 专利权人: Apple Inc.
- 当前专利权人: Apple Inc.
- 当前专利权人地址: US CA Cupertino
- 代理机构: BakerHostetler
- 主分类号: G10L21/0232
- IPC分类号: G10L21/0232 ; H04R1/40 ; G10L25/30 ; G06N3/08 ; H04R3/00 ; G10L21/0216
摘要:
A device for processing audio signals in a time-domain includes a processor configured to receive multiple audio signals corresponding to respective microphones of at least two or more microphones of the device, at least one of the multiple audio signals comprising speech of a user of the device. The processor is configured to provide the multiple audio signals to a machine learning model, the machine learning model having been trained based at least in part on an expected position of the user of the device and expected positions of the respective microphones on the device. The processor is configured to provide an audio signal that is enhanced with respect to the speech of the user relative to the multiple audio signals, wherein the audio signal is a waveform output from the machine learning model.
信息查询