Microphone array based deep learning for time-domain speech signal extraction
Abstract:
A device for processing audio signals in the time domain includes a processor configured to receive multiple audio signals corresponding to respective microphones of at least two microphones of the device, at least one of the multiple audio signals comprising speech of a user of the device. The processor is configured to provide the multiple audio signals to a machine learning model, the machine learning model having been trained based at least in part on an expected position of the user of the device and expected positions of the respective microphones on the device. The processor is configured to provide an audio signal that is enhanced with respect to the speech of the user relative to the multiple audio signals, wherein the audio signal is a waveform output from the machine learning model.
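To illustrate the signal flow described above, the following is a minimal sketch in NumPy. It stands in for the trained machine learning model with a fixed filter-and-sum combiner: each microphone channel is filtered and the results are summed into a single enhanced waveform. In the described device, these filter coefficients would instead be the weights of a network trained for the expected user and microphone positions; the function name, filter values, and two-microphone geometry here are illustrative assumptions, not the patented implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

def filter_and_sum(mic_signals, fir_filters):
    """Combine multi-microphone signals into one enhanced waveform.

    mic_signals: (n_mics, n_samples) array, one row per microphone.
    fir_filters: (n_mics, n_taps) array of per-channel FIR filters;
    a stand-in for the trained time-domain model, whose learned
    weights would encode the expected user/microphone geometry.
    """
    n_mics, n_samples = mic_signals.shape
    out = np.zeros(n_samples)
    for m in range(n_mics):
        # Filter each channel, then sum (truncate convolution tail).
        out += np.convolve(mic_signals[m], fir_filters[m], mode="full")[:n_samples]
    return out

# Two microphones observing the same speech, the second with a
# one-sample propagation delay, each with independent noise.
speech = rng.standard_normal(1000)
mics = np.stack([
    speech + 0.1 * rng.standard_normal(1000),
    np.roll(speech, 1) + 0.1 * rng.standard_normal(1000),
])

# Filters that align the channels (delay mic 0 by one sample to
# match mic 1) and average them, halving the noise power.
filters = np.array([[0.0, 0.5],
                    [0.5, 0.0]])
enhanced = filter_and_sum(mics, filters)
```

Averaging the aligned channels reduces the uncorrelated noise while preserving the speech, which is the same goal the trained model pursues with a far more expressive, data-driven mapping.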