ACOUSTIC ENVIRONMENT PROFILE ESTIMATION

发明公开

US20240005908A1 ACOUSTIC ENVIRONMENT PROFILE ESTIMATION 审中-公开

请登陆查看更多内容

专利标题： ACOUSTIC ENVIRONMENT PROFILE ESTIMATION
申请号： US18058266

申请日： 2022-11-22
公开(公告)号： US20240005908A1

公开(公告)日： 2024-01-04
发明人: Dushyant SHARMA , Patrick Aubrey NAYLOR , Ge LI
申请人： Nuance Communications, Inc.
申请人地址： US MA Burlington
专利权人： Nuance Communications, Inc.
当前专利权人： Nuance Communications, Inc.
当前专利权人地址： US MA Burlington
主分类号： G10L15/02
IPC分类号： G10L15/02 ; G10L25/18

摘要：

An acoustic environment profile estimation is provided for automatic speech recognition (ASR) to compensate for the acoustic behavior of an environment in which audio is collected. Examples receive an audio signal and extract spectral features and modulation features. Extracting spectral features involves determining Mel filter bank (MFB) coefficients, and extracting modulation features involves applying Fourier transforms. The spectral features and modulation features are combined, and an acoustic environment profile estimate is extracted and provided as an input to the ASR. In some examples, the acoustic environment profile estimate is realized as acoustic environment parameters, whereas in some other examples, the acoustic environment profile estimate is realized as an acoustic embedding vector. For versions using acoustic environment parameters, when the acoustic environment changes significantly, such as flooring changes and/or speakers or microphones changing position, a new set of acoustic environment parameters is determined.

信息查询

Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/02	.语音识别的特征提取；识别单位的选择