- 专利标题: ACOUSTIC ENVIRONMENT PROFILE ESTIMATION
-
申请号: US18058266申请日: 2022-11-22
-
公开(公告)号: US20240005908A1公开(公告)日: 2024-01-04
- 发明人: Dushyant SHARMA , Patrick Aubrey NAYLOR , Ge LI
- 申请人: Nuance Communications, Inc.
- 申请人地址: US MA Burlington
- 专利权人: Nuance Communications, Inc.
- 当前专利权人: Nuance Communications, Inc.
- 当前专利权人地址: US MA Burlington
- 主分类号: G10L15/02
- IPC分类号: G10L15/02 ; G10L25/18
摘要:
An acoustic environment profile estimation is provided for automatic speech recognition (ASR) to compensate for the acoustic behavior of an environment in which audio is collected. Examples receive an audio signal and extract spectral features and modulation features. Extracting spectral features involves determining Mel filter bank (MFB) coefficients, and extracting modulation features involves applying Fourier transforms. The spectral features and modulation features are combined, and an acoustic environment profile estimate is extracted and provided as an input to the ASR. In some examples, the acoustic environment profile estimate is realized as acoustic environment parameters, whereas in some other examples, the acoustic environment profile estimate is realized as an acoustic embedding vector. For versions using acoustic environment parameters, when the acoustic environment changes significantly, such as flooring changes and/or speakers or microphones changing position, a new set of acoustic environment parameters is determined.
信息查询