ACOUSTIC ENVIRONMENT PROFILE ESTIMATION
    2.
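    Invention publication

    Publication (announcement) No.: US20240005908A1

    Publication (announcement) date: 2024-01-04

    Application No.: US18058266

    Application date: 2022-11-22

    IPC classification: G10L15/02 G10L25/18

    CPC classification: G10L15/02 G10L25/18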

    Abstract: An acoustic environment profile estimation is provided for automatic speech recognition (ASR) to compensate for the acoustic behavior of an environment in which audio is collected. Examples receive an audio signal and extract spectral features and modulation features. Extracting spectral features involves determining Mel filter bank (MFB) coefficients, and extracting modulation features involves applying Fourier transforms. The spectral features and modulation features are combined, and an acoustic environment profile estimate is extracted and provided as an input to the ASR. In some examples, the acoustic environment profile estimate is realized as acoustic environment parameters, whereas in some other examples, the acoustic environment profile estimate is realized as an acoustic embedding vector. For versions using acoustic environment parameters, when the acoustic environment changes significantly, such as flooring changes and/or speakers or microphones changing position, a new set of acoustic environment parameters is determined.
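
    The feature-extraction steps named in the abstract (MFB coefficients, Fourier-transform-based modulation features, and their combination) can be illustrated with a minimal NumPy sketch. This is not the method claimed in the publication: the frame length, hop size, filter count, number of modulation bins, the averaging of MFB coefficients over time, and all function names are assumptions chosen for illustration, and the step that maps the combined features to acoustic environment parameters or an embedding vector is omitted because the abstract does not describe it.

```python
import numpy as np


def hz_to_mel(f):
    """Convert frequency in Hz to the Mel scale (common 2595*log10 formula)."""
    return 2595.0 * np.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_filter_bank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the Mel scale.

    Returns an array of shape (n_filters, n_fft // 2 + 1).
    """
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0),
                             n_filters + 2)
    hz_points = mel_to_hz(mel_points)
    freqs = np.linspace(0.0, sample_rate / 2.0, n_fft // 2 + 1)
    bank = np.zeros((n_filters, freqs.size))
    for i in range(n_filters):
        left, center, right = hz_points[i], hz_points[i + 1], hz_points[i + 2]
        rising = (freqs - left) / (center - left)
        falling = (right - freqs) / (right - center)
        # Triangle: minimum of the two slopes, floored at zero.
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank


def log_mfb(signal, sample_rate, frame_len=400, hop=160, n_fft=512,
            n_filters=40):
    """Log Mel filter bank coefficients, shape (n_frames, n_filters)."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hanning(frame_len)
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2
    bank = mel_filter_bank(n_filters, n_fft, sample_rate)
    return np.log(power @ bank.T + 1e-10)  # small offset avoids log(0)


def modulation_features(mfb, n_mod_bins=8):
    """Fourier transform of each Mel band's trajectory over time.

    Returns the magnitudes of the lowest n_mod_bins modulation bins,
    shape (n_mod_bins, n_filters).
    """
    centered = mfb - mfb.mean(axis=0)
    return np.abs(np.fft.rfft(centered, axis=0))[:n_mod_bins]


def combined_features(signal, sample_rate):
    """Concatenate time-averaged MFB coefficients with modulation features."""
    mfb = log_mfb(signal, sample_rate)
    spectral = mfb.mean(axis=0)
    modulation = modulation_features(mfb).ravel()
    return np.concatenate([spectral, modulation])
```

    With these assumed defaults, calling combined_features on one second of 16 kHz audio frames the signal into 98 windows and returns a vector of 360 values (40 spectral plus 8 x 40 modulation). Concatenation is only one possible way to combine the two feature sets.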