ACOUSTIC ENVIRONMENT PROFILE ESTIMATION
Abstract:
Acoustic environment profile estimation is provided for automatic speech recognition (ASR) to compensate for the acoustic behavior of the environment in which audio is collected. Examples receive an audio signal and extract spectral features and modulation features. Extracting spectral features involves determining Mel filter bank (MFB) coefficients, and extracting modulation features involves applying Fourier transforms. The spectral features and modulation features are combined, and an acoustic environment profile estimate is extracted from the combination and provided as an input to the ASR. In some examples, the acoustic environment profile estimate is realized as a set of acoustic environment parameters, whereas in other examples it is realized as an acoustic embedding vector. For versions using acoustic environment parameters, a new set of parameters is determined when the acoustic environment changes significantly, for example when the flooring changes or when speakers or microphones change position.
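The feature extraction described above can be illustrated with a minimal sketch. The abstract does not specify frame sizes, the number of Mel bands, how the Fourier transforms are applied for the modulation features, or how the two feature sets are combined, so everything below is an assumption made for illustration: 25 ms frames with a 10 ms hop at 16 kHz, 40 triangular Mel filters, modulation features taken as the magnitude of a Fourier transform along the time axis of the log-MFB trajectories, and combination by simple concatenation. The function names (`mel_filter_bank`, `log_mfb`, `modulation_features`, `combined_features`) are hypothetical and do not come from the source. The step that maps the combined features to acoustic environment parameters or an embedding vector is not shown, since the abstract gives no detail about it.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filter_bank(n_mels, n_fft, sample_rate):
    """Triangular Mel filters, shape (n_mels, n_fft // 2 + 1)."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_mels + 2)
    hz_points = mel_to_hz(mel_points)
    fft_freqs = np.fft.rfftfreq(n_fft, d=1.0 / sample_rate)
    bank = np.zeros((n_mels, fft_freqs.size))
    for i in range(n_mels):
        lo, mid, hi = hz_points[i:i + 3]
        rising = (fft_freqs - lo) / (mid - lo)
        falling = (hi - fft_freqs) / (hi - mid)
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank

def log_mfb(signal, sample_rate, frame_len=400, hop=160, n_fft=512, n_mels=40):
    """Spectral features: log Mel filter bank energies, shape (n_frames, n_mels)."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hanning(frame_len)
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2
    bank = mel_filter_bank(n_mels, n_fft, sample_rate)
    return np.log(power @ bank.T + 1e-10)  # small floor avoids log(0)

def modulation_features(mfb, n_mod_bins=8):
    """Modulation features (assumed form): Fourier transform of each Mel
    band's trajectory over time, magnitude averaged across bands."""
    centered = mfb - mfb.mean(axis=0, keepdims=True)
    spectrum = np.abs(np.fft.rfft(centered, axis=0))
    return spectrum[:n_mod_bins].mean(axis=1)

def combined_features(signal, sample_rate):
    """Concatenate time-averaged spectral features with modulation features."""
    mfb = log_mfb(signal, sample_rate)
    return np.concatenate([mfb.mean(axis=0), modulation_features(mfb)])

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    audio = rng.standard_normal(16000)  # one second of synthetic noise at 16 kHz
    print(combined_features(audio, 16000).shape)
```

With these assumed settings, the combined vector has 40 spectral entries followed by 8 modulation entries. In a full system, a vector like this would feed whatever estimator produces the acoustic environment profile estimate.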