- 专利标题: SYSTEM AND METHOD FOR KEYWORD SPOTTING IN NOISY ENVIRONMENTS
-
申请号: US18470788申请日: 2023-09-20
-
公开(公告)号: US20240339123A1公开(公告)日: 2024-10-10
- 发明人: Chou-Chang Yang , Yashas Malur Saidutta , Rakshith Sharma Srinivasa , Ching-Hua Lee , Yilin Shen , Hongxia Jin
- 申请人: Samsung Electronics Co., Ltd.
- 申请人地址: KR Suwon-si
- 专利权人: Samsung Electronics Co., Ltd.
- 当前专利权人: Samsung Electronics Co., Ltd.
- 当前专利权人地址: KR Suwon-si
- 主分类号: G10L21/0232
- IPC分类号: G10L21/0232 ; G10L15/06 ; G10L15/08 ; G10L25/18
摘要:
A method includes receiving an audio input and generating a noisy time-frequency representation based on the audio input. The method also includes providing the noisy time-frequency representation to a noise management model trained to predict a denoising mask and a signal presence probability (SPP) map indicating a likelihood of a presence of speech. The method further includes determining an enhanced spectrogram using the denoising mask and the noisy time-frequency representation. The method also includes providing the enhanced spectrogram and the SPP map as inputs to a keyword classification model trained to determine a likelihood of a keyword being present in the audio input. In addition, the method includes, responsive to determining that a keyword is in the audio input, transmitting the audio input to a downstream application associated with the keyword.
信息查询