-
公开(公告)号:US20250111857A1
公开(公告)日:2025-04-03
申请号:US18478759
申请日:2023-09-29
Applicant: Amazon Technologies, Inc.
Inventor: Ritwik Giri , Zhepei Wang , Devansh Shah , Jean-Marc Valin , Michael Mark Goodwin
IPC: G10L21/0208 , G10L25/30 , H04M3/56
Abstract: Examples herein provide an approach to enhance an audio mixture of a teleconference application by switching between noise suppression modes using a single model. Specifically, a machine learning (ML) model may be configured to, in response to receiving an audio mixture representation as input, suppress either a background noise of the audio mixture or suppress all noise of the audio mixture except a user's voice. In some examples, the ML model may be trained on speech and background noise training data during a training phase. In addition, the ML model may be trained on a user's voice during an enrollment phase. In addition, during an inference phase, the ML model may enhance the audio mixture by suppressing a portion of the audio mixture.