SYSTEM AND METHOD FOR PERFORMING SPEECH ENHANCEMENT USING A DEEP NEURAL NETWORK-BASED SIGNAL

    公开(公告)号:US20180040333A1

    公开(公告)日:2018-02-08

    申请号:US15227885

    申请日:2016-08-03

    Applicant: Apple Inc.

    CPC classification number: G10L21/0232 G10L25/30 G10L25/87 G10L2021/02082

    Abstract: Method for performing speech enhancement using a Deep Neural Network (DNN)-based signal starts with training DNN offline by exciting a microphone using target training signal that includes signal approximation of clean speech. Loudspeaker is driven with a reference signal and outputs loudspeaker signal. Microphone then generates microphone signal based on at least one of: near-end speaker signal, ambient noise signal, or loudspeaker signal. Acoustic-echo-canceller (AEC) generates AEC echo-cancelled signal based on reference signal and microphone signal. Loudspeaker signal estimator generates estimated loudspeaker signal based on microphone signal and AEC echo-cancelled signal. DNN receives microphone signal, reference signal, AEC echo-cancelled signal, and estimated loudspeaker signal and generates a speech reference signal that includes signal statistics for residual echo or for noise. Noise suppressor generates a clean speech signal by suppressing noise or residual echo in the microphone signal based on speech reference signal. Other embodiments are described.

Patent Agency Ranking