Publication number: US09886968B2
Publication date: 2018-02-06
Application number: US14197149
Application date: 2014-03-04
Applicant: SYNAPTICS INCORPORATED
Inventor: Sahar E. Bou-Ghazale, Trausti Thormundsson, Willie B. Wu
CPC classification number: G10L25/84
Abstract: A system for audio processing comprising: an initial background statistical model system configured to generate an initial background statistical model using a predetermined sample size of audio data; a parameter computation system configured to generate parametric data for the audio data, including cepstral and energy parameters; a background statistics computation system configured to generate preliminary background statistics for determining whether speech has been detected; a first speech detection system configured to determine whether speech was present in the initial sample of audio data; an adaptive background statistical model system configured to provide an adaptive background statistical model for use in continuous processing of audio data for speech detection; a parameter computation system configured to calculate cepstral parameters, energy parameters and other suitable parameters for speech detection; a speech/non-speech classification system configured to classify individual frames as speech frames or non-speech frames, based on the computed parameters and the adaptive background statistical model data; a background statistics update system configured to update the background statistical model based on detected speech and non-speech frames; and a second speech detection system configured to perform speech detection processing and to generate a suitable indicator for use in processing audio data that is determined to include speech signals.
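
The frame-level flow described in the abstract (initial background statistics, per-frame parameters, speech/non-speech classification, background update) can be illustrated with a minimal Python sketch. It is not the patented method. Everything specific in it is an assumption made for illustration: the names `log_energy`, `BackgroundModel` and `detect_speech`, the use of log-energy as the only parameter (cepstral parameters are omitted for brevity), the threshold factor `k`, the smoothing factor `alpha`, the choice to update the background only on non-speech frames, and the premise that the first `init_frames` frames contain no speech.

```python
import math


def log_energy(frame):
    """Log of the mean squared amplitude, floored to avoid log(0)."""
    return math.log(sum(x * x for x in frame) / len(frame) + 1e-12)


class BackgroundModel:
    """Running mean and variance of one scalar feature (hypothetical helper)."""

    def __init__(self, initial_values):
        # Initial background statistics from a fixed-size sample.
        n = len(initial_values)
        self.mean = sum(initial_values) / n
        var = sum((v - self.mean) ** 2 for v in initial_values) / n
        self.var = max(var, 1e-6)  # floor keeps the threshold usable

    def is_speech(self, value, k=3.0):
        # Assumed rule: speech if the feature exceeds mean + k std deviations.
        return value > self.mean + k * math.sqrt(self.var)

    def update(self, value, alpha=0.05):
        # Exponentially weighted update of mean and variance.
        delta = value - self.mean
        self.mean += alpha * delta
        self.var = max((1 - alpha) * (self.var + alpha * delta * delta), 1e-6)


def detect_speech(frames, init_frames=10, k=3.0, alpha=0.05):
    """Return one boolean per frame: True where the frame is labelled speech.

    Assumes the first `init_frames` frames are background only; they are
    used to seed the model and are labelled non-speech.
    """
    if len(frames) < init_frames:
        raise ValueError("need at least init_frames frames")
    feats = [log_energy(f) for f in frames]
    model = BackgroundModel(feats[:init_frames])
    labels = [False] * init_frames
    for value in feats[init_frames:]:
        speech = model.is_speech(value, k)
        if not speech:
            # Assumption: adapt the background on non-speech frames only.
            model.update(value, alpha)
        labels.append(speech)
    return labels
```

Calling `detect_speech(frames)` on a list of equal-length sample lists returns one flag per frame. Freezing the model during detected speech is a common design choice that keeps loud speech from inflating the background estimate; the abstract itself only states that the update is based on detected speech and non-speech frames.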