Adaptive sound event classification

    公开(公告)号:US11410677B2

    公开(公告)日:2022-08-09

    申请号:US17102724

    申请日:2020-11-24

    Abstract: A device includes one or more processors configured to provide audio data samples to a sound event classification model. The one or more processors are also configured to determine, based on an output of the sound event classification model responsive to the audio data samples, whether a sound class of with the audio data samples was recognized by the sound event classification model. The one or more processors are further configured to, based on a determination that the sound class was not recognized, determine whether the sound event classification model corresponds to an audio scene associated with the audio data samples. The one or more processors are also configured to, based on a determination that the sound event classification model corresponds to the audio scene associated with the audio data samples, store model update data based on the audio data samples.

    SYSTEMS AND METHODS FOR SPEAKER DICTIONARY BASED SPEECH MODELING
    18.
    发明申请
    SYSTEMS AND METHODS FOR SPEAKER DICTIONARY BASED SPEECH MODELING 有权
    基于语音基础的语音建模系统与方法

    公开(公告)号:US20150243284A1

    公开(公告)日:2015-08-27

    申请号:US14629109

    申请日:2015-02-23

    CPC classification number: G10L15/20 G10L15/06 G10L21/0208 G10L21/028

    Abstract: A method for speech modeling by an electronic device is described. The method includes obtaining a real-time noise reference based on a noisy speech signal. The method also includes obtaining a real-time noise dictionary based on the real-time noise reference. The method further includes obtaining a first speech dictionary and a second speech dictionary. The method additionally includes reducing residual noise based on the real-time noise dictionary and the first speech dictionary to produce a residual noise-suppressed speech signal at a first modeling stage. The method also includes generating a reconstructed speech signal based on the residual noise-suppressed speech signal and the second speech dictionary at a second modeling stage.

    Abstract translation: 描述了一种由电子设备进行语音建模的方法。 该方法包括基于噪声语音信号获得实时噪声参考。 该方法还包括基于实时噪声参考获得实时噪声字典。 该方法还包括获得第一语音词典和第二语音词典。 该方法还包括基于实时噪声字典和第一语音字典减少残留噪声,以在第一建模阶段产生残留噪声抑制语音信号。 该方法还包括在第二建模阶段基于剩余噪声抑制语音信号和第二语音词典生成重构语音信号。

    SYSTEMS AND METHODS FOR AUDIO SIGNAL PROCESSING
    19.
    发明申请
    SYSTEMS AND METHODS FOR AUDIO SIGNAL PROCESSING 审中-公开
    用于音频信号处理的系统和方法

    公开(公告)号:US20130282372A1

    公开(公告)日:2013-10-24

    申请号:US13828158

    申请日:2013-03-14

    Abstract: A method for detecting voice activity by an electronic device is described. The method includes detecting near end speech based on a near end voiced speech detector and at least one single channel voice activity detector. The near end voiced speech detector is associated with a harmonic statistic based on a speech pitch histogram.

    Abstract translation: 描述了一种用于由电子设备检测语音活动的方法。 该方法包括基于近端浊音语音检测器和至少一个单声道语音活动检测器检测近端语音。 近端浊音语音检测器与基于语音音调直方图的谐波统计量相关联。

Patent Agency Ranking