APPARATUS AND METHOD FOR IMPROVING VOICE RECOGNITION
    1.
    发明申请
    APPARATUS AND METHOD FOR IMPROVING VOICE RECOGNITION 有权
    改进语音识别的装置和方法

    公开(公告)号:US20150279385A1

    公开(公告)日:2015-10-01

    申请号:US14667675

    申请日:2015-03-24

    CPC classification number: G10L15/20 G10L15/02 G10L21/0208 G10L25/24

    Abstract: An apparatus and method for improving voice recognition are disclosed herein. The apparatus includes a standard voice transmission unit, a Mel-frequency cepstrum coefficient (MFCC) generation unit, and an MFCC compensation unit. The standard voice transmission unit generates a standard voice. The MFCC generation unit generates voice feature data (MFCC) based on the utterance of the standard voice before voice recognition. The MFCC compensation unit stores a gain value generated based on the standard voice, and compensates for the distortion of the voice feature data based on the utterance of a user using the gain value during the voice recognition.

    Abstract translation: 本文公开了一种用于改善语音识别的装置和方法。 该装置包括标准语音发送单元,梅尔频率倒谱系数(MFCC)生成单元和MFCC补偿单元。 标准语音传输单元生成标准语音。 MFCC生成单元基于语音识别之前的标准语音的发音来生成语音特征数据(MFCC)。 MFCC补偿单元存储基于标准语音生成的增益值,并且基于在语音识别期间使用增益值的用户的话语来补偿语音特征数据的失真。

    METHOD AND APPARATUS OF EXPANDING SPEECH RECOGNITION DATABASE
    2.
    发明申请
    METHOD AND APPARATUS OF EXPANDING SPEECH RECOGNITION DATABASE 审中-公开
    扩展语音识别数据库的方法和设备

    公开(公告)号:US20160232892A1

    公开(公告)日:2016-08-11

    申请号:US14991716

    申请日:2016-01-08

    Abstract: Disclosed herein are a method and an apparatus of expanding a speech recognition database used for speech recognition. The method of expanding a speech recognition database includes generating a pronunciation text from a corpus; confirming whether or not a non-registered word that is not registered in advance in a pronunciation dictionary among words included in the pronunciation text is present; generating lexical model information on the corresponding non-registered word with reference to a built-up acoustic model in the case in which the non-registered word is present as a confirmation result; and adding the generated lexical model information to a built-up lexical model. According to exemplary embodiments of the present invention, various speeches may be recognized in a stand-along speech recognizer in which an infrastructure is insufficient.

    Abstract translation: 这里公开了扩展用于语音识别的语音识别数据库的方法和装置。 扩展语音识别数据库的方法包括从语料库生成发音文本; 确认在发音文本中包含的字之间是否存在在发音字典中未预先注册的未注册字; 在存在非注册字的情况下,参考建立的声学模型,在对应的非注册字上产生词汇模型信息作为确认结果; 并将生成的词汇模型信息添加到已建立的词汇模型中。 根据本发明的示例性实施例,可以在基础设施不足的独立语音识别器中识别各种语音。

    NOISE CANCELLATION APPARATUS AND METHOD
    3.
    发明申请
    NOISE CANCELLATION APPARATUS AND METHOD 有权
    噪声消除装置和方法

    公开(公告)号:US20150294667A1

    公开(公告)日:2015-10-15

    申请号:US14681187

    申请日:2015-04-08

    CPC classification number: G10L21/034 G10L21/0224 G10L21/0232

    Abstract: Disclosed herein is a noise cancellation apparatus and method, which select in advance parameters to be used for noise cancellation in a reference voice signal section by generating a reference voice signal in advance before a voice signal is generated, thus improving noise cancellation effects. The noise cancellation apparatus includes a parameter initialization unit for determining an initial value of a parameter to be used for noise cancellation, based on reference signals filtered for respective frequencies, a parameter estimation unit for receiving the initial value of the parameter, and estimating the parameter in response to signals that are input after being filtered for respective frequencies, a gain estimation unit for calculating gains for respective frequencies based on the parameter from the parameter estimation unit, and a gain application unit for cancelling noise by applying the gains to the signals that are input after being filtered for respective frequencies.

    Abstract translation: 这里公开了一种噪声消除装置和方法,其通过在生成语音信号之前预先生成参考语音信号,预先选择参考语音信号部分中用于噪声消除的参数,从而改善噪声消除效果。 噪声消除装置包括:参数初始化单元,用于基于针对各个频率滤波的参考信号,确定要用于噪声消除的参数的初始值;参数估计单元,用于接收参数的初始值,以及估计参数 响应于在针对各个频率被滤波后输入的信号,增益估计单元,用于基于来自参数估计单元的参数计算各个频率的增益,以及增益应用单元,用于通过将增益应用于 在对各个频率进行滤波后输入。

Patent Agency Ranking