SPEECH RECOGNITION SYSTEM AND METHOD
    1.
    发明申请
    SPEECH RECOGNITION SYSTEM AND METHOD 审中-公开
    语音识别系统与方法

    公开(公告)号:US20170011735A1

    公开(公告)日:2017-01-12

    申请号:US15187948

    申请日:2016-06-21

    CPC classification number: G10L15/005 G10L15/183 G10L15/32

    Abstract: A system and a method of speech recognition which enable a spoken language to be automatically identified while recognizing speech of a person who vocalize to effectively process multilingual speech recognition without a separate process for user registration or recognized language setting such as use of a button for allowing a user to manually select a language to be vocalized and support speech recognition of each language to be automatically performed even though persons who speak different languages vocalize by using one terminal to increase convenience of the user.

    Abstract translation: 语音识别的系统和方法,其能够在识别发声的人的语音时有效地处理多语言语音识别的语音被自动识别,而无需用户注册或识别的语言设置的单独过程,例如使用按钮来允许 用户手动选择要发出的语言,并且支持即使通过使用一个终端来增加用户的便利而说出不同语言的人发音的每种语言的语音识别。

    APPARATUS FOR SPEECH RECOGNITION USING MULTIPLE ACOUSTIC MODEL AND METHOD THEREOF
    2.
    发明申请
    APPARATUS FOR SPEECH RECOGNITION USING MULTIPLE ACOUSTIC MODEL AND METHOD THEREOF 有权
    使用多种声学模型进行语音识别的装置及其方法

    公开(公告)号:US20140180689A1

    公开(公告)日:2014-06-26

    申请号:US13845941

    申请日:2013-03-18

    Inventor: Dong Hyun KIM

    CPC classification number: G10L15/32 G10L15/065

    Abstract: Disclosed are an apparatus for recognizing voice using multiple acoustic models according to the present invention and a method thereof. An apparatus for recognizing voice using multiple acoustic models includes a voice data database (DB) configured to store voice data collected in various noise environments; a model generating means configured to perform classification for each speaker and environment based on the collected voice data, and to generate an acoustic model of a binary tree structure as the classification result; and a voice recognizing means configured to extract feature data of voice data when the voice data is received from a user, to select multiple models from the generated acoustic model based on the extracted feature data, to parallel recognize the voice data based on the selected multiple models, and to output a word string corresponding to the voice data as the recognition result.

    Abstract translation: 公开了根据本发明的使用多个声学模型识别语音的装置及其方法。 一种用于使用多个声学模型识别语音的装置包括:语音数据数据库(DB),被配置为存储在各种噪声环境中收集的语音数据; 模型生成装置,被配置为基于所收集的语音数据对每个说话者和环境进行分类,并且生成作为分类结果的二叉树结构的声学模型; 以及语音识别装置,被配置为当从用户接收到语音数据时提取语音数据的特征数据,基于所提取的特征数据从所生成的声学模型中选择多个模型,以基于所选择的多个并行识别语音数据 模型,并输出与语音数据相对应的字串作为识别结果。

    TERMINAL AND SERVER OF SPEAKER-ADAPTATION SPEECH-RECOGNITION SYSTEM AND METHOD FOR OPERATING THE SYSTEM
    6.
    发明申请
    TERMINAL AND SERVER OF SPEAKER-ADAPTATION SPEECH-RECOGNITION SYSTEM AND METHOD FOR OPERATING THE SYSTEM 有权
    语音识别系统的终端和服务器及操作系统的方法

    公开(公告)号:US20150371634A1

    公开(公告)日:2015-12-24

    申请号:US14709359

    申请日:2015-05-11

    Inventor: Dong Hyun KIM

    CPC classification number: G10L15/07 G10L15/30 G10L2015/221

    Abstract: Provided are a terminal and server of a speaker-adaptation speech-recognition system and a method for operating the system. The terminal in the speaker-adaptation speech-recognition system includes a speech recorder which transmits speech data of a speaker to a speech-recognition server, a statistical variable accumulator which receives a statistical variable including acoustic statistical information about speech of the speaker from the speech-recognition server which recognizes the transmitted speech data, and accumulates the received statistical variable, a conversion parameter generator which generates a conversion parameter about the speech of the speaker using the accumulated statistical variable and transmits the generated conversion parameter to the speech-recognition server, and a result displaying user interface which receives and displays result data when the speech-recognition server recognizes the speech data of the speaker using the transmitted conversion parameter and transmits the recognized result data.

    Abstract translation: 提供了一种扬声器适配语音识别系统的终端和服务器以及用于操作该系统的方法。 扬声器适配语音识别系统中的终端包括将语音数据发送到语音识别服务器的语音记录器,统计变量累加器,其从语音接收包括关于说话者的语音的声学统计信息 识别所发送的语音数据并累加接收到的统计变量,转换参数生成器,其使用累积的统计变量生成关于说话者的语音的转换参数,并将生成的转换参数发送到语音识别服务器, 并且显示用户界面的结果,其在语音识别服务器使用所发送的转换参数识别说话者的语音数据时接收并显示结果数据,并发送所识别的结果数据。

Patent Agency Ranking