Patent search ap:("Electronics AND Telecommunications Research Institute") AND inv:"Dong Hyun KIM" Page 1

1.

Invention application
SPEECH RECOGNITION SYSTEM AND METHOD (Status: under examination, published)

Publication (announcement) number: US20170011735A1

Publication (announcement) date: 2017-01-12

Application number: US15187948

Application date: 2016-06-21

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor: Dong Hyun KIM, Min Kyu LEE

IPC: G10L15/00, G10L15/183, G10L15/02

CPC classification number: G10L15/005, G10L15/183, G10L15/32

Abstract: A speech recognition system and method that automatically identify the spoken language while recognizing a person's speech, so that multilingual speech recognition is processed effectively without a separate step for user registration or language setting, such as a button for manually selecting the language to be spoken. Speech recognition for each language is performed automatically even when people who speak different languages use a single terminal, which increases user convenience.

2.

Invention application
APPARATUS FOR SPEECH RECOGNITION USING MULTIPLE ACOUSTIC MODEL AND METHOD THEREOF (Status: granted)

Publication (announcement) number: US20140180689A1

Publication (announcement) date: 2014-06-26

Application number: US13845941

Application date: 2013-03-18

Applicant: Electronics and Telecommunications Research Institute

Inventor: Dong Hyun KIM

IPC: G10L15/20

CPC classification number: G10L15/32, G10L15/065

Abstract: Disclosed are an apparatus for recognizing voice using multiple acoustic models and a method thereof. The apparatus includes a voice data database (DB) configured to store voice data collected in various noise environments; a model generating means configured to classify the collected voice data by speaker and environment and to generate an acoustic model with a binary tree structure as the classification result; and a voice recognizing means configured to extract feature data from voice data received from a user, to select multiple models from the generated acoustic model based on the extracted feature data, to recognize the voice data in parallel using the selected models, and to output a word string corresponding to the voice data as the recognition result.
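
The selection and parallel recognition steps in this abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the patent: the `ModelNode` class, the use of a centroid feature vector per tree node, picking the `k` nearest nodes by squared Euclidean distance, and a thread pool for the parallel decoding. Each node's `decode` callable stands in for a real recognizer and returns a word string with a score.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, Iterator, List, Optional, Tuple


@dataclass
class ModelNode:
    """One node of a hypothetical binary tree of acoustic models."""
    name: str
    centroid: List[float]  # assumed: mean feature vector of the node's data
    decode: Callable[[List[float]], Tuple[str, float]]  # -> (word string, score)
    left: Optional["ModelNode"] = None
    right: Optional["ModelNode"] = None


def sq_dist(a: List[float], b: List[float]) -> float:
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def all_nodes(root: ModelNode) -> Iterator[ModelNode]:
    """Walk the whole tree iteratively and yield every node."""
    stack = [root]
    while stack:
        node = stack.pop()
        yield node
        stack.extend(c for c in (node.left, node.right) if c is not None)


def select_models(root: ModelNode, features: List[float], k: int = 2) -> List[ModelNode]:
    """Pick the k nodes whose centroids are closest to the input features."""
    return sorted(all_nodes(root), key=lambda n: sq_dist(n.centroid, features))[:k]


def recognize(root: ModelNode, features: List[float], k: int = 2) -> str:
    """Decode with the selected models in parallel and keep the best-scoring result."""
    models = select_models(root, features, k)
    with ThreadPoolExecutor(max_workers=len(models)) as pool:
        results = list(pool.map(lambda m: m.decode(features), models))
    return max(results, key=lambda r: r[1])[0]
```

A real system would replace the toy distance-based selection and the `decode` stubs with whatever similarity measure and decoder the acoustic models actually use.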

3.

Invention application
SYSTEM AND METHOD FOR AUTOMATIC SPEECH TRANSLATION BASED ON ZERO USER INTERFACE (Status: granted)

Publication (announcement) number: US20220147722A1

Publication (announcement) date: 2022-05-12

Application number: US17522218

Application date: 2021-11-09

Applicant: Electronics and Telecommunications Research Institute

Inventor: Sang Hun KIM, Seung YUN, Min Kyu LEE, Joon Gyu MAENG, Dong Hyun KIM

IPC: G06F40/58, G10L21/0232, G10L25/21, G10L21/04, G10L25/78, G10L17/06, G10L17/02, G10L25/18, G10L25/06, G10L21/0316, G10L17/18, G06N3/08

Abstract: Disclosed are a Zero User Interface (UI)-based automatic speech translation system and method. The system and method can solve problems such as the procedural inconvenience of inputting speech signals and the malfunction of speech recognition due to crosstalk when users who speak different languages have a face-to-face conversation.
The system includes an automatic speech translation server configured to select a speech signal of a speaker from among multiple speech signals received from user terminals connected to an automatic speech translation service and configured to transmit a result of translating the speech signal of the speaker into a target language, a speaker terminal configured to receive the speech signal of the speaker and transmit the speech signal of the speaker to the automatic speech translation server, and a counterpart terminal configured to output the result of the translation in a form of text or voice in the target language.

4.

Invention application
SPEECH RECOGNITION SYSTEM AND METHOD (Status: under examination, published)

Publication (announcement) number: US20180075844A1

Publication (announcement) date: 2018-03-15

Application number: US15646302

Application date: 2017-07-11

Applicant: Electronics and Telecommunications Research Institute

Inventor: Dong Hyun KIM, Young Jik Lee, Sang Hun Kim, Seung Hi Kim, Min Kyu Lee, Mu Yeol Choi

IPC: G10L15/14, G10L17/04, G10L15/065

CPC classification number: G10L15/144, G10L15/063, G10L15/065, G10L15/08, G10L15/142, G10L17/04, G10L2015/0631

Abstract: A speech recognition method capable of automatically generating phones includes: learning a feature vector of speech data in an unsupervised manner; generating a phone set by clustering acoustic features selected based on the unsupervised learning result; allocating a sequence of phones to the speech data on the basis of the generated phone set; and generating an acoustic model on the basis of the sequence of phones and the speech data to which the sequence of phones is allocated.
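
The clustering and phone-allocation steps can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not name a clustering algorithm, so plain k-means is swapped in here, the frames are assumed to be already-learned feature vectors, and the names `kmeans`, `phone_sequence`, and the `p0`, `p1`, ... phone labels are hypothetical.

```python
from itertools import groupby
from typing import List, Sequence

Vector = Sequence[float]


def sq_dist(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(point: Vector, centroids: List[List[float]]) -> int:
    """Index of the centroid closest to the given point."""
    return min(range(len(centroids)), key=lambda i: sq_dist(point, centroids[i]))


def kmeans(points: List[Vector], k: int, iterations: int = 20) -> List[List[float]]:
    """Plain k-means with a deterministic start (k evenly spaced points)."""
    if k < 1 or len(points) < k:
        raise ValueError("need at least k points and k >= 1")
    step = len(points) // k
    centroids = [list(points[i * step]) for i in range(k)]
    for _ in range(iterations):
        groups: List[List[Vector]] = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centroids)].append(p)
        for i, group in enumerate(groups):
            if group:  # keep the old centroid if a cluster ends up empty
                centroids[i] = [sum(col) / len(group) for col in zip(*group)]
    return centroids


def phone_sequence(frames: List[Vector], centroids: List[List[float]]) -> List[str]:
    """Label each frame with its nearest cluster, then merge consecutive repeats."""
    labels = [f"p{nearest(f, centroids)}" for f in frames]
    return [label for label, _ in groupby(labels)]
```

Each centroid plays the role of one automatically generated phone, and the merged label sequence is the phone sequence that would be paired with the speech data when training an acoustic model.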

5.

Invention application
APPARATUS AND METHOD FOR PROCESSING VOICE SIGNAL AND TERMINAL (Status: under examination, published)

Publication (announcement) number: US20170013105A1

Publication (announcement) date: 2017-01-12

Application number: US15202912

Application date: 2016-07-06

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor: Min Kyu LEE, Sang Hun KIM, Young Ik KIM, Dong Hyun KIM, Mu Yeol CHOI

IPC: H04M1/725, G10L21/0272, G10L25/87, H04M1/60

CPC classification number: H04M1/7253, G10L15/20, G10L15/30, G10L21/0364, G10L25/78, H04M1/6066, H04M2201/40

Abstract: A voice signal processing apparatus includes: an input unit which receives a voice signal of a user; a detecting unit which detects an auxiliary signal; and a signal processing unit which transmits the voice signal to an external terminal in a first operation mode and transmits the voice signal and the auxiliary signal to the external terminal using the same or different protocols in a second operation mode.

6.

Invention application
TERMINAL AND SERVER OF SPEAKER-ADAPTATION SPEECH-RECOGNITION SYSTEM AND METHOD FOR OPERATING THE SYSTEM (Status: granted)

Publication (announcement) number: US20150371634A1

Publication (announcement) date: 2015-12-24

Application number: US14709359

Application date: 2015-05-11

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor: Dong Hyun KIM

IPC: G10L15/22, G10L21/18, G10L21/10, G10L15/02, G10L15/14

CPC classification number: G10L15/07, G10L15/30, G10L2015/221

Abstract: Provided are a terminal and server of a speaker-adaptation speech-recognition system and a method for operating the system. The terminal in the speaker-adaptation speech-recognition system includes: a speech recorder which transmits speech data of a speaker to a speech-recognition server; a statistical variable accumulator which receives, from the speech-recognition server that recognizes the transmitted speech data, a statistical variable including acoustic statistical information about the speech of the speaker, and accumulates the received statistical variable; a conversion parameter generator which generates a conversion parameter for the speech of the speaker using the accumulated statistical variable and transmits the generated conversion parameter to the speech-recognition server; and a result displaying user interface which receives and displays result data when the speech-recognition server recognizes the speech data of the speaker using the transmitted conversion parameter and transmits the recognized result data.
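
The terminal-side accumulation described above can be sketched roughly as follows. The abstract does not specify which statistics are exchanged or how the conversion parameter is computed, so this example assumes the simplest possible case: the server sends a frame count and a per-dimension feature sum for each utterance, and the conversion parameter is a bias vector that shifts the speaker's running mean onto a reference mean. The `StatAccumulator` class and its method names are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class StatAccumulator:
    """Accumulates per-utterance statistics received from the server (assumed format)."""
    dim: int
    count: int = 0
    total: List[float] = field(default_factory=list)

    def __post_init__(self) -> None:
        if not self.total:
            self.total = [0.0] * self.dim

    def add(self, frame_count: int, feature_sum: List[float]) -> None:
        """Fold one utterance's frame count and feature sum into the running totals."""
        self.count += frame_count
        self.total = [t + s for t, s in zip(self.total, feature_sum)]

    def conversion_parameter(self, reference_mean: List[float]) -> List[float]:
        """Bias that moves the speaker's mean onto the reference mean (assumed form)."""
        if self.count == 0:
            return [0.0] * self.dim  # no data yet: identity shift
        return [ref - t / self.count for ref, t in zip(reference_mean, self.total)]
```

For example, after `acc = StatAccumulator(dim=2)`, `acc.add(2, [2.0, 4.0])`, and `acc.add(2, [6.0, 4.0])`, the speaker mean is `[2.0, 2.0]`, so `acc.conversion_parameter([0.0, 1.0])` gives the shift the terminal would send back to the server in this simplified model.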
