Patent search ap:("ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE") AND inv:"Sanghun KIM" Page 1

1.

发明公开
GAZE-BASED AND AUGMENTED AUTOMATIC INTERPRETATION METHOD AND SYSTEM 审中-公开

公开(公告)号：US20230377558A1

公开(公告)日：2023-11-23

申请号：US18319946

申请日：2023-05-18

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Min kyu LEE , Sanghun KIM , Seung YUN , Jeonguk BANG , Namhyeong KIM

IPC: G10L13/027 , G06V40/16 , G06F40/58 , G10L15/00

CPC classification number: G10L13/027 , G06V40/161 , G06F40/58 , G10L15/005

Abstract: The present invention relates to an automatic interpretation method and system for converting only voice of a speaker into a target language. The present invention may significantly improve performance of automatic interpretation with a foreigner to be communicated with, even in a high-noise environment in which multiple speakers utter at the same time by utilizing voice and image information input to a smart device in a complex manner. In addition, the present invention may determine a situation based on text information and image information existing around a user, and reflect the situation information together with multimodal information to an interpretation engine in real time. In addition, the present invention may significantly improve user convenience of an automatic interpretation system by directly augmenting and displaying an interpreted sentence directly next to a speaker image or generating a synthesized sound by distinguishing the interpreted sentence from other speeches.

2.

发明申请
METHOD AND APPARATUS FOR CORRECTING ERROR IN SPEECH RECOGNITION SYSTEM 审中-公开
Title translation: 用于校正语音识别系统中的错误的方法和装置

公开(公告)号：US20140195226A1

公开(公告)日：2014-07-10

申请号：US13902057

申请日：2013-05-24

Applicant: Electronics and Telecommunications Research Institute

Inventor： Seung YUN , Sanghun KIM , Jeong Se KIM , Soo-jong LEE , Ki Hyun KIM

IPC: G10L15/01

CPC classification number: G10L15/01 , G10L2015/225

Abstract: A method of correcting errors in a speech recognition system includes a process of searching a speech recognition error-answer pair DB based on a sound model for a first candidate answer group for a speech recognition error, a process of searching a word relationship information DB for a second candidate answer group for the speech recognition error, a process of searching a user error correction information DB for a third candidate answer group for the speech recognition error, a process of searching a domain articulation pattern DB and a proper noun DB for a fourth candidate answer group for the speech recognition error, and a process of aligning candidate answers within each of the retrieved candidate answer groups and displaying the aligned candidate answers.

Abstract translation: 一种校正语音识别系统中的错误的方法包括：基于用于语音识别错误的第一候选答案组的声音模型搜索语音识别错误 - 答复对DB的过程，搜索词关系信息DB的处理用于语音识别错误的第二候选答案组，搜索用于语音识别错误的第三候选答案组的用户错误校正信息DB的处理，搜索域关节模式DB和专用名词DB的处理，用于第四用于语音识别错误的候选答案组，以及在每个所检索的候选答案组内对准候选答案并显示对齐的候选答案的处理。

3.

发明公开
VOICE RECOGNITION DEVICE HAVING BARGE-IN FUNCTION AND METHOD THEREOF 审中-公开

公开(公告)号：US20240212681A1

公开(公告)日：2024-06-27

申请号：US18498241

申请日：2023-10-31

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Min Kyu LEE , Seung Hi KIM , Sanghun KIM , Jeonguk BANG , Seung YUN

IPC: G10L15/22 , G06V40/16 , G10L13/02 , G10L17/00 , H04N23/611

CPC classification number: G10L15/22 , G06V40/172 , G10L13/02 , G10L17/00 , H04N23/611

Abstract: A voice recognition device having a barge-in function and a method thereof are proposed.
In an exemplary embodiment, there are disclosed an intelligent robot and a method for operating the intelligent robot, including an input unit for receiving a user's voice data, one or more processors, and an output unit for outputting a response generated on a basis of the user's voice data, wherein the processors generate the response corresponding to the users' voice data while maintaining a listening mode for identifying a dialogue partner by using the user's face image data and the user's voice data, and perform a speaking mode for control so as to perform an operation corresponding to the response.

4.

发明公开
METHOD AND APPARATUS FOR CONSTRUCTING DOMAIN-SPECIFIC SPEECH RECOGNITION MODEL AND END-TO-END SPEECH RECOGNIZER USING THE SAME 审中-公开

公开(公告)号：US20230215419A1

公开(公告)日：2023-07-06

申请号：US17979471

申请日：2022-11-02

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Seung YUN , Sanghun KIM , Min Kyu LEE

IPC: G10L13/04 , G10L15/16 , G10L15/30

CPC classification number: G10L13/04 , G10L15/16 , G10L15/30

Abstract: Provided is an end-to-end speech recognition technology capable of improving speech recognition performance in a desired specific domain, which includes collecting domain text data be specialized and comparing the data with a basic transcript text DB to determine domain text that is not included in the basic transcript text DB and requires additional training and constructing a specialization target domain text DB. The end-to-end speech recognition technology generates a speech signal from the domain text of the specialization target domain text DB, and trains a speech recognition neural network with the generated speech signal to generate an end-to-end speech recognition model specialized for the domain to be specialized. The specialized speech recognition model may be applied to the end-to-end speech recognizer to perform the domain-specific end-to-end speech recognition.

5.

发明公开
METHOD AND SYSTEM FOR GENERATING SYMPATHETIC BACK-CHANNEL SIGNAL 审中-公开

公开(公告)号：US20240221742A1

公开(公告)日：2024-07-04

申请号：US18488333

申请日：2023-10-17

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Seung Yun , Seung Hi Kim , Sanghun KIM , Jeonguk BANG , Min Kyu LEE

IPC: G10L15/22 , G10L15/05 , G10L15/26 , G10L25/63

CPC classification number: G10L15/22 , G10L15/05 , G10L15/26 , G10L25/63

Abstract: A method of generating a sympathetic back-channel signal is provided. The method includes receiving a voice signal from a user, determining whether predetermined timing is timing at which a back-channel signal is output in response to the input of the voice signal at the predetermined timing, storing the voice signal that has been input so far if the predetermined timing is the timing at which the back-channel signal is output as a result of the determination, determining back-channel signal information based on the stored voice signal, and outputting the determined back-channel signal information.

6.

发明公开
APPARATUS AND METHOD FOR IMPROVING CONTEXT-BASED AUTOMATIC INTERPRETATION PERFORMANCE 审中-公开

公开(公告)号：US20230290360A1

公开(公告)日：2023-09-14

申请号：US18085889

申请日：2022-12-21

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Seung YUN , Jeonguk BANG , Min Kyu LEE , Sanghun KIM

IPC: G10L19/00 , G10L25/78

CPC classification number: G10L19/00 , G10L25/78

Abstract: An apparatus for improving context-based automatic interpretation performance includes: an uttered voice input unit configured to receive a voice signal from a user; a previous sentence input unit configured to determine whether there is a user’s previous utterance when the voice signal is input by the uttered voice input unit; a voice encoding processing unit configured to decode only the voice signal through the uttered voice input unit when it is determined that there is no user’s previous utterance and extract a vector of the voice signal when it is determined that there is the user’s previous utterance; a context encoding processing unit configured to extract a context vector from a previous utterance when there is the previous utterance and transmit the extracted context vector of the previous utterance; and an interpretation decoding processing unit configured to output an interpretation result text.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification