Patent search ap:("Electronics AND Telecommunications Research Institute") AND inv:"Seung YUN" Page 1

1.

发明公开
APPARATUS AND METHOD FOR IMPROVING CONTEXT-BASED AUTOMATIC INTERPRETATION PERFORMANCE 审中-公开

公开(公告)号：US20230290360A1

公开(公告)日：2023-09-14

申请号：US18085889

申请日：2022-12-21

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Seung YUN , Jeonguk BANG , Min Kyu LEE , Sanghun KIM

IPC: G10L19/00 , G10L25/78

CPC classification number: G10L19/00 , G10L25/78

Abstract: An apparatus for improving context-based automatic interpretation performance includes: an uttered voice input unit configured to receive a voice signal from a user; a previous sentence input unit configured to determine whether there is a user’s previous utterance when the voice signal is input by the uttered voice input unit; a voice encoding processing unit configured to decode only the voice signal through the uttered voice input unit when it is determined that there is no user’s previous utterance and extract a vector of the voice signal when it is determined that there is the user’s previous utterance; a context encoding processing unit configured to extract a context vector from a previous utterance when there is the previous utterance and transmit the extracted context vector of the previous utterance; and an interpretation decoding processing unit configured to output an interpretation result text.

2.

发明申请
APPARATUS AND METHOD FOR MULTILINGUAL INTERPRETATION AND TRANSLATION HAVING AUTOMATIC LANGUAGE SETTING FUNCTION 审中-公开

公开(公告)号：US20170147558A1

公开(公告)日：2017-05-25

申请号：US15192055

申请日：2016-06-24

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Jeong Se KIM , Sang Hun KIM , Seung YUN

IPC: G06F17/28 , H04M1/725 , G06F17/27

CPC classification number: G06F17/289 , G06F17/275 , G06F17/2854 , H04M1/7255

Abstract: Provided is a method for interpretation and translation accomplished by an interpretation and translation apparatus of a user through interfacing with an interpretation and translation apparatus of the other party. The method includes: automatically setting a translation target language which enables to communicate with the other party based on a message from the interpretation and translation apparatus of the other party by using a communication connection in a network; receiving input information of a use language of the user; calling a translator corresponding to the translation target language to transmit a result obtained by translating the input information into the translation target language to the interpretation and translation apparatus of the other party; and outputting received data from the interpretation and translation apparatus of the other party or outputting the result obtained by translating the received data into the use language of the user by using the translator.

3.

发明申请
METHOD AND APPARATUS FOR IMPROVING PERFORMANCE OF ARTIFICIAL INTELLIGENCE MODEL USING SPEECH RECOGNITION RESULTS AS TEXT INPUT 有权

公开(公告)号：US20240420682A1

公开(公告)日：2024-12-19

申请号：US18585204

申请日：2024-02-23

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Seung Hi KIM , Jeong Uk BANG , Seung YUN

IPC: G10L15/06 , G10L15/26

Abstract: The present disclosure relates to a method and device for improving the performance of an AI model that uses voice recognition results as text input. A method of training an AI model according to an embodiment of the present disclosure may include: generating first time information on a plurality of words included in a voice and transcription, using a first learning sample including the voice and the transcription; generating second time information by adding a pre-configured delay time to the first time information; generating a modified transcription based on an end time of a last word among the plurality of words and the second time information; and performing training of the AI model based on a second training sample including the voice and the modified transcription.

4.

发明申请
METHOD AND APPARATUS FOR GENERATING MULTIPLE PHONEME STRINGS FOR FOREIGN NOUN 审中-公开
Title translation: 用于生成外部多种多种波音线的方法和装置

公开(公告)号：US20150066472A1

公开(公告)日：2015-03-05

申请号：US14244044

申请日：2014-04-03

Applicant: Electronics and Telecommunications Research Institute

Inventor： Min-Kyu LEE , Sang-Hun KIM , Seung YUN , Cheol-Soon YI

IPC: G06F17/28 , G10L15/02

CPC classification number: G10L15/187

Abstract: A method for generating multiple phoneme for foreign proper nouns according to the present invention comprises: converting a second language proper noun uttered in a first language to a second language word using an automatic translator; generating second language phoneme strings corresponding to the second language word using a second language G2P; converting the second language phoneme strings to first language phoneme strings; generating first language phoneme strings corresponding to the second language proper noun uttered in the first language using a first language G2P; and generating a plurality of phoneme strings by using the first language phoneme strings obtained through the step of converting to the first language phoneme strings and the first language phoneme strings obtained through the step of generating the first language phoneme strings.

Abstract translation: 根据本发明的用于产生用于外来专有名词的多个音素的方法包括：使用自动翻译器将用第一语言发音的第二语言专有名称转换为第二语言词; 使用第二语言G2P生成与第二语言词对应的第二语言音素字符串; 将第二语言音素字符串转换为第一语言音素字符串; 使用第一语言G2P生成与以第一语言发出的第二语言专有名词相对应的第一语言音素字符串; 以及通过使用通过转换为通过生成第一语言音素串的步骤获得的第一语言音素字符串和第一语言音素字符串的步骤获得的第一语言音素字符串来生成多个音素串。

5.

发明公开
GAZE-BASED AND AUGMENTED AUTOMATIC INTERPRETATION METHOD AND SYSTEM 审中-公开

公开(公告)号：US20230377558A1

公开(公告)日：2023-11-23

申请号：US18319946

申请日：2023-05-18

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Min kyu LEE , Sanghun KIM , Seung YUN , Jeonguk BANG , Namhyeong KIM

IPC: G10L13/027 , G06V40/16 , G06F40/58 , G10L15/00

CPC classification number: G10L13/027 , G06V40/161 , G06F40/58 , G10L15/005

Abstract: The present invention relates to an automatic interpretation method and system for converting only voice of a speaker into a target language. The present invention may significantly improve performance of automatic interpretation with a foreigner to be communicated with, even in a high-noise environment in which multiple speakers utter at the same time by utilizing voice and image information input to a smart device in a complex manner. In addition, the present invention may determine a situation based on text information and image information existing around a user, and reflect the situation information together with multimodal information to an interpretation engine in real time. In addition, the present invention may significantly improve user convenience of an automatic interpretation system by directly augmenting and displaying an interpreted sentence directly next to a speaker image or generating a synthesized sound by distinguishing the interpreted sentence from other speeches.

6.

发明申请
AUTOMATIC INTERPRETATION SERVER AND METHOD BASED ON ZERO UI 有权

公开(公告)号：US20230038407A1

公开(公告)日：2023-02-09

申请号：US17868747

申请日：2022-07-19

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Seung YUN , Sang Hun KIM , Min Kyu LEE , Joon Gyu MAENG

IPC: G10L17/22 , G10L17/04 , G10L17/06 , G06F3/16

Abstract: Provided a method performed by an automatic interpretation server based on a zero user interface (UI), which communicates with a plurality of terminal devices having a microphone function, a speaker function, a communication function, and a wearable function. The method includes connecting terminal devices disposed within a designated automatic interpretation zone, receiving a voice signal of a first user from a first terminal device among the terminal devices within the automatic interpretation zone, matching a plurality of users placed within a speech-receivable distance of the first terminal device, and performing automatic interpretation on the voice signal and transmitting results of the automatic interpretation to a second terminal device of at least one second user corresponding to a result of the matching.

7.

发明申请
APPARATUS AND METHOD FOR PROVIDING PERSONAL ASSISTANT SERVICE BASED ON AUTOMATIC TRANSLATION 有权

公开(公告)号：US20210004542A1

公开(公告)日：2021-01-07

申请号：US16919748

申请日：2020-07-02

Applicant: Electronics and Telecommunications Research Institute

Inventor： Seung YUN , Sang Hun KIM , Min Kyu LEE , Yun Keun LEE , Mu Yeol CHOI , Yeo Jeong KIM , Sang Kyu PARK

IPC: G06F40/47 , G10L15/22

Abstract: Provided are an apparatus and method for providing a personal assistant service based on automatic translation. The apparatus for providing a personal assistant service based on automatic translation includes an input section configured to receive a command of a user, a memory in which a program for providing a personal assistant service according to the command of the user is stored, and a processor configured to execute the program. The processor updates at least one of a speech recognition model, an automatic interpretation model, and an automatic translation model on the basis of an intention of the command of the user using a recognition result of the command of the user and provides the personal assistant service on the basis of an automatic translation call.

8.

发明申请
AUTOMATIC TRANSLATION SYSTEM, DEVICE, AND METHOD 审中-公开

公开(公告)号：US20190042565A1

公开(公告)日：2019-02-07

申请号：US16014203

申请日：2018-06-21

Applicant: Electronics and Telecommunications Research Institute

Inventor： Mu Yeol CHOI , Min Kyu LEE , Sang Hun KIM , Seung YUN

IPC: G06F17/28 , G10L21/0208 , G10L25/69 , G10L25/84 , G10L25/93

CPC classification number: G06F17/289 , G10L15/00 , G10L21/0208 , G10L25/69 , G10L25/84 , G10L25/93 , G10L2021/02165 , H04R1/1016 , H04R3/005 , H04R2201/107

Abstract: An automatic translation device includes a communications module transmitting and receiving data to and from an ear-set device including a speaker, a first microphone, and a second microphone, a memory storing a program generating a result of translation using a dual-channel audio signal, and a processor executing the program stored in the memory. When the program is executed, the processor compares a first audio signal including a voice signal of a user, received using the first microphone, with a second audio signal including a noise signal and the voice signal of the user, received using the second microphone, and entirely or selectively extracting the voice signal of the user from the first and second audio signals, based on a result of the comparison, to perform automatic translation.

9.

发明申请
METHOD AND APPARATUS FOR CORRECTING ERROR IN SPEECH RECOGNITION SYSTEM 审中-公开
Title translation: 用于校正语音识别系统中的错误的方法和装置

公开(公告)号：US20140195226A1

公开(公告)日：2014-07-10

申请号：US13902057

申请日：2013-05-24

Applicant: Electronics and Telecommunications Research Institute

Inventor： Seung YUN , Sanghun KIM , Jeong Se KIM , Soo-jong LEE , Ki Hyun KIM

IPC: G10L15/01

CPC classification number: G10L15/01 , G10L2015/225

Abstract: A method of correcting errors in a speech recognition system includes a process of searching a speech recognition error-answer pair DB based on a sound model for a first candidate answer group for a speech recognition error, a process of searching a word relationship information DB for a second candidate answer group for the speech recognition error, a process of searching a user error correction information DB for a third candidate answer group for the speech recognition error, a process of searching a domain articulation pattern DB and a proper noun DB for a fourth candidate answer group for the speech recognition error, and a process of aligning candidate answers within each of the retrieved candidate answer groups and displaying the aligned candidate answers.

Abstract translation: 一种校正语音识别系统中的错误的方法包括：基于用于语音识别错误的第一候选答案组的声音模型搜索语音识别错误 - 答复对DB的过程，搜索词关系信息DB的处理用于语音识别错误的第二候选答案组，搜索用于语音识别错误的第三候选答案组的用户错误校正信息DB的处理，搜索域关节模式DB和专用名词DB的处理，用于第四用于语音识别错误的候选答案组，以及在每个所检索的候选答案组内对准候选答案并显示对齐的候选答案的处理。

10.

发明申请
SYSTEM, USER TERMINAL, AND METHOD FOR PROVIDING AUTOMATIC INTERPRETATION SERVICE BASED ON SPEAKER SEPARATION 有权

公开(公告)号：US20220215857A1

公开(公告)日：2022-07-07

申请号：US17531316

申请日：2021-11-19

Applicant: Electronics and Telecommunications Research Institute

Inventor： Jeong Uk BANG , Seung YUN , Sang Hun KIM , Min Kyu LEE , Joon Gyu MAENG

IPC: G10L25/84 , G10L15/02 , G10L15/08

Abstract: Provided is a method of performing automatic interpretation based on speaker separation by a user terminal, the method including: receiving a first speech signal including at least one of a user speech of a user and a user surrounding speech around the user from an automatic interpretation service providing terminal, separating the first speech signal into speaker-specific speech signals, performing interpretation on the speaker-specific speech signals in a language selected by the user on the basis of an interpretation mode, and providing a second speech signal generated as a result of the interpretation to at least one of a counterpart terminal and the automatic interpretation service providing terminal according to the interpretation mode.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification