-
公开(公告)号:US11776557B2
公开(公告)日:2023-10-03
申请号:US17221364
申请日:2021-04-02
Inventor: Seung Yun , Sang Hun Kim , Min Kyu Lee
IPC: G10L25/21 , G10L21/02 , G06F40/40 , G10L21/0308 , G10L15/20 , G10L15/22 , G10L15/30 , G10L21/0208
CPC classification number: G10L21/0308 , G10L15/20 , G10L15/22 , G10L15/30 , G10L25/21 , G10L2015/227 , G10L2021/02087
Abstract: Provided is a zero user interface (UI)-based automatic interpretation method including receiving a plurality of speech signals uttered by a plurality of users from a plurality of terminal devices, acquiring a plurality of speech energies from the plurality of received speech signals, determining main speech signal uttered in a current utterance turn among the plurality of speech signals by comparing the plurality of acquired speech energies, and transmitting an automatic interpretation result acquired by performing automatic interpretation on the determined main speech signal to the plurality of terminal devices.
-
公开(公告)号:US11620978B2
公开(公告)日:2023-04-04
申请号:US16990482
申请日:2020-08-11
Inventor: Seung Yun , Sang Hun Kim , Min Kyu Lee
Abstract: An automatic interpretation method performed by a correspondent terminal communicating with an utterer terminal includes receiving, by a communication unit, voice feature information about an utterer and an automatic translation result, obtained by automatically translating a voice uttered in a source language by the utterer in a target language, from the utterer terminal and performing, by a sound synthesizer, voice synthesis on the basis of the automatic translation result and the voice feature information to output a personalized synthesis voice as an automatic interpretation result. The voice feature information about the utterer includes a hidden variable including a first additional voice result and a voice feature parameter and a second additional voice feature, which are extracted from a voice of the utterer.
-
3.
公开(公告)号:US11551012B2
公开(公告)日:2023-01-10
申请号:US16919748
申请日:2020-07-02
Inventor: Seung Yun , Sang Hun Kim , Min Kyu Lee , Yun Keun Lee , Mu Yeol Choi , Yeo Jeong Kim , Sang Kyu Park
Abstract: Provided are an apparatus and method for providing a personal assistant service based on automatic translation. The apparatus for providing a personal assistant service based on automatic translation includes an input section configured to receive a command of a user, a memory in which a program for providing a personal assistant service according to the command of the user is stored, and a processor configured to execute the program. The processor updates at least one of a speech recognition model, an automatic interpretation model, and an automatic translation model on the basis of an intention of the command of the user using a recognition result of the command of the user and provides the personal assistant service on the basis of an automatic translation call.
-
公开(公告)号:US10558763B2
公开(公告)日:2020-02-11
申请号:US16014203
申请日:2018-06-21
Inventor: Mu Yeol Choi , Min Kyu Lee , Sang Hun Kim , Seung Yun
IPC: G06F17/28 , G10L21/0208 , G10L25/84 , G10L25/93 , G10L25/69 , G10L21/0216
Abstract: An automatic translation device includes a communications module transmitting and receiving data to and from an ear-set device including a speaker, a first microphone, and a second microphone, a memory storing a program generating a result of translation using a dual-channel audio signal, and a processor executing the program stored in the memory. When the program is executed, the processor compares a first audio signal including a voice signal of a user, received using the first microphone, with a second audio signal including a noise signal and the voice signal of the user, received using the second microphone, and entirely or selectively extracting the voice signal of the user from the first and second audio signals, based on a result of the comparison, to perform automatic translation.
-
公开(公告)号:US20180075844A1
公开(公告)日:2018-03-15
申请号:US15646302
申请日:2017-07-11
Inventor: Dong Hyun KIM , Young Jik Lee , Sang Hun Kim , Seung Hi Kim , Min Kyu Lee , Mu Yeol Choi
IPC: G10L15/14 , G10L17/04 , G10L15/065
CPC classification number: G10L15/144 , G10L15/063 , G10L15/065 , G10L15/08 , G10L15/142 , G10L17/04 , G10L2015/0631
Abstract: A speech recognition method capable of automatic generation of phones according to the present invention includes: unsupervisedly learning a feature vector of speech data; generating a phone set by clustering acoustic features selected based on an unsupervised learning result; allocating a sequence of phones to the speech data on the basis of the generated phone set; and generating an acoustic model on the basis of the sequence of phones and the speech data to which the sequence of phones is allocated.
-
公开(公告)号:US12260865B2
公开(公告)日:2025-03-25
申请号:US17868747
申请日:2022-07-19
Inventor: Seung Yun , Sang Hun Kim , Min Kyu Lee , Joon Gyu Maeng
Abstract: Provided a method performed by an automatic interpretation server based on a zero user interface (UI), which communicates with a plurality of terminal devices having a microphone function, a speaker function, a communication function, and a wearable function. The method includes connecting terminal devices disposed within a designated automatic interpretation zone, receiving a voice signal of a first user from a first terminal device among the terminal devices within the automatic interpretation zone, matching a plurality of users placed within a speech-receivable distance of the first terminal device, and performing automatic interpretation on the voice signal and transmitting results of the automatic interpretation to a second terminal device of at least one second user corresponding to a result of the matching.
-
公开(公告)号:US10108606B2
公开(公告)日:2018-10-23
申请号:US15214215
申请日:2016-07-19
Inventor: Seung Yun , Ki Hyun Kim , Sang Hun Kim , Yun Young Kim , Jeong Se Kim , Min Kyu Lee , Soo Jong Lee , Young Jik Lee , Mu Yeol Choi
IPC: G06F17/28 , G10L13/033 , G10L13/06 , G10L25/24 , G10L25/75
Abstract: Provided are an automatic interpretation system and method for generating a synthetic sound having characteristics similar to those of an original speaker's voice. The automatic interpretation system for generating a synthetic sound having characteristics similar to those of an original speaker's voice includes a speech recognition module configured to generate text data by performing speech recognition for an original speech signal of an original speaker and extract at least one piece of characteristic information among pitch information, vocal intensity information, speech speed information, and vocal tract characteristic information of the original speech, an automatic translation module configured to generate a synthesis-target translation by translating the text data, and a speech synthesis module configured to generate a synthetic sound of the synthesis-target translation.
-
公开(公告)号:US12112769B2
公开(公告)日:2024-10-08
申请号:US17531316
申请日:2021-11-19
Inventor: Jeong Uk Bang , Seung Yun , Sang Hun Kim , Min Kyu Lee , Joon Gyu Maeng
IPC: G10L25/84 , G06F40/40 , G06F40/58 , G10L13/00 , G10L15/02 , G10L15/08 , G10L15/26 , G10L21/0208
CPC classification number: G10L25/84 , G06F40/40 , G06F40/58 , G10L13/00 , G10L15/02 , G10L15/08 , G10L15/26 , G10L21/0208
Abstract: Provided is a method of performing automatic interpretation based on speaker separation by a user terminal, the method including: receiving a first speech signal including at least one of a user speech of a user and a user surrounding speech around the user from an automatic interpretation service providing terminal, separating the first speech signal into speaker-specific speech signals, performing interpretation on the speaker-specific speech signals in a language selected by the user on the basis of an interpretation mode, and providing a second speech signal generated as a result of the interpretation to at least one of a counterpart terminal and the automatic interpretation service providing terminal according to the interpretation mode.
-
公开(公告)号:US11977855B2
公开(公告)日:2024-05-07
申请号:US17522218
申请日:2021-11-09
Inventor: Sang Hun Kim , Seung Yun , Min Kyu Lee , Joon Gyu Maeng , Dong Hyun Kim
IPC: G06F40/58 , G10L17/02 , G10L17/06 , G10L17/18 , G10L21/0232 , G10L21/0316 , G10L21/04 , G10L25/06 , G10L25/18 , G10L25/21 , G10L25/78
CPC classification number: G06F40/58 , G10L17/02 , G10L17/06 , G10L17/18 , G10L21/0232 , G10L21/0316 , G10L21/04 , G10L25/06 , G10L25/18 , G10L25/21 , G10L25/78
Abstract: The Zero User Interface (UI)-based automatic speech translation system and method can solve problems such as the procedural inconvenience of inputting speech signals and the malfunction of speech recognition due to crosstalk when users who speak difference languages have a face-to-face conversation. The system includes an automatic speech translation server, speaker terminals and a counterpart terminal. The automatic speech translation server selects a speech signal of a speaker among multiple speech signals received from speaker terminals connected to an automatic speech translation service and transmits a result of translating the speech signal of the speaker into a target language to a counterpart terminal.
-
公开(公告)号:US10298736B2
公开(公告)日:2019-05-21
申请号:US15202912
申请日:2016-07-06
Inventor: Min Kyu Lee , Sang Hun Kim , Young Ik Kim , Dong Hyun Kim , Mu Yeol Choi
Abstract: A voice signal processing apparatus includes: an input unit which receives a voice signal of a user; a detecting unit which detects an auxiliary signal; and a signal processing unit which transmits the voice signal to an external terminal in a first operation mode and transmits the voice signal and the auxiliary signal to the external terminal using the same or different protocols in a second operation mode.
-
-
-
-
-
-
-
-
-