-
Publication No.: US10216729B2
Publication Date: 2019-02-26
Application No.: US14914390
Filing Date: 2014-04-30
Inventor: Sang-Hun Kim , Ki-Hyun Kim , Ji-Hyun Wang , Dong-Hyun Kim , Seung Yun , Min-Kyu Lee , Dam-Heo Lee , Mu-Yeol Choi
IPC: G06F17/28 , H04M1/60 , G10L13/033 , G10L15/00
Abstract: A user terminal, hands-free device and method for hands-free automatic interpretation service. The user terminal includes an interpretation environment initialization unit, an interpretation intermediation unit, and an interpretation processing unit. The interpretation environment initialization unit performs pairing with a hands-free device in response to a request from the hands-free device, and initializes an interpretation environment. The interpretation intermediation unit sends interpretation results obtained by interpreting a user's voice information received from the hands-free device to a counterpart terminal, and receives interpretation results obtained by interpreting a counterpart's voice information from the counterpart terminal. The interpretation processing unit synthesizes the interpretation results of the counterpart into a voice form based on the initialized interpretation environment when the interpretation results are received from the counterpart terminal, and sends the synthesized voice information to the hands-free device.
-
Publication No.: US12260865B2
Publication Date: 2025-03-25
Application No.: US17868747
Filing Date: 2022-07-19
Inventor: Seung Yun , Sang Hun Kim , Min Kyu Lee , Joon Gyu Maeng
Abstract: Provided is a method performed by an automatic interpretation server based on a zero user interface (UI), which communicates with a plurality of terminal devices having a microphone function, a speaker function, a communication function, and a wearable function. The method includes connecting terminal devices disposed within a designated automatic interpretation zone, receiving a voice signal of a first user from a first terminal device among the terminal devices within the automatic interpretation zone, matching a plurality of users placed within a speech-receivable distance of the first terminal device, and performing automatic interpretation on the voice signal and transmitting results of the automatic interpretation to a second terminal device of at least one second user corresponding to a result of the matching.
-
Publication No.: US10108606B2
Publication Date: 2018-10-23
Application No.: US15214215
Filing Date: 2016-07-19
Inventor: Seung Yun , Ki Hyun Kim , Sang Hun Kim , Yun Young Kim , Jeong Se Kim , Min Kyu Lee , Soo Jong Lee , Young Jik Lee , Mu Yeol Choi
IPC: G06F17/28 , G10L13/033 , G10L13/06 , G10L25/24 , G10L25/75
Abstract: Provided are an automatic interpretation system and method for generating a synthetic sound having characteristics similar to those of an original speaker's voice. The automatic interpretation system for generating a synthetic sound having characteristics similar to those of an original speaker's voice includes a speech recognition module configured to generate text data by performing speech recognition for an original speech signal of an original speaker and extract at least one piece of characteristic information among pitch information, vocal intensity information, speech speed information, and vocal tract characteristic information of the original speech, an automatic translation module configured to generate a synthesis-target translation by translating the text data, and a speech synthesis module configured to generate a synthetic sound of the synthesis-target translation.
-
Publication No.: US11776557B2
Publication Date: 2023-10-03
Application No.: US17221364
Filing Date: 2021-04-02
Inventor: Seung Yun , Sang Hun Kim , Min Kyu Lee
IPC: G10L25/21 , G10L21/02 , G06F40/40 , G10L21/0308 , G10L15/20 , G10L15/22 , G10L15/30 , G10L21/0208
CPC classification number: G10L21/0308 , G10L15/20 , G10L15/22 , G10L15/30 , G10L25/21 , G10L2015/227 , G10L2021/02087
Abstract: Provided is a zero user interface (UI)-based automatic interpretation method including receiving a plurality of speech signals uttered by a plurality of users from a plurality of terminal devices, acquiring a plurality of speech energies from the plurality of received speech signals, determining a main speech signal uttered in a current utterance turn among the plurality of speech signals by comparing the plurality of acquired speech energies, and transmitting an automatic interpretation result acquired by performing automatic interpretation on the determined main speech signal to the plurality of terminal devices.
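Illustrative note: the energy comparison described in this abstract can be sketched roughly as follows. This is only a minimal sketch and not the patented method. It assumes that "speech energy" means the mean squared amplitude of a frame and that the main signal is simply the one with the highest energy; the function names `speech_energy` and `select_main_terminal` and the terminal ids are hypothetical.

```python
from typing import Dict, Sequence


def speech_energy(samples: Sequence[float]) -> float:
    """Mean squared amplitude of one frame (assumed energy measure)."""
    if not samples:
        return 0.0
    return sum(s * s for s in samples) / len(samples)


def select_main_terminal(signals: Dict[str, Sequence[float]]) -> str:
    """Return the id of the terminal whose frame has the highest energy.

    Assumption: the loudest capture belongs to the current speaker; the
    abstract does not state the exact comparison rule.
    """
    if not signals:
        raise ValueError("at least one speech signal is required")
    return max(signals, key=lambda tid: speech_energy(signals[tid]))


# Hypothetical frames captured by two terminals during the same turn.
frames = {
    "terminal_a": [0.1, -0.1, 0.1, -0.1],
    "terminal_b": [0.5, -0.4, 0.5, -0.4],
}
main_id = select_main_terminal(frames)
```

A plain maximum is used here for brevity; a practical system would likely smooth energies over time before comparing them.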
-
Publication No.: US11620978B2
Publication Date: 2023-04-04
Application No.: US16990482
Filing Date: 2020-08-11
Inventor: Seung Yun , Sang Hun Kim , Min Kyu Lee
Abstract: An automatic interpretation method performed by a correspondent terminal communicating with an utterer terminal includes receiving, by a communication unit, voice feature information about an utterer and an automatic translation result, obtained by automatically translating a voice uttered in a source language by the utterer in a target language, from the utterer terminal and performing, by a sound synthesizer, voice synthesis on the basis of the automatic translation result and the voice feature information to output a personalized synthesis voice as an automatic interpretation result. The voice feature information about the utterer includes a hidden variable including a first additional voice result and a voice feature parameter and a second additional voice feature, which are extracted from a voice of the utterer.
-
Publication No.: US11551012B2
Publication Date: 2023-01-10
Application No.: US16919748
Filing Date: 2020-07-02
Inventor: Seung Yun , Sang Hun Kim , Min Kyu Lee , Yun Keun Lee , Mu Yeol Choi , Yeo Jeong Kim , Sang Kyu Park
Abstract: Provided are an apparatus and method for providing a personal assistant service based on automatic translation. The apparatus for providing a personal assistant service based on automatic translation includes an input section configured to receive a command of a user, a memory in which a program for providing a personal assistant service according to the command of the user is stored, and a processor configured to execute the program. The processor updates at least one of a speech recognition model, an automatic interpretation model, and an automatic translation model on the basis of an intention of the command of the user using a recognition result of the command of the user and provides the personal assistant service on the basis of an automatic translation call.
-
Publication No.: US10558763B2
Publication Date: 2020-02-11
Application No.: US16014203
Filing Date: 2018-06-21
Inventor: Mu Yeol Choi , Min Kyu Lee , Sang Hun Kim , Seung Yun
IPC: G06F17/28 , G10L21/0208 , G10L25/84 , G10L25/93 , G10L25/69 , G10L21/0216
Abstract: An automatic translation device includes a communications module transmitting and receiving data to and from an ear-set device including a speaker, a first microphone, and a second microphone, a memory storing a program generating a result of translation using a dual-channel audio signal, and a processor executing the program stored in the memory. When the program is executed, the processor compares a first audio signal including a voice signal of a user, received using the first microphone, with a second audio signal including a noise signal and the voice signal of the user, received using the second microphone, and entirely or selectively extracts the voice signal of the user from the first and second audio signals, based on a result of the comparison, to perform automatic translation.
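Illustrative note: one way to picture the dual-channel comparison in this abstract is a frame-by-frame energy test. The sketch below is an assumption, not the rule claimed in the patent: it keeps a frame from the first microphone only when its energy is at least `ratio` times the energy of the matching frame from the second microphone. The function names, the frame length, and the threshold are all hypothetical.

```python
from typing import List, Sequence


def frame_energy(frame: Sequence[float]) -> float:
    """Mean squared amplitude of a frame."""
    if not frame:
        return 0.0
    return sum(x * x for x in frame) / len(frame)


def split_frames(samples: Sequence[float], frame_len: int) -> List[List[float]]:
    """Cut a signal into consecutive frames of frame_len samples."""
    return [list(samples[i:i + frame_len]) for i in range(0, len(samples), frame_len)]


def select_voice_frames(first: Sequence[float], second: Sequence[float],
                        frame_len: int = 4, ratio: float = 2.0) -> List[float]:
    """Keep first-microphone frames that dominate the second microphone.

    Assumption: the user's voice is much stronger on the first microphone,
    so frames where it is not dominant are treated as noise and dropped.
    """
    kept: List[float] = []
    pairs = zip(split_frames(first, frame_len), split_frames(second, frame_len))
    for f1, f2 in pairs:
        if frame_energy(f1) >= ratio * frame_energy(f2):
            kept.extend(f1)
    return kept


# Hypothetical two-frame capture: voice in frame 1, mostly noise in frame 2.
first_mic = [0.8, -0.8, 0.8, -0.8, 0.1, 0.1, 0.1, 0.1]
second_mic = [0.1, 0.1, 0.1, 0.1, 0.3, 0.3, 0.3, 0.3]
voice = select_voice_frames(first_mic, second_mic)
```

With `ratio` set low enough every frame is kept, which loosely corresponds to "entirely" extracting the signal; higher values make the extraction selective.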
-
Publication No.: US12112769B2
Publication Date: 2024-10-08
Application No.: US17531316
Filing Date: 2021-11-19
Inventor: Jeong Uk Bang , Seung Yun , Sang Hun Kim , Min Kyu Lee , Joon Gyu Maeng
IPC: G10L25/84 , G06F40/40 , G06F40/58 , G10L13/00 , G10L15/02 , G10L15/08 , G10L15/26 , G10L21/0208
CPC classification number: G10L25/84 , G06F40/40 , G06F40/58 , G10L13/00 , G10L15/02 , G10L15/08 , G10L15/26 , G10L21/0208
Abstract: Provided is a method of performing automatic interpretation based on speaker separation by a user terminal, the method including: receiving a first speech signal including at least one of a user speech of a user and a user surrounding speech around the user from an automatic interpretation service providing terminal, separating the first speech signal into speaker-specific speech signals, performing interpretation on the speaker-specific speech signals in a language selected by the user on the basis of an interpretation mode, and providing a second speech signal generated as a result of the interpretation to at least one of a counterpart terminal and the automatic interpretation service providing terminal according to the interpretation mode.
-
Publication No.: US20240221742A1
Publication Date: 2024-07-04
Application No.: US18488333
Filing Date: 2023-10-17
Inventor: Seung Yun , Seung Hi Kim , Sanghun KIM , Jeonguk BANG , Min Kyu LEE
Abstract: A method of generating a sympathetic back-channel signal is provided. The method includes receiving a voice signal from a user, determining whether predetermined timing is timing at which a back-channel signal is output in response to the input of the voice signal at the predetermined timing, storing the voice signal that has been input so far if the predetermined timing is the timing at which the back-channel signal is output as a result of the determination, determining back-channel signal information based on the stored voice signal, and outputting the determined back-channel signal information.
-
Publication No.: US11977855B2
Publication Date: 2024-05-07
Application No.: US17522218
Filing Date: 2021-11-09
Inventor: Sang Hun Kim , Seung Yun , Min Kyu Lee , Joon Gyu Maeng , Dong Hyun Kim
IPC: G06F40/58 , G10L17/02 , G10L17/06 , G10L17/18 , G10L21/0232 , G10L21/0316 , G10L21/04 , G10L25/06 , G10L25/18 , G10L25/21 , G10L25/78
CPC classification number: G06F40/58 , G10L17/02 , G10L17/06 , G10L17/18 , G10L21/0232 , G10L21/0316 , G10L21/04 , G10L25/06 , G10L25/18 , G10L25/21 , G10L25/78
Abstract: The Zero User Interface (UI)-based automatic speech translation system and method can solve problems such as the procedural inconvenience of inputting speech signals and the malfunction of speech recognition due to crosstalk when users who speak different languages have a face-to-face conversation. The system includes an automatic speech translation server, speaker terminals and a counterpart terminal. The automatic speech translation server selects a speech signal of a speaker among multiple speech signals received from speaker terminals connected to an automatic speech translation service and transmits a result of translating the speech signal of the speaker into a target language to a counterpart terminal.
-