Patent search ap:("LG Electronics Inc.") AND inv:"Sungmin HAN" Page 1

1.

发明申请
METHOD FOR PROVIDING VOICE SYNTHESIS SERVICE AND SYSTEM THEREFOR 有权

公开(公告)号：US20250006177A1

公开(公告)日：2025-01-02

申请号：US18708348

申请日：2022-10-20

Applicant: LG ELECTRONICS INC.

Inventor： Siyoung YANG , Sangki KIM , Sungmin HAN

IPC: G10L13/08 , G10L13/033

Abstract: A method for providing a voice synthesis service and a system therefor are disclosed. A method of providing a voice synthesis service according to at least one of various embodiments of the present disclosure may comprise the steps of: receiving sound source data for synthesizing a voice of a speaker for a plurality of predefined first texts through a voice synthesis service platform that provides a development toolkit; performing tone conversion training on the sound source data of the speaker using a pre-generated tone conversion base model; generating a voice synthesis model for the speaker through the voice conversion training; receiving a second text; generating a voice synthesis model through voice synthesis inference on the basis of the voice synthesis model for the speaker and the second text; and generating a synthesized voice using the voice synthesis model.

2.

发明申请
DEVICE, SYSTEM AND METHOD FOR CONTROLLING A PLURALITY OF VOICE RECOGNITION DEVICES 有权

公开(公告)号：US20210233537A1

公开(公告)日：2021-07-29

申请号：US16917784

申请日：2020-06-30

Applicant: LG ELECTRONICS INC.

Inventor： Siyoung YANG , Yongchul PARK , Sungmin HAN , Sangki KIM , Juyeong JANG , Minook KIM

IPC: G10L15/32 , G10L15/30 , G10L15/22 , G16Y40/35

Abstract: Disclosed is a device for controlling a plurality of voice recognition devices for determining and selecting a first voice recognition device that a user wants to use based on a point in time when the voice of the user is spoken or a place where the user spoke the voice. The device for controlling a plurality of voice recognition devices according to the present disclosure may be associated with an artificial intelligence module, a robot, an augmented reality (AR) device, a virtual reality (VR) device, a device related to 5G service, etc.

3.

发明申请
METHOD AND DEVICE FOR FOCUSING SOUND SOURCE 有权

公开(公告)号：US20210096810A1

公开(公告)日：2021-04-01

申请号：US16703768

申请日：2019-12-04

Applicant: LG ELECTRONICS INC.

Inventor： Sang Ki KIM , Yongchul PARK , Sungmin HAN , Siyoung YANG , Juyeong JANG , Minook KIM

IPC: G06F3/16 , G10L25/51 , G06K9/00 , H04N5/93

Abstract: Disclosed are a sound source focus method and device in which the sound source focus device, in a 5G communication environment by amplifying and outputting a sound source signal of a user's object of interest extracted from an acoustic signal included in video content by executing a loaded artificial intelligence (AI) algorithm and/or machine learning algorithm. The sound source focus method includes playing video content including a video signal including at least one moving object and the acoustic signal in which sound sources output by the object are mixed, determining the user's object of interest from the video signal, acquiring unique sound source information about the user's object of interest, extracting an actual sound source for the user's object of interest corresponding to the unique sound source information from the acoustic signal, and outputting the actual sound source extracted for the user's object of interest.

4.

发明申请
ARTIFICIAL INTELLIGENCE DEVICE 审中-公开

公开(公告)号：US20200051564A1

公开(公告)日：2020-02-13

申请号：US16538172

申请日：2019-08-12

Applicant: LG ELECTRONICS INC.

Inventor： Jonghoon CHAE , Yongchul PARK , Siyoung YANG , Juyeong JANG , Sungmin HAN

IPC: G10L15/22 , G10L15/02 , G10L25/90

Abstract: An artificial intelligence device includes a speaker, a microphone configured to receive a user's speech, and one or more controllers configured to extract an utterance feature of the received speech, determine a user type corresponding to the extracted utterance feature, map a speech agent associated with the determined user type, and output an audio response through the speaker using the mapped speech agent.

5.

发明申请
SPEECH SYNTHESIZER USING ARTIFICIAL INTELLIGENCE, METHOD OF OPERATING SPEECH SYNTHESIZER AND COMPUTER-READABLE RECORDING MEDIUM 有权

公开(公告)号：US20210327405A1

公开(公告)日：2021-10-21

申请号：US16499755

申请日：2019-05-15

Applicant: LG ELECTRONICS INC.

Inventor： Jonghoon CHAE , Sungmin HAN

IPC: G10L13/02 , G10L13/08 , G10L25/30 , G06N3/08

Abstract: A speech synthesizer using artificial intelligence includes a memory configured to store a first ratio of a word classified into a minor class among a plurality of classes and a synthesized speech model, and a processor configured to determine a class classification probability set of the word using the word, the first ratio and the synthesized speech model. The first ratio indicates a ratio in which the word is classified into the minor class within a plurality of characters, the plurality of classes includes a first class corresponding to first reading break, a second class corresponding to second reading break greater than the first break and a third class corresponding to third reading break greater than the second break, and the minor class has a smallest count among the first to third classes.

6.

发明申请
METHOD AND APPARATUS FOR CONTROLLING DEVICE 有权

公开(公告)号：US20210174796A1

公开(公告)日：2021-06-10

申请号：US16810013

申请日：2020-03-05

Applicant: LG ELECTRONICS INC.

Inventor： Jong Hoon CHAE , Minook KIM , Yongchul PARK , Sungmin HAN , Siyoung YANG , Sangki KIM , Juyeong JANG

IPC: G10L15/22 , G10L15/02 , G10L15/16 , G10L25/90 , G10L15/187 , G10L25/84 , G10L25/18 , G06N3/04

Abstract: A method and apparatus for controlling a device according to an embodiment of the present disclosure may be based on a speech feature of a user reflecting the Lombard effect so as to operate a device located far away from the user, among a plurality of electronic devices. As such, even when the user calls a device located far away from the user without any separate context information, speech recognition neural networks and weight calculation neural networks may be selected and used to operate the device located far away from the user, and reception of a speech signal of the user calling a device located far away from the user may be performed in an Internet of Things (IoT) environment using a 5G network.

7.

发明申请
VOICE SYNTHESIS DEVICE 审中-公开

公开(公告)号：US20200074981A1

公开(公告)日：2020-03-05

申请号：US16547323

申请日：2019-08-21

Applicant: LG ELECTRONICS INC.

Inventor： Jonghoon CHAE , Yongchul PARK , Siyoung YANG , Juyeong JANG , Sungmin HAN

IPC: G10L13/08 , G06F17/28 , G10L13/033 , G10L13/04 , G10L25/90 , G10L25/63

Abstract: Disclosed is a voice synthesis device. The voice synthesis device includes a database configured to store a voice and a text corresponding to the voice and a processor configured to extract characteristic information and a tone of a first-language voice stored in the database, classify an utterance style of an utterer on basis of the extracted characteristic information, generate utterer analysis information including the utterance style and the tone, translate a text corresponding to the first-language voice into a second language, and synthesize the text, translated into the second language, in a second-language voice by using the utterer analysis information.

8.

发明申请
ARTIFICIAL INTELLIGENCE APPARATUS FOR CORRECTING SYNTHESIZED SPEECH AND METHOD THEREOF 审中-公开

公开(公告)号：US20200058290A1

公开(公告)日：2020-02-20

申请号：US16660947

申请日：2019-10-23

Applicant: LG ELECTRONICS INC.

Inventor： Jonghoon CHAE , Minook KIM , Sangki KIM , Yongchul PARK , Siyoung YANG , Juyeong JANG , Sungmin HAN

IPC: G10L13/04 , G06N20/00 , G10L15/02 , G10L15/16

Abstract: Disclosed herein is an artificial intelligence apparatus includes a memory configured to store learning target text and human speech of a person who pronounces the text, a processor configured to generate synthesized speech in which the text is pronounced by synthesized sound and extract a synthesized speech feature set including information on a feature pronounced in the synthesized speech and a human speech feature set including information on a feature pronounced in the human speech, and a learning processor configured to train a speech correction model for outputting a corrected speech feature set to allow predetermined synthesized speech to be corrected based on a human pronunciation feature when a synthesized speech feature set extracted from predetermined synthesized speech is input, based on the synthesized speech feature set and the human speech feature set.

9.

发明申请
METHOD AND APPARATUS FOR PERFORMING MULTI-LANGUAGE COMMUNICATION 审中-公开

公开(公告)号：US20200043495A1

公开(公告)日：2020-02-06

申请号：US16601787

申请日：2019-10-15

Applicant: LG ELECTRONICS INC.

Inventor： Yongchul PARK , Minook KIM , Sang Ki KIM , Siyoung YANG , Juyeong JANG , Sungmin HAN

IPC: G10L15/22 , G10L13/04 , G10L15/00 , G10L15/26 , G10L15/25

Abstract: A method for performing multi-language communication includes receiving an utterance, identifying a language of the received utterance, determining whether the identified language matches a preset reference language, applying, to the received utterance, an interpretation model interpreting the identified language into the reference language when the identified language does not match the reference language, changing, to text, speech data which is outputted in the reference language as a result of applying the interpretation model, generating a response message responding to the text of the speech data, and outputting the response message. Here, the interpretation model may be a deep neural network model generated through machine learning, and the interpretation model may be stored in an edge device or provided through a server in an Internet of things environment through a 5G network.

10.

发明申请
TEXT-TO-SPEECH (TTS) METHOD AND DEVICE ENABLING MULTIPLE SPEAKERS TO BE SET 有权

公开(公告)号：US20220351714A1

公开(公告)日：2022-11-03

申请号：US16485776

申请日：2019-06-07

Applicant: LG ELECTRONICS INC.

Inventor： Siyoung YANG , Minwook KIM , Yongchul PARK , Juyeong JANG , Sungmin HAN

IPC: G10L13/08 , G06F40/279 , G06F40/143 , G10L13/047

Abstract: Disclosed is a text-to-speech (TTS) method enabling multiple speakers to be set. The present invention sets speaker information for the multiple characters with respect to a script composed to enable utterance by the multiple characters, and utilizes metadata including the speaker information corresponding to the multiple characters for speech synthesis, thereby realizing an audiobook which allows the multiple speakers to output speech utterance. In addition, the speaker information for the multiple characters may be set through Artificial Intelligence (AI) processing to thereby perform multi-speaker speech synthesis by a TTS device including an AI module.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification