Patent search ap:("LG Electronics Inc.") AND inv:"Sangki KIM" Page 2

11.

发明公开
SPEECH SYNTHESIS DEVICE AND SPEECH SYNTHESIS METHOD 审中-公开

公开(公告)号：US20230148275A1

公开(公告)日：2023-05-11

申请号：US17959050

申请日：2022-10-03

Applicant: LG ELECTRONICS INC.

Inventor： Sangki KIM , Sungmin HAN , Siyoung YANG

IPC: G10L13/047 , G10L25/30

CPC classification number: G10L13/047 , G10L25/30

Abstract: Provided is a speech synthetic device capable of outputting a synthetic voice having various speech styles. The speech synthesis device includes a speaker, and a processor to acquire voice feature information through a text and a user input; generate a synthetic voice, by receiving the text and the voice feature information inputs into a decoder supervised-trained to minimize a difference between feature information of a learning text and characteristic information of a learning voice, and output the generated synthetic voice through the speaker.

12.

发明申请
ARTIFICIAL INTELLIGENCE DEVICE AND METHOD FOR SYNTHESIZING SPEECH BY CONTROLLING SPEECH STYLE 有权

公开(公告)号：US20210174782A1

公开(公告)日：2021-06-10

申请号：US16803941

申请日：2020-02-27

Applicant: LG ELECTRONICS INC.

Inventor： Minook KIM , Yongchul PARK , Sungmin HAN , Siyoung YANG , Sangki KIM , Juyeong JANG

IPC: G10L13/10 , G10L13/047 , G06N20/00 , G06N5/04

Abstract: An artificial intelligence device includes a memory and a processor. The memory is configured to store audio data having a predetermined speech style. The processor is configured to generate a condition vector relating to a condition for determining the speech style of the audio data, reduce a dimension of the condition vector to a predetermined reduction dimension, acquire a sparse code vector based on a dictionary vector acquired through sparse dictionary coding with respect to the condition vector having the predetermined reduction dimension, and change a vector element value included in the sparse code vector.

13.

发明申请
SPEECH SYNTHESIS IN NOISY ENVIRONMENT 有权

公开(公告)号：US20210134262A1

公开(公告)日：2021-05-06

申请号：US17029582

申请日：2020-09-23

Applicant: LG ELECTRONICS INC.

Inventor： Minook KIM , Yongchul PARK , Sungmin HAN , Siyoung YANG , Sangki KIM , Juyeong JANG

IPC: G10L13/033 , G10L15/02 , G10L13/047 , G10L25/51 , G10L15/24

Abstract: Disclosed is speech synthesis in a noisy environment. According to an embodiment of the disclosure, a method of speech synthesis may generate a Lombard effect-applied synthesized speech using a feature vector generated from an utterance feature. According to the disclosure, the speech synthesis method and device may be related to artificial intelligence (AI) modules, unmanned aerial vehicles (UAVs), robots, augmented reality (AR) devices, virtual reality (VR) devices, and 5G service-related devices.

14.

发明申请
SPEECH SYNTHESIS METHOD AND APPARATUS BASED ON EMOTION INFORMATION 审中-公开

公开(公告)号：US20200035215A1

公开(公告)日：2020-01-30

申请号：US16593161

申请日：2019-10-04

Applicant: LG Electronics Inc.

Inventor： Siyoung YANG , Minook KIM , Sangki KIM , Yongchul PARK , Juyeong JANG , Sungmin HAN

IPC: G10L13/02 , G10L25/63 , G10L25/30 , G10L15/30 , G06F17/16

Abstract: A speech synthesis method and apparatus based on emotion information are disclosed. A speech synthesis method based on emotion information extracts speech synthesis target text from received data and determines whether the received data includes situation explanation information. First metadata corresponding to first emotion information is generated on the basis of the situation explanation information. When the extracted data does not include situation explanation information, second metadata corresponding to second emotion information generated on the basis of semantic analysis and context analysis is generated. One of the first metadata and the second metadata is added to the speech synthesis target text to synthesize speech corresponding to the extracted data. A speech synthesis apparatus of this disclosure may be associated with an artificial intelligence module, drone (unmanned aerial vehicle, UAV), robot, augmented reality (AR) devices, virtual reality (VR) devices, devices related to 5G services, and the like.

15.

发明申请
ARTIFICIAL INTELLIGENCE (AI)-BASED VOICE SAMPLING APPARATUS AND METHOD FOR PROVIDING SPEECH STYLE 审中-公开

公开(公告)号：US20200005763A1

公开(公告)日：2020-01-02

申请号：US16561410

申请日：2019-09-05

Applicant: LG ELECTRONICS INC.

Inventor： Jonghoon CHAE , Minook KIM , Sangki KIM , Yongchul PARK , Siyoung YANG , Juyeong JANG , Sungmin HAN

IPC: G10L13/08 , G10L15/02 , G10L13/047 , G10L13/033

Abstract: Disclosed is an artificial intelligence (AI)-based voice sampling apparatus for providing a speech style, including a rhyme encoder configured to receive a user's voice, extract a voice sample, and analyze a vocal feature included in the voice sample, a text encoder configured to receive text for reflecting the vocal feature, a processor configured to classify the vocal feature of the voice sample input to the rhyme encoder according to a label, extract an embedding vector representing the vocal feature from the label, and generate a speech style from the embedding vector and apply the generated speech style to the text, and a rhyme decoder configured to output synthesized voice data in which the speech style is applied to the text by the processor.

16.

发明申请
METHOD FOR PROVIDING VOICE SYNTHESIS SERVICE AND SYSTEM THEREFOR 有权

公开(公告)号：US20250006177A1

公开(公告)日：2025-01-02

申请号：US18708348

申请日：2022-10-20

Applicant: LG ELECTRONICS INC.

Inventor： Siyoung YANG , Sangki KIM , Sungmin HAN

IPC: G10L13/08 , G10L13/033

Abstract: A method for providing a voice synthesis service and a system therefor are disclosed. A method of providing a voice synthesis service according to at least one of various embodiments of the present disclosure may comprise the steps of: receiving sound source data for synthesizing a voice of a speaker for a plurality of predefined first texts through a voice synthesis service platform that provides a development toolkit; performing tone conversion training on the sound source data of the speaker using a pre-generated tone conversion base model; generating a voice synthesis model for the speaker through the voice conversion training; receiving a second text; generating a voice synthesis model through voice synthesis inference on the basis of the voice synthesis model for the speaker and the second text; and generating a synthesized voice using the voice synthesis model.

17.

发明申请
DEVICE, SYSTEM AND METHOD FOR CONTROLLING A PLURALITY OF VOICE RECOGNITION DEVICES 有权

公开(公告)号：US20210233537A1

公开(公告)日：2021-07-29

申请号：US16917784

申请日：2020-06-30

Applicant: LG ELECTRONICS INC.

Inventor： Siyoung YANG , Yongchul PARK , Sungmin HAN , Sangki KIM , Juyeong JANG , Minook KIM

IPC: G10L15/32 , G10L15/30 , G10L15/22 , G16Y40/35

Abstract: Disclosed is a device for controlling a plurality of voice recognition devices for determining and selecting a first voice recognition device that a user wants to use based on a point in time when the voice of the user is spoken or a place where the user spoke the voice. The device for controlling a plurality of voice recognition devices according to the present disclosure may be associated with an artificial intelligence module, a robot, an augmented reality (AR) device, a virtual reality (VR) device, a device related to 5G service, etc.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification