-
公开(公告)号:US11721334B2
公开(公告)日:2023-08-08
申请号:US16810013
申请日:2020-03-05
Applicant: LG ELECTRONICS INC.
Inventor: Jong Hoon Chae , Minook Kim , Yongchul Park , Sungmin Han , Siyoung Yang , Sangki Kim , Juyeong Jang
CPC classification number: G10L15/22 , G06N3/04 , G10L15/02 , G10L15/16 , G10L15/187 , G10L25/18 , G10L25/84 , G10L25/90 , G10L2015/223
Abstract: A method and apparatus for controlling a device according to an embodiment of the present disclosure may be based on a speech feature of a user reflecting the Lombard effect so as to operate a device located far away from the user, among a plurality of electronic devices. As such, even when the user calls a device located far away from the user without any separate context information, speech recognition neural networks and weight calculation neural networks may be selected and used to operate the device located far away from the user, and reception of a speech signal of the user calling a device located far away from the user may be performed in an Internet of Things (IoT) environment using a 5G network.
-
公开(公告)号:US11538073B2
公开(公告)日:2022-12-27
申请号:US16842617
申请日:2020-04-07
Applicant: LG ELECTRONICS INC.
Inventor: Siyoung Yang , Yongchul Park , Sungmin Han , Sangki Kim , Juyeong Jang , Minook Kim
IPC: G06F40/35 , G06Q30/02 , H04L51/02 , G06F16/332
Abstract: An electronic device is disclosed. The electronic device includes a memory and a processor. The electronic device may execute an artificial intelligence (AI) algorithm and/or a machine learning algorithm, and perform communications with other electronic devices in a 5G communication network. Accordingly, user convenience can be significantly improved.
-
公开(公告)号:US11521621B2
公开(公告)日:2022-12-06
申请号:US17028527
申请日:2020-09-22
Applicant: LG ELECTRONICS INC.
Inventor: Siyoung Yang , Yongchul Park , Sungmin Han , Sangki Kim , Juyeong Jang , Minook Kim
Abstract: Disclosed is gathering a user's speech samples. According to an embodiment of the disclosure, a method of gathering learning samples may gather a speaker's speech data obtained while talking on a mobile terminal and text data generated from the speech data and gather training data for generating a speech synthesis model. According to the disclosure, the method of gathering learning samples may be related to artificial intelligence (AI) modules, unmanned aerial vehicles (UAVs), robots, augmented reality (AR) devices, virtual reality (VR) devices, and 5G service-related devices.
-
公开(公告)号:US11721345B2
公开(公告)日:2023-08-08
申请号:US16917784
申请日:2020-06-30
Applicant: LG ELECTRONICS INC.
Inventor: Siyoung Yang , Yongchul Park , Sungmin Han , Sangki Kim , Juyeong Jang , Minook Kim
CPC classification number: G10L15/32 , G10L15/22 , G10L15/30 , G16Y40/35 , G10L2015/228
Abstract: Disclosed is a device for controlling a plurality of voice recognition devices for determining and selecting a first voice recognition device that a user wants to use based on a point in time when the voice of the user is spoken or a place where the user spoke the voice. The device for controlling a plurality of voice recognition devices according to the present disclosure may be associated with an artificial intelligence module, a robot, an augmented reality (AR) device, a virtual reality (VR) device, a device related to 5G service, etc.
-
5.
公开(公告)号:US11107456B2
公开(公告)日:2021-08-31
申请号:US16561410
申请日:2019-09-05
Applicant: LG ELECTRONICS INC.
Inventor: Jonghoon Chae , Minook Kim , Sangki Kim , Yongchul Park , Siyoung Yang , Juyeong Jang , Sungmin Han
IPC: G10L15/02 , G10L13/08 , G10L13/033 , G10L13/047
Abstract: Discussed is an artificial intelligence (AI)-based voice sampling apparatus for providing a speech style, including a rhyme encoder configured to receive a user's voice, extract a voice sample, and analyze a vocal feature included in the voice sample, a text encoder configured to receive text for reflecting the vocal feature, a processor configured to classify the vocal feature of the voice sample input to the rhyme encoder according to a label, extract an embedding vector representing the vocal feature from the label, and generate a speech style from the embedding vector and apply the generated speech style to the text, and a rhyme decoder configured to output synthesized voice data in which the speech style is applied to the text by the processor.
-
6.
公开(公告)号:US09565286B2
公开(公告)日:2017-02-07
申请号:US14971748
申请日:2015-12-16
Applicant: LG ELECTRONICS INC.
Inventor: Minjoo Chung , Minjeong Shin , Chul Park , Jinho Cho , Kiwoong Kim , Sangki Kim
IPC: H04B7/00 , H04M1/725 , H04W76/02 , H04B1/3827
CPC classification number: H04M1/72527 , G06F3/011 , G06F3/017 , G06F3/0334 , H04B1/385 , H04W76/14
Abstract: A mobile terminal including a display unit; a wireless communication unit configured to wirelessly communicate with an insole in a shoe and an external device; and a controller configured to receive a foot gesture input signal from the insole for controlling the mobile terminal or the external device, control the function on the mobile according to the foot gesture input signal in response to the foot gesture input signal being mapped a function of the mobile terminal, and transmit a control signal to the external device to control the function on the external device in response the foot gesture input signal being mapped the function of the external device.
Abstract translation: 一种包括显示单元的移动终端; 无线通信单元,被配置为与鞋内的鞋垫和外部设备无缝地通信; 以及控制器,被配置为从所述鞋垫接收用于控制所述移动终端或所述外部设备的脚踏手势输入信号,响应于所述脚踏手势输入信号映射了所述手势输入信号的功能,根据所述脚踏手势输入信号来控制所述移动设备上的功能 移动终端,并且响应于脚踏手势输入信号映射到外部设备的功能,向外部设备发送控制信号以控制外部设备上的功能。
-
7.
公开(公告)号:US11721319B2
公开(公告)日:2023-08-08
申请号:US16803941
申请日:2020-02-27
Applicant: LG ELECTRONICS INC.
Inventor: Minook Kim , Yongchul Park , Sungmin Han , Siyoung Yang , Sangki Kim , Juyeong Jang
IPC: G10L13/10 , G06N5/04 , G10L13/047 , G06N20/00
CPC classification number: G10L13/10 , G06N5/04 , G06N20/00 , G10L13/047
Abstract: An artificial intelligence device includes a memory and a processor. The memory is configured to store audio data having a predetermined speech style. The processor is configured to generate a condition vector relating to a condition for determining the speech style of the audio data, reduce a dimension of the condition vector to a predetermined reduction dimension, acquire a sparse code vector based on a dictionary vector acquired through sparse dictionary coding with respect to the condition vector having the predetermined reduction dimension, and change a vector element value included in the sparse code vector.
-
8.
公开(公告)号:US11636845B2
公开(公告)日:2023-04-25
申请号:US16928815
申请日:2020-07-14
Applicant: LG ELECTRONICS INC.
Inventor: Siyoung Yang , Yongchul Park , Sungmin Han , Sangki Kim , Juyeong Jang , Minook Kim
IPC: G10L13/00 , G10L13/08 , G10L13/10 , G10L13/033 , G10L13/04
Abstract: A method includes generating first synthesized speech by using text and a first emotion vector configured for the text, extracting a second emotion vector included in the first synthesized speech, determining whether correction of the second emotion information vector is needed by comparing a loss value calculated by using the first emotion information vector and the second emotion information vector with a preconfigured threshold, re-performing speech synthesis by using a third emotion information vector generated by correcting the second emotion information vector, and outputting the generated synthesized speech, thereby configuring emotion information of speech in a more effective manner. A speech synthesis apparatus may be associated with an artificial intelligence module, drone (unmanned aerial vehicle, UAV), robot, augmented reality (AR) devices, virtual reality (VR) devices, devices related to 5G services, and the like.
-
公开(公告)号:US11074904B2
公开(公告)日:2021-07-27
申请号:US16593161
申请日:2019-10-04
Applicant: LG Electronics Inc.
Inventor: Siyoung Yang , Minook Kim , Sangki Kim , Yongchul Park , Juyeong Jang , Sungmin Han
Abstract: A speech synthesis method and apparatus based on emotion information are disclosed. A speech synthesis method based on emotion information extracts speech synthesis target text from received data and determines whether the received data includes situation explanation information. First metadata corresponding to first emotion information is generated on the basis of the situation explanation information. When the extracted data does not include situation explanation information, second metadata corresponding to second emotion information generated on the basis of semantic analysis and context analysis is generated. One of the first metadata and the second metadata is added to the speech synthesis target text to synthesize speech corresponding to the extracted data. A speech synthesis apparatus of this disclosure may be associated with an artificial intelligence module, drone (unmanned aerial vehicle, UAV), robot, augmented reality (AR) devices, virtual reality (VR) devices, devices related to 5G services, and the like.
-
公开(公告)号:US20210210067A1
公开(公告)日:2021-07-08
申请号:US17088480
申请日:2020-11-03
Applicant: LG ELECTRONICS INC.
Inventor: Hwansik YUN , Wonho Shin , Yongchul Park , Sungmin Han , Siyoung Yang , Sangki Kim , Juyeong Jang , Minook Kim
Abstract: A voice recognition device and a method for learning voice data using the same are disclosed. The voice recognition device combines feature information for various speakers with a text-to-speech function to generate voice data recognized by a voice recognition unit, and can improve voice recognition efficiency by allowing the voice recognition unit itself to learn various voice data. The voice recognition device can be associated with an artificial intelligence module, a robot, an augmented reality (AR) device, a virtual reality (VR) device, devices related to 5G services, and the like.
-
-
-
-
-
-
-
-
-