-
公开(公告)号:US11531455B2
公开(公告)日:2022-12-20
申请号:US17278977
申请日:2019-10-11
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Minkyu Shin , Sangyoon Kim , Dokyun Lee , Changwoo Han , Jonguk Yoo , Jaewon Lee
IPC: G06F3/04842 , G06V20/10 , G06F3/04883 , G10L15/26
Abstract: Provided are an electronic device capable of providing text information corresponding to a user voice through a user interface and a method of controlling the electronic device. Specifically, an electronic device according to the present disclosure, when an image including at least one object is obtained, analyzes the image to identify the at least one object included in the image, and when a user voice is received, performs voice recognition on the user voice to obtain text information corresponding to the user voice, then identifies an object corresponding to the user voice among the at least one object included in the image, and displays a memo user interface (UI) including text information on an area corresponding to the object identified as corresponding to the user voice among areas on a display.
-
公开(公告)号:US11942105B2
公开(公告)日:2024-03-26
申请号:US17664025
申请日:2022-05-18
Applicant: Samsung Electronics Co., Ltd.
Inventor: Hoseon Shin , Chulmin Lee , Changwoo Han
IPC: G10L21/0232 , G10L25/78 , G10L25/84
CPC classification number: G10L21/0232 , G10L25/84 , G10L2025/783
Abstract: An electronic device includes an input device, a processor, and a memory The processor is configured to identify a first filter value of a first signal received from the input device. The processor is configured to receive a second signal after a first time elapses after the first signal is received. The processor is configured to receive a third signal after a second time elapses after the second signal is received. The processor is configured to compare a level of the second signal with a first threshold value for each of the at least one unit section of the second signal. The processor is configured to identify first information indicating that abnormal noise is present in a first section of the second signal. The processor is configured to perform filtering on the third signal based on the first filter value of the first signal according to the first information.
-
公开(公告)号:US11881211B2
公开(公告)日:2024-01-23
申请号:US17189710
申请日:2021-03-02
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Changwoo Han , Kwangyoun Kim , Chanwoo Kim , Kyungmin Lee , Youngho Han
CPC classification number: G10L15/18 , G10L15/063 , G10L15/26
Abstract: Disclosed are an electronic device and a method of controlling the electronic device. An electronic device according to an embodiment may perform a method comprising: performing natural language understanding for a first text included in learning data, obtaining first information associated with a speech corresponding to the first text being uttered based on a result of the natural language understanding, obtain second information associated with an acoustic feature corresponding to the speech corresponding to the first text being uttered based on the first information, obtaining a plurality of speech signals corresponding to the first text by converting a first speech signal corresponding to the first text based on the first information and the second information, and training a speech recognition model based on the plurality of obtained speech signals and the first text.
-
公开(公告)号:US20230130396A1
公开(公告)日:2023-04-27
申请号:US17968517
申请日:2022-10-18
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jinhwan PARK , Sungsoo Kim , Sichen Jin , Junmo Park , Dhairya Sandhyana , Changwoo Han
Abstract: An electronic apparatus includes a memory storing a speech recognition model and first recognition information corresponding to a first user voice obtained through the speech recognition model, the speech recognition model including a first network, a second network, and a third network; and a processor configured to: obtain a first vector by inputting voice data corresponding to a second user voice to the first network, obtain a second vector by inputting the first recognition information to the second network which generates a vector based on first weight information, and obtain second recognition information corresponding to the second user voice by inputting the first vector and the second vector to the third network which generates recognition information based on second weight information, wherein at least a part of the second weight information is the same as the first weight information.
-
公开(公告)号:US11984122B2
公开(公告)日:2024-05-14
申请号:US17425560
申请日:2021-06-18
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Youngho Han , Sangyoon Kim , Aahwan Kudumula , Kyungmin Lee , Donguk Jung , Changwoo Han
IPC: G10L15/08 , G06F21/31 , G06V30/19 , G06V30/262 , G10L15/18 , G10L15/22 , G10L15/26 , G10L15/187 , G10L15/19
CPC classification number: G10L15/22 , G06F21/31 , G06V30/19093 , G06V30/262 , G10L15/1815 , G10L15/26 , G10L15/187 , G10L15/19 , G10L2015/221 , G10L2015/223
Abstract: Disclosed is a method of controlling an electronic apparatus. The method of controlling an electronic apparatus includes: displaying a screen including an input area configured to receive a text, receiving a speech and obtaining a text corresponding to the speech, performing a service operation corresponding to the input area by inputting the obtained text to the input area, and based on a result of performing the service operation, obtaining a plurality of similar texts including a similar pronunciation with the obtained text, and repeatedly performing the service operation by sequentially inputting the plurality of obtained similar texts to the input area.
-
公开(公告)号:US11893980B2
公开(公告)日:2024-02-06
申请号:US17430614
申请日:2021-06-22
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Sichen Jin , Kwangyoun Kim , Sungsoo Kim , Junmo Park , Dhairya Sandhyana , Changwoo Han
IPC: G10L15/183 , H04N21/488 , G06V10/20 , G10L15/26
CPC classification number: G10L15/183 , G06V10/255 , G10L15/26 , H04N21/4884
Abstract: An electronic apparatus and a control method thereof are provided. The electronic apparatus includes a communication interface configured to receive content comprising image data and speech data; a memory configured to store a language contextual model trained with relevance between words; a display; and a processor configured to: extract an object and a character included in the image data, identify an object name of the object and the character, generate a bias keyword list comprising an image-related word that is associated with the image data, based on the identified object name and the identified character, convert the speech data to a text based on the bias keyword list and the language contextual model, and control the display to display the text that is converted from the speech data, as a caption.
-
公开(公告)号:US11804241B2
公开(公告)日:2023-10-31
申请号:US17578184
申请日:2022-01-18
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jubum Han , Changwoo Han
CPC classification number: G10L25/78 , G10L15/16 , G10L25/54 , H04M1/6066 , H04M1/724097 , H04R1/1016 , H04W4/024 , H04M2201/40
Abstract: An electronic apparatus and a controlling method thereof are provided. The controlling method includes, based on an audio signal being received through a microphone, determining whether a user is on a public transport; detecting whether the audio signal includes a voice signal output through an acoustic device of the public transport; determining whether the voice signal from the acoustic device includes a voice signal for guiding at least one stop from among a plurality of stops; and outputting information on the at least one stop.
-
公开(公告)号:US11074909B2
公开(公告)日:2021-07-27
申请号:US16913339
申请日:2020-06-26
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Kyungmin Lee , Youngho Han , Sangyoon Kim , Donguk Jung , Aahwan Kudumula , Changwoo Han
IPC: G10L15/197 , G10L15/04 , G10L15/26
Abstract: Provided are a device for recognizing a speech input including a named entity from a user and an operating method thereof. The device is configured to: generate a weighted finite state transducer model by using a vocabulary list including a plurality of named entities; obtain a first string from a speech input received from a user, by using a first decoding model; obtain a second string by using a second decoding model that uses the weighted finite state transducer model, the second string including a word sequence, which corresponds to at least one named entity, and an unrecognized word sequence not identified as a named entity; and output a text corresponding to the speech input by substituting the unrecognized word sequence of the second string with a word sequence included in the first string.
-
公开(公告)号:US12026666B2
公开(公告)日:2024-07-02
申请号:US18084024
申请日:2022-12-19
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Minkyu Shin , Sangyoon Kim , Dokyun Lee , Changwoo Han , Jonguk Yoo , Jaewon Lee
IPC: G06Q10/087 , G06F3/04842 , G06F3/04883 , G06V10/764 , G06V20/10 , G10L15/26
CPC classification number: G06Q10/087 , G06F3/04842 , G06F3/04883 , G06V10/764 , G06V20/10 , G10L15/26
Abstract: Provided are an electronic device capable of providing text information corresponding to a user voice through a user interface and a method of controlling the electronic device. Specifically, an electronic device according to the present disclosure, when an image including at least one object is obtained, analyzes the image to identify the at least one object included in the image, and when a user voice is received, performs voice recognition on the user voice to obtain text information corresponding to the user voice, then identifies an object corresponding to the user voice among the at least one object included in the image, and displays a memo user interface (UI) including text information on an area corresponding to the object identified as corresponding to the user voice among areas on a display.
-
公开(公告)号:US20230282208A1
公开(公告)日:2023-09-07
申请号:US18119007
申请日:2023-03-08
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Kyungmin LEE , Sangyoon Kim , Hyunsik Kim , Aahwan Kudumula , Youngho Han , Changwoo Han
IPC: G10L15/197 , G10L15/22 , G06F3/16 , G10L15/30
CPC classification number: G10L15/197 , G10L15/22 , G06F3/167 , G10L15/30 , G10L2015/223
Abstract: Provided are an electronic apparatus and method for performing an operation based on recognizing a user's command utterance without a call word. The method includes identifying a dedicated language model related to a displayed content; receiving an utterance of a user; recognizing the received utterance and identifying candidate texts of the recognized utterance; identifying a similarity between the recognized utterance and the identified candidate texts; identifying, based on the identified dedicated language model and a predetermined threshold value, a suitability of a predetermined number of candidate texts with a high identified similarity, among the candidate texts; based on the identified suitability being outside a predetermined suitability range, ignoring the recognized utterance; and based on the identified suitability being in the predetermined suitability range, identifying a candidate text having a highest suitability, among the candidate texts, as the recognized utterance, and performing a corresponding operation.
-
-
-
-
-
-
-
-
-