Electronic device and method of controlling electronic device

    公开(公告)号:US11531455B2

    公开(公告)日:2022-12-20

    申请号:US17278977

    申请日:2019-10-11

    Abstract: Provided are an electronic device capable of providing text information corresponding to a user voice through a user interface and a method of controlling the electronic device. Specifically, an electronic device according to the present disclosure, when an image including at least one object is obtained, analyzes the image to identify the at least one object included in the image, and when a user voice is received, performs voice recognition on the user voice to obtain text information corresponding to the user voice, then identifies an object corresponding to the user voice among the at least one object included in the image, and displays a memo user interface (UI) including text information on an area corresponding to the object identified as corresponding to the user voice among areas on a display.

    Electronic device and method for determining abnormal noise

    公开(公告)号:US11942105B2

    公开(公告)日:2024-03-26

    申请号:US17664025

    申请日:2022-05-18

    CPC classification number: G10L21/0232 G10L25/84 G10L2025/783

    Abstract: An electronic device includes an input device, a processor, and a memory The processor is configured to identify a first filter value of a first signal received from the input device. The processor is configured to receive a second signal after a first time elapses after the first signal is received. The processor is configured to receive a third signal after a second time elapses after the second signal is received. The processor is configured to compare a level of the second signal with a first threshold value for each of the at least one unit section of the second signal. The processor is configured to identify first information indicating that abnormal noise is present in a first section of the second signal. The processor is configured to perform filtering on the third signal based on the first filter value of the first signal according to the first information.

    ELECTRONIC APPARATUS FOR SPEECH RECOGNITION, AND CONTROLLING METHOD THEREOF

    公开(公告)号:US20230130396A1

    公开(公告)日:2023-04-27

    申请号:US17968517

    申请日:2022-10-18

    Abstract: An electronic apparatus includes a memory storing a speech recognition model and first recognition information corresponding to a first user voice obtained through the speech recognition model, the speech recognition model including a first network, a second network, and a third network; and a processor configured to: obtain a first vector by inputting voice data corresponding to a second user voice to the first network, obtain a second vector by inputting the first recognition information to the second network which generates a vector based on first weight information, and obtain second recognition information corresponding to the second user voice by inputting the first vector and the second vector to the third network which generates recognition information based on second weight information, wherein at least a part of the second weight information is the same as the first weight information.

    Electronic apparatus and control method thereof

    公开(公告)号:US11893980B2

    公开(公告)日:2024-02-06

    申请号:US17430614

    申请日:2021-06-22

    CPC classification number: G10L15/183 G06V10/255 G10L15/26 H04N21/4884

    Abstract: An electronic apparatus and a control method thereof are provided. The electronic apparatus includes a communication interface configured to receive content comprising image data and speech data; a memory configured to store a language contextual model trained with relevance between words; a display; and a processor configured to: extract an object and a character included in the image data, identify an object name of the object and the character, generate a bias keyword list comprising an image-related word that is associated with the image data, based on the identified object name and the identified character, convert the speech data to a text based on the bias keyword list and the language contextual model, and control the display to display the text that is converted from the speech data, as a caption.

    Device for recognizing speech input from user and operating method thereof

    公开(公告)号:US11074909B2

    公开(公告)日:2021-07-27

    申请号:US16913339

    申请日:2020-06-26

    Abstract: Provided are a device for recognizing a speech input including a named entity from a user and an operating method thereof. The device is configured to: generate a weighted finite state transducer model by using a vocabulary list including a plurality of named entities; obtain a first string from a speech input received from a user, by using a first decoding model; obtain a second string by using a second decoding model that uses the weighted finite state transducer model, the second string including a word sequence, which corresponds to at least one named entity, and an unrecognized word sequence not identified as a named entity; and output a text corresponding to the speech input by substituting the unrecognized word sequence of the second string with a word sequence included in the first string.

    ELECTRONIC APPARATUS AND CONTROLLING METHOD THEREOF

    公开(公告)号:US20230282208A1

    公开(公告)日:2023-09-07

    申请号:US18119007

    申请日:2023-03-08

    Abstract: Provided are an electronic apparatus and method for performing an operation based on recognizing a user's command utterance without a call word. The method includes identifying a dedicated language model related to a displayed content; receiving an utterance of a user; recognizing the received utterance and identifying candidate texts of the recognized utterance; identifying a similarity between the recognized utterance and the identified candidate texts; identifying, based on the identified dedicated language model and a predetermined threshold value, a suitability of a predetermined number of candidate texts with a high identified similarity, among the candidate texts; based on the identified suitability being outside a predetermined suitability range, ignoring the recognized utterance; and based on the identified suitability being in the predetermined suitability range, identifying a candidate text having a highest suitability, among the candidate texts, as the recognized utterance, and performing a corresponding operation.

Patent Agency Ranking