Voice recognition method and device

    公开(公告)号:US11790903B2

    公开(公告)日:2023-10-17

    申请号:US16869107

    申请日:2020-05-07

    CPC classification number: G10L15/22 G10L15/197 G10L2015/223

    Abstract: Disclosed is a voice recognition device and method. According to the disclosure, the voice recognition device, upon failing to grasp the intent of the user's utterance from the original utterance which is divided into a head utterance and a tail utterance, figures out the intent from the head utterance to thereby complete the original utterance and provides the result of voice recognition processing on the original utterance. According to an embodiment, the voice recognition device may be related to artificial intelligence (AI) modules, robots, augmented reality (AR) devices, virtual reality (VR) devices, and 5G service-related devices.

    Voice processing method based on artificial intelligence

    公开(公告)号:US11790893B2

    公开(公告)日:2023-10-17

    申请号:US17039169

    申请日:2020-09-30

    CPC classification number: G10L15/16 H04W72/1268 H04W72/23

    Abstract: A voice processing method is disclosed. The voice processing method applies first and second sentence vectors extracted from first and second utterances, that are included in one dialog group and are separated from each other, to a learning model and generates an output from which at least one word having an overlapping meaning is removed. The voice processing method can be associated with an artificial intelligence module, an unmanned aerial vehicle (UAV), a robot, an augmented reality (AR) device, a virtual reality (VR) device, devices related to 5G services, and the like.

    Artificial intelligence device and method for recognizing speech with multiple languages

    公开(公告)号:US11270700B2

    公开(公告)日:2022-03-08

    申请号:US16799727

    申请日:2020-02-24

    Abstract: An artificial intelligence device includes a microphone configured to acquire speech including a plurality of languages, and a processor configured to generate, from the speech, text data corresponding to the speech, generate a plurality of pieces of separated data acquired by separating the text data for each language, perform natural language understanding processing corresponding to a language of each of the plurality of pieces of separated data to generate a natural language understanding processing result for each of the plurality of pieces of separated data, acquire command information about a command to be instructed by the speech and slot information about an entity subjected to the command, based on the natural language understanding processing result, perform an operation corresponding to the speech based on the command information and the slot information, and generate a response based on a result of performing the operation.

Patent Agency Ranking