CROSS-ASSISTANT COMMAND PROCESSING
    13.
    发明公开

    公开(公告)号:US20240257808A1

    公开(公告)日:2024-08-01

    申请号:US18435024

    申请日:2024-02-07

    Inventor: Robert John Mars

    Abstract: A speech-processing system may provide access to one or more virtual assistants via a voice-controlled device. A user may leverage a first virtual assistant to translate a natural language command from a first language into a second language, which the device can forward to a second virtual assistant for processing. The device may receive a command from a user and send input data representing the command to a first speech-processing system representing the first virtual assistant. The device may receive a response in the form of a first natural language output from the first speech-processing system along with an indication that the first natural language output should be directed to a second speech-processing system representing the second virtual assistant. For example, the command may be in the first language, and the first natural language output may be in the second language, which is understandable by the second speech-processing system.

    SPEECH INTERFACE DEVICE WITH CACHING COMPONENT

    公开(公告)号:US20240249725A1

    公开(公告)日:2024-07-25

    申请号:US18425465

    申请日:2024-01-29

    CPC classification number: G10L15/30 G10L15/18 H04L67/5683

    Abstract: A speech interface device is configured to receive response data from a remote speech processing system for responding to user speech. This response data may be enhanced with information such as a remote ASR result(s) and a remote NLU result(s). The response data from the remote speech processing system may include one or more cacheable status indicators associated with the NLU result(s) and/or remote directive data, which indicate whether the remote NLU result(s) and/or the remote directive data are individually cacheable. A caching component of the speech interface device allows for caching at least some of this cacheable remote speech processing information, and using the cached information locally on the speech interface device when responding to user speech in the future. This allows for responding to user speech, even when the speech interface device is unable to communicate with a remote speech processing system over a wide area network.

    SYSTEM AND METHOD FOR UPDATING LANGUAGE MODELS

    公开(公告)号:US20240242710A1

    公开(公告)日:2024-07-18

    申请号:US18302180

    申请日:2023-04-18

    CPC classification number: G10L15/065 G10L15/16 G10L15/18

    Abstract: A system for updating language models is provided. The system includes a data-storage module, a data-update module, and a model-building module. The data-storage module is used for storing multiple pieces of corpus data that corresponds to multiple categories. The data-update module is used for storing a piece of new corpus data into the data-storage module. The piece of new corpus data corresponds to one of the categories. The model-building module is used for building a plurality of classified language models, and for updating one of the classified language models based on the piece of new corpus data stored in the data-storage module. The classified language model updated corresponds to the category that corresponds to the piece of new corpus data.

    SYSTEMS AND METHODS FOR USING SILENT SPEECH IN A USER INTERACTION SYSTEM

    公开(公告)号:US20240221738A1

    公开(公告)日:2024-07-04

    申请号:US18338749

    申请日:2023-06-21

    Applicant: Wispr AI, Inc.

    CPC classification number: G10L15/22 G10L15/18

    Abstract: The techniques described herein relate to computerized methods and systems for integrating with a knowledge system. In some embodiments, a user interaction system may include a speech input device wearable on a user and configured to receive an electronic signal indicative of a user's speech muscle activation patterns when the user is speaking. In some embodiments, the electronic signal may include EMG data received from an EMG sensor on the speech input device. The system may include at least one processor configured to use a speech model and the electronic signal as input to the speech model to generate a text prompt. The at least one processor may use a knowledge system to take an action or generate a response based on the text prompt. In some embodiments, the system may provide context to the knowledge system.

    Electronic apparatus and controlling method thereof

    公开(公告)号:US12008988B2

    公开(公告)日:2024-06-11

    申请号:US17065027

    申请日:2020-10-07

    CPC classification number: G10L15/22 G10L15/18 G10L15/24

    Abstract: An electronic apparatus and a controlling method thereof are provided. The electronic apparatus includes a microphone, a camera, a memory configured to store at least one command, and at least one processor configured to, based on a first user voice being input from a user, provide a response to the first user voice, based on an audio signal including a voice being input while the response to the first user voice is provided, analyze an image captured by the camera and determine whether there is a second user voice uttered by the user in the audio signal, and based on determining that there is the second user voice uttered by the user in the audio signal, stop providing the response to the first user voice and obtain and provide a response to the second user voice.

Patent Agency Ranking