Speech recognition using multiple voice-enabled devices

    Publication No.: US11699444B1

    Publication date: 2023-07-11

    Application No.: US17078954

    Filing date: 2020-10-23

    CPC classification number: G10L15/34 G06F40/20 G10L15/22 G10L15/30

    Abstract: Techniques are described for using multiple voice-enabled devices in a user environment to reduce the latency of obtaining responses to user utterances from a remote system. Each voice-enabled device may establish a connection with the remote system so that the remote system can perform supplemental speech processing for utterances the device is unable to process locally. One voice-enabled device may have a higher-latency connection to the remote system, and another may have a lower-latency connection. The lower-latency device may send an utterance to the remote system before the higher-latency device is able to, so the remote system may begin processing the utterance sooner than if only the higher-latency device had sent it. The remote system may then provide a response to the higher-latency device in less time than if it had to wait for the utterance from the higher-latency device.
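
The timing argument in this abstract can be illustrated with a small model. This is only a sketch under stated assumptions, not the patented method: the `Device` class, the `response_time_ms` helper, the device names, and all latency and processing figures are hypothetical, and each connection is assumed to have a single symmetric one-way delay.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Device:
    name: str
    latency_ms: float  # hypothetical one-way delay between device and remote system


def response_time_ms(target: Device, senders: list[Device], processing_ms: float) -> float:
    """Time from end of utterance until `target` receives the response.

    Assumes the remote system starts processing as soon as the first copy
    of the utterance arrives, whichever device sent it.
    """
    first_arrival_ms = min(d.latency_ms for d in senders)
    return first_arrival_ms + processing_ms + target.latency_ms


slow = Device("device-a", latency_ms=400.0)  # higher-latency connection
fast = Device("device-b", latency_ms=50.0)   # lower-latency connection

# Only the slow device sends the utterance.
alone_ms = response_time_ms(slow, [slow], processing_ms=300.0)
# Both devices send it; the response still goes to the slow device.
assisted_ms = response_time_ms(slow, [slow, fast], processing_ms=300.0)

print(f"slow device alone: {alone_ms:.0f} ms; with fast device: {assisted_ms:.0f} ms")
```

In this model the saving equals the difference between the two uplink delays, since the return path to the higher-latency device is unchanged.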

    Speech recognition using multiple voice-enabled devices

    Publication No.: US12125489B1

    Publication date: 2024-10-22

    Application No.: US18219411

    Filing date: 2023-07-07

    CPC classification number: G10L15/34 G06F40/20 G10L15/22 G10L15/30

    Abstract: Techniques are described for using multiple voice-enabled devices in a user environment to reduce the latency of obtaining responses to user utterances from a remote system. Each voice-enabled device may establish a connection with the remote system so that the remote system can perform supplemental speech processing for utterances the device is unable to process locally. One voice-enabled device may have a higher-latency connection to the remote system, and another may have a lower-latency connection. The lower-latency device may send an utterance to the remote system before the higher-latency device is able to, so the remote system may begin processing the utterance sooner than if only the higher-latency device had sent it. The remote system may then provide a response to the higher-latency device in less time than if it had to wait for the utterance from the higher-latency device.

    Predicting on-device command execution

    Publication No.: US12046234B1

    Publication date: 2024-07-23

    Application No.: US17359932

    Filing date: 2021-06-28

    CPC classification number: G10L15/22 G10L15/183 G10L2015/223

    Abstract: Some natural language command processing systems may handle certain commands on a user device rather than sending input to another system for processing. Such a system may include an arbitration component for arbitrating between on-device and system processing. The arbitration component may execute in the system and render a device-specific decision as to whether the device will be able to process the input and/or execute the command, based on information known to the system about the device's capabilities. If the arbitration component predicts that the device will not be able to execute the command, the system may execute the command without waiting for a signal from the device. If the arbitration component predicts that the device will be able to execute the command, the system may halt its own processing to prevent duplicate execution.
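
The two-way decision described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the `DeviceProfile` and `SystemAction` types, the `arbitrate` function, and the intent names are all hypothetical, and the "prediction" is reduced to a lookup in a set of intents the system believes the device can handle locally.

```python
from dataclasses import dataclass
from enum import Enum


class SystemAction(Enum):
    EXECUTE_NOW = "execute_now"  # do not wait for a signal from the device
    HALT = "halt"                # device is expected to run the command itself


@dataclass(frozen=True)
class DeviceProfile:
    device_id: str
    # Hypothetical record of what the system knows the device can do locally.
    local_intents: frozenset[str]


def arbitrate(profile: DeviceProfile, intent: str) -> SystemAction:
    """System-side, device-specific decision for one recognized intent."""
    if intent in profile.local_intents:
        # Predicted on-device execution: halt to avoid running the command twice.
        return SystemAction.HALT
    # Predicted on-device failure: proceed immediately.
    return SystemAction.EXECUTE_NOW


speaker = DeviceProfile("speaker-1", frozenset({"set_timer", "volume_up"}))
decisions = {intent: arbitrate(speaker, intent) for intent in ("set_timer", "play_music")}
for intent, action in decisions.items():
    print(f"{intent}: {action.value}")
```

A real arbitration component would presumably weigh richer signals than a static capability list; the lookup here only stands in for that prediction step.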
