Speech recognition biasing
    1.
    Granted invention patent

    Publication No.: US11670300B2

    Publication date: 2023-06-06

    Application No.: US17811425

    Filing date: 2022-07-08

    Applicant: X Development LLC

    Inventor: Daniel Alex Lam

    Abstract: Systems and methods are described that include a robot and/or an associated computing system that can use various cues about an environment of the robot to apply a bias to increase the accuracy of speech transcription. In some implementations, audio data corresponding to a spoken instruction to a robot is received. Candidate transcriptions of the audio data are obtained. A respective action of the robot corresponding to each of the candidate transcriptions of the audio data is determined. One or more scores indicating characteristics of a potential outcome of performing the respective action corresponding to the candidate transcription of the audio data are determined for each of the candidate transcriptions of the audio data. A particular candidate transcription is selected from among the candidate transcriptions based at least on the one or more scores. The action determined for the particular candidate transcription is performed.
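
The selection step in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `Candidate` type, the `action_for` mapping, the `feasible` scorer, and the rule of adding a weighted outcome score to a recognizer confidence are all hypothetical choices made for illustration. The abstract only says that selection is based at least on the outcome scores.

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple

Action = Tuple[str, str]  # hypothetical action encoding: (verb, target)


@dataclass(frozen=True)
class Candidate:
    text: str
    asr_confidence: float  # assumed recognizer confidence in [0, 1]


def select_transcription(
    candidates: Sequence[Candidate],
    action_for: Callable[[str], Action],
    outcome_scorers: Sequence[Callable[[Action], float]],
    outcome_weight: float = 1.0,
) -> Tuple[Candidate, Action]:
    """Pick the candidate whose action has the best combined score.

    The additive combination is an assumption made for this sketch.
    """
    def combined(candidate: Candidate) -> float:
        action = action_for(candidate.text)
        outcome = sum(scorer(action) for scorer in outcome_scorers)
        return candidate.asr_confidence + outcome_weight * outcome

    best = max(candidates, key=combined)
    return best, action_for(best.text)


# Hypothetical environment cue: places the robot has on its map.
KNOWN_PLACES = {"kitchen"}


def action_for(text: str) -> Action:
    # Toy mapping: treat the last word as the navigation target.
    return ("navigate", text.rsplit(" ", 1)[-1])


def feasible(action: Action) -> float:
    # Score 1.0 when the target is a place the robot knows, else 0.0.
    return 1.0 if action[1] in KNOWN_PLACES else 0.0


candidates = [
    Candidate("go to the chicken", 0.55),
    Candidate("go to the kitchen", 0.45),
]
best, chosen_action = select_transcription(candidates, action_for, [feasible])
```

Here the recognizer slightly prefers "go to the chicken", but the feasibility score shifts the choice to "go to the kitchen" because only that action has a plausible outcome in the assumed environment.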

    Multi-tiered command processing
    2.
    Granted invention patent

    Publication No.: US10896675B1

    Publication date: 2021-01-19

    Application No.: US16024350

    Filing date: 2018-06-29

    Applicant: X Development LLC

    Inventor: Daniel Alex Lam

    Abstract: Methods, systems, and apparatus, including computer programs stored on computer storage devices, for multi-tiered command processing are disclosed. One of the methods includes operating an audio processing system to detect voice commands in a set of first commands controlling behavior of the robot and in a set of second commands controlling behavior of the robot, wherein the system detects commands in the set of first commands using a local recognition model that does not require communication over a network, and detects commands in the set of second commands with a server over a network; executing one or more applications to respond to commands in a set of application commands that include commands different from the first commands and second commands; and detecting and carrying out commands, according to a predetermined hierarchy that prioritizes detection of the first commands first, the second commands second, and the application commands third.
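
The predetermined hierarchy described in this abstract can be sketched as an ordered list of detectors that are tried in turn. This is only an illustration under stated assumptions: the `Detector` type, the `make_detector` helper, and the toy vocabularies are hypothetical, the "audio" is a byte string standing in for real audio, and the sequential fall-through is one possible reading of "prioritizes detection", not the patent's stated mechanism.

```python
from typing import Callable, Optional, Sequence, Set, Tuple

# A detector returns the recognized command, or None if it finds nothing.
Detector = Callable[[bytes], Optional[str]]


def detect_command(
    audio: bytes, tiers: Sequence[Tuple[str, Detector]]
) -> Optional[Tuple[str, str]]:
    """Try each tier in priority order and return the first match."""
    for tier_name, detector in tiers:
        command = detector(audio)
        if command is not None:
            return tier_name, command
    return None


def make_detector(vocabulary: Set[str]) -> Detector:
    # Stand-in for a real recognizer: decodes the bytes as text and
    # checks membership in a fixed vocabulary (purely illustrative).
    def detect(audio: bytes) -> Optional[str]:
        text = audio.decode("utf-8")
        return text if text in vocabulary else None
    return detect


# Hypothetical command sets, listed in the priority order from the abstract.
tiers = [
    ("local", make_detector({"stop"})),                # on-device model
    ("server", make_detector({"go to the kitchen"})),  # network recognizer
    ("application", make_detector({"stop", "play music"})),
]

result_stop = detect_command(b"stop", tiers)
result_music = detect_command(b"play music", tiers)
result_unknown = detect_command(b"hello", tiers)
```

Because "stop" appears in both the local and the application vocabularies, the local tier handles it; the application tier is reached only when the first two tiers report nothing.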

    Speech Recognition Biasing
    3.
    Invention patent application

    Publication No.: US20220343910A1

    Publication date: 2022-10-27

    Application No.: US17811425

    Filing date: 2022-07-08

    Applicant: X Development LLC

    Inventor: Daniel Alex Lam

    Abstract: Systems and methods are described that include a robot and/or an associated computing system that can use various cues about an environment of the robot to apply a bias to increase the accuracy of speech transcription. In some implementations, audio data corresponding to a spoken instruction to a robot is received. Candidate transcriptions of the audio data are obtained. A respective action of the robot corresponding to each of the candidate transcriptions of the audio data is determined. One or more scores indicating characteristics of a potential outcome of performing the respective action corresponding to the candidate transcription of the audio data are determined for each of the candidate transcriptions of the audio data. A particular candidate transcription is selected from among the candidate transcriptions based at least on the one or more scores. The action determined for the particular candidate transcription is performed.

    Speech recognition biasing
    4.
    Granted invention patent

    Publication No.: US11417333B1

    Publication date: 2022-08-16

    Application No.: US16878822

    Filing date: 2020-05-20

    Applicant: X Development LLC

    Inventor: Daniel Alex Lam

    Abstract: Systems and methods are described that include a robot and/or an associated computing system that can use various cues about an environment of the robot to apply a bias to increase the accuracy of speech transcription. In some implementations, audio data corresponding to a spoken instruction to a robot is received. Candidate transcriptions of the audio data are obtained. A respective action of the robot corresponding to each of the candidate transcriptions of the audio data is determined. One or more scores indicating characteristics of a potential outcome of performing the respective action corresponding to the candidate transcription of the audio data are determined for each of the candidate transcriptions of the audio data. A particular candidate transcription is selected from among the candidate transcriptions based at least on the one or more scores. The action determined for the particular candidate transcription is performed.

    Speech recognition biasing
    5.
    Granted invention patent

    Publication No.: US10672398B1

    Publication date: 2020-06-02

    Application No.: US16568506

    Filing date: 2019-09-12

    Applicant: X Development LLC

    Inventor: Daniel Alex Lam

    Abstract: Systems and methods are described that include a robot and/or an associated computing system that can use various cues about an environment of the robot to apply a bias to increase the accuracy of speech transcription. In some implementations, audio data corresponding to a spoken instruction to a robot is received. Candidate transcriptions of the audio data are obtained. A respective action of the robot corresponding to each of the candidate transcriptions of the audio data is determined. One or more scores indicating characteristics of a potential outcome of performing the respective action corresponding to the candidate transcription of the audio data are determined for each of the candidate transcriptions of the audio data. A particular candidate transcription is selected from among the candidate transcriptions based at least on the one or more scores. The action determined for the particular candidate transcription is performed.

    Speech recognition biasing
    6.
    Granted invention patent

    Publication No.: US10438587B1

    Publication date: 2019-10-08

    Application No.: US15671613

    Filing date: 2017-08-08

    Applicant: X Development LLC

    Inventor: Daniel Alex Lam

    Abstract: Systems and methods are described that include a robot and/or an associated computing system that can use various cues about an environment of the robot to apply a bias to increase the accuracy of speech transcription. In some implementations, audio data corresponding to a spoken instruction to a robot is received. Candidate transcriptions of the audio data are obtained. A respective action of the robot corresponding to each of the candidate transcriptions of the audio data is determined. One or more scores indicating characteristics of a potential outcome of performing the respective action corresponding to the candidate transcription of the audio data are determined for each of the candidate transcriptions of the audio data. A particular candidate transcription is selected from among the candidate transcriptions based at least on the one or more scores. The action determined for the particular candidate transcription is performed.