-
公开(公告)号:US11670300B2
公开(公告)日:2023-06-06
申请号:US17811425
申请日:2022-07-08
申请人: X Development LLC
发明人: Daniel Alex Lam
CPC分类号: G10L15/22 , B25J13/003 , G06N7/01 , G10L15/26 , G10L15/183 , G10L2015/223 , G10L2015/228
摘要: Systems and methods are described include a robot and/or an associated computing system that can use various cues about an environment of the robot to apply a bias to increase the accuracy of speech transcription. In some implementations, audio data corresponding to a spoken instruction to a robot is received. Candidate transcriptions of the audio data are obtained. A respective action of the robot corresponding to each of the candidate transcriptions of the audio data is determined. One or more scores indicating characteristics of a potential outcome of performing the respective action corresponding to the candidate transcription of the audio data are determined for each of the candidate transcriptions of the audio data. A particular candidate transcription is selected from among the candidate transcriptions based at least on the one or more scores. The action determined for the particular candidate transcription is performed.
-
公开(公告)号:US10896675B1
公开(公告)日:2021-01-19
申请号:US16024350
申请日:2018-06-29
申请人: X Development LLC
发明人: Daniel Alex Lam
摘要: Methods, systems, and apparatus, including computer programs stored on computer storage devices, for multi-tiered command processing are disclosed. One of the methods includes operating an audio processing system to detect voice commands in a set of first commands controlling behavior of the robot and in a set of second commands controlling behavior of the robot, wherein the system detects commands in the set of first commands using a local recognition model that does not require communication over a network, and detects commands in the set of second commands with a server over a network; executing one or more applications to respond to commands in a set of application commands that include commands different from the first commands and second commands; and detecting and carrying out commands, according to a predetermined hierarchy that prioritizes detection of the first commands first, the second commands second, and the application commands third.
-
公开(公告)号:US20220343910A1
公开(公告)日:2022-10-27
申请号:US17811425
申请日:2022-07-08
申请人: X Development LLC
发明人: Daniel Alex Lam
摘要: Systems and methods are described include a robot and/or an associated computing system that can use various cues about an environment of the robot to apply a bias to increase the accuracy of speech transcription. In some implementations, audio data corresponding to a spoken instruction to a robot is received. Candidate transcriptions of the audio data are obtained. A respective action of the robot corresponding to each of the candidate transcriptions of the audio data is determined. One or more scores indicating characteristics of a potential outcome of performing the respective action corresponding to the candidate transcription of the audio data are determined for each of the candidate transcriptions of the audio data. A particular candidate transcription is selected from among the candidate transcriptions based at least on the one or more scores. The action determined for the particular candidate transcription is performed.
-
公开(公告)号:US11417333B1
公开(公告)日:2022-08-16
申请号:US16878822
申请日:2020-05-20
申请人: X Development LLC
发明人: Daniel Alex Lam
IPC分类号: G10L21/00 , G10L15/10 , G10L15/20 , G10L15/22 , B25J13/00 , G06N7/00 , G10L15/26 , G10L15/183
摘要: Systems and methods are described include a robot and/or an associated computing system that can use various cues about an environment of the robot to apply a bias to increase the accuracy of speech transcription. In some implementations, audio data corresponding to a spoken instruction to a robot is received. Candidate transcriptions of the audio data are obtained. A respective action of the robot corresponding to each of the candidate transcriptions of the audio data is determined. One or more scores indicating characteristics of a potential outcome of performing the respective action corresponding to the candidate transcription of the audio data are determined for each of the candidate transcriptions of the audio data. A particular candidate transcription is selected from among the candidate transcriptions based at least on the one or more scores. The action determined for the particular candidate transcription is performed.
-
公开(公告)号:US10672398B1
公开(公告)日:2020-06-02
申请号:US16568506
申请日:2019-09-12
申请人: X Development LLC
发明人: Daniel Alex Lam
IPC分类号: G10L15/22 , G06N7/00 , G10L15/26 , B25J13/00 , G10L15/183
摘要: Systems and methods are described include a robot and/or an associated computing system that can use various cues about an environment of the robot to apply a bias to increase the accuracy of speech transcription. In some implementations, audio data corresponding to a spoken instruction to a robot is received. Candidate transcriptions of the audio data are obtained. A respective action of the robot corresponding to each of the candidate transcriptions of the audio data is determined. One or more scores indicating characteristics of a potential outcome of performing the respective action corresponding to the candidate transcription of the audio data are determined for each of the candidate transcriptions of the audio data. A particular candidate transcription is selected from among the candidate transcriptions based at least on the one or more scores. The action determined for the particular candidate transcription is performed.
-
公开(公告)号:US10438587B1
公开(公告)日:2019-10-08
申请号:US15671613
申请日:2017-08-08
申请人: X Development LLC
发明人: Daniel Alex Lam
摘要: Systems and methods are described include a robot and/or an associated computing system that can use various cues about an environment of the robot to apply a bias to increase the accuracy of speech transcription. In some implementations, audio data corresponding to a spoken instruction to a robot is received. Candidate transcriptions of the audio data are obtained. A respective action of the robot corresponding to each of the candidate transcriptions of the audio data is determined. One or more scores indicating characteristics of a potential outcome of performing the respective action corresponding to the candidate transcription of the audio data are determined for each of the candidate transcriptions of the audio data. A particular candidate transcription is selected from among the candidate transcriptions based at least on the one or more scores. The action determined for the particular candidate transcription is performed.
-
-
-
-
-