-
1.
公开(公告)号:US20240194191A1
公开(公告)日:2024-06-13
申请号:US18389033
申请日:2023-11-13
申请人: GOOGLE LLC
IPC分类号: G10L15/14 , G06F3/16 , G10L15/00 , G10L15/02 , G10L15/08 , G10L15/18 , G10L15/183 , G10L15/22 , G10L15/30
CPC分类号: G10L15/14 , G06F3/167 , G10L15/005 , G10L15/02 , G10L15/1822 , G10L15/183 , G10L15/22 , G10L15/30 , G10L2015/088 , G10L2015/223 , G10L2015/228
摘要: Implementations relate to determining a language for speech recognition of a spoken utterance, received via an automated assistant interface, for interacting with an automated assistant. Implementations can enable multilingual interaction with the automated assistant, without necessitating a user explicitly designate a language to be utilized for each interaction. Selection of a speech recognition model for a particular language can based on one or more interaction characteristics exhibited during a dialog session between a user and an automated assistant. Such interaction characteristics can include anticipated user input types, anticipated user input durations, a duration for monitoring for a user response, and/or an actual duration of a provided user response.
-
公开(公告)号:US11817084B2
公开(公告)日:2023-11-14
申请号:US16880647
申请日:2020-05-21
申请人: GOOGLE LLC
IPC分类号: G10L15/14 , G10L15/02 , G10L15/18 , G06F3/16 , G10L15/00 , G10L15/183 , G10L15/22 , G10L15/30 , G10L15/08
CPC分类号: G10L15/14 , G06F3/167 , G10L15/005 , G10L15/02 , G10L15/183 , G10L15/1822 , G10L15/22 , G10L15/30 , G10L2015/088 , G10L2015/223 , G10L2015/228
摘要: The present disclosure relates generally to determining a language for speech recognition of a spoken utterance, received via an automated assistant interface, for interacting with an automated assistant. The system can enable multilingual interaction with the automated assistant, without necessitating a user explicitly designate a language to be utilized for each interaction. Selection of a speech recognition model for a particular language can based on one or more interaction characteristics exhibited during a dialog session between a user and an automated assistant. Such interaction characteristics can include anticipated user input types, anticipated user input durations, a duration for monitoring for a user response, and/or an actual duration of a provided user response.
-
公开(公告)号:US11810557B2
公开(公告)日:2023-11-07
申请号:US17676130
申请日:2022-02-19
申请人: Google LLC
发明人: Diego Melendo Casado
IPC分类号: G10L15/183 , G10L15/22
CPC分类号: G10L15/183 , G10L15/22 , G10L2015/223
摘要: Techniques are described herein for enabling the use of “dynamic” or “context-specific” hot words to invoke an automated assistant. In various implementations, an automated assistant may be executed in a default listening state at least in part on a user's computing device(s). While in the default listening state, audio data captured by microphone(s) may be monitored for default hot words. Detection of the default hot word(s) transitions of the automated assistant into a speech recognition state. Sensor signal(s) generated by hardware sensor(s) integral with the computing device(s) may be detected and analyzed to determine an attribute of the user. Based on the analysis, the automated assistant may transition into an enhanced listening state in which the audio data may be monitored for enhanced hot word(s). Detection of enhanced hot word(s) triggers the automated assistant to perform a responsive action without requiring detection of default hot word(s).
-
公开(公告)号:US11798541B2
公开(公告)日:2023-10-24
申请号:US17099367
申请日:2020-11-16
申请人: Google LLC
IPC分类号: G10L15/00 , G10L15/197 , G10L15/22 , G10L15/30 , G10L15/08 , G10L15/14 , G10L15/18 , G10L13/00
CPC分类号: G10L15/197 , G10L13/00 , G10L15/005 , G10L15/08 , G10L15/14 , G10L15/1822 , G10L15/22 , G10L15/30 , G10L2015/088 , G10L2015/223 , G10L2015/228
摘要: Determining a language for speech recognition of a spoken utterance received via an automated assistant interface for interacting with an automated assistant. Implementations can enable multilingual interaction with the automated assistant, without necessitating a user explicitly designate a language to be utilized for each interaction. Implementations determine a user profile that corresponds to audio data that captures a spoken utterance, and utilize language(s), and optionally corresponding probabilities, assigned to the user profile in determining a language for speech recognition of the spoken utterance. Some implementations select only a subset of languages, assigned to the user profile, to utilize in speech recognition of a given spoken utterance of the user. Some implementations perform speech recognition in each of multiple languages assigned to the user profile, and utilize criteria to select only one of the speech recognitions as appropriate for generating and providing content that is responsive to the spoken utterance.
-
公开(公告)号:US20230335116A1
公开(公告)日:2023-10-19
申请号:US18210963
申请日:2023-06-16
申请人: GOOGLE LLC
发明人: Meltem Oktem , Taral Pradeep Joglekar , Fnu Heryandi , Pu-sen Chao , Ignacio Lopez Moreno , Salil Rajadhyaksha , Alexander H. Gruenstein , Diego Melendo Casado
IPC分类号: G10L15/08 , G06F21/32 , G10L15/22 , G10L17/00 , G06V40/10 , G10L17/06 , G10L15/07 , G06F16/635
CPC分类号: G10L15/08 , G06F16/636 , G06F21/32 , G06V40/10 , G10L15/07 , G10L15/22 , G10L17/00 , G10L17/06 , G10L2015/088 , G10L15/26
摘要: In some implementations, processor(s) can receive an utterance from a speaker, and determine whether the speaker is a known user of a user device or not a known user of the user device. The user device can be shared by a plurality of known users. Further, the processor(s) can determine whether the utterance corresponds to a personal request or non-personal request. Moreover, and in response to determining that the speaker not a known user of the user device and in response to determining that the utterance corresponds to a non-personal request, the processor(s) can cause a response to the utterance to be provided for presentation to the speaker at the user device response to the utterance, or can cause an action to be performed by the user device responsive to the utterance.
-
公开(公告)号:US20220246140A1
公开(公告)日:2022-08-04
申请号:US17676130
申请日:2022-02-19
申请人: Google LLC
发明人: Diego Melendo Casado
IPC分类号: G10L15/183
摘要: Techniques are described herein for enabling the use of “dynamic” or “context-specific” hot words to invoke an automated assistant. In various implementations, an automated assistant may be executed in a default listening state at least in part on a user's computing device(s). While in the default listening state, audio data captured by microphone(s) may be monitored for default hot words. Detection of the default hot word(s) transitions of the automated assistant into a speech recognition state. Sensor signal(s) generated by hardware sensor(s) integral with the computing device(s) may be detected and analyzed to determine an attribute of the user. Based on the analysis, the automated assistant may transition into an enhanced listening state in which the audio data may be monitored for enhanced hot word(s). Detection of enhanced hot word(s) triggers the automated assistant to perform a responsive action without requiring detection of default hot word(s).
-
公开(公告)号:US11334182B2
公开(公告)日:2022-05-17
申请号:US16708539
申请日:2019-12-10
申请人: Google LLC
IPC分类号: G06F3/041 , G06F3/04842 , G06F3/04883 , G06F3/16 , G06F3/0488 , G06F3/023 , G06F40/58 , G06F40/274
摘要: In some implementations, data indicating a touch received on a proximity-sensitive display is received while the proximity-sensitive display is presenting one or more items. In one aspect, the techniques describe may involve a process for disambiguating touch selections of hypothesized items, such as text or graphical objects that have been generated based on input data, on a proximity-sensitive display. This process may allow a user to more easily select hypothesized items that the user may wish to correct, by determining whether a touch received through the proximity-sensitive display represents a selection of each hypothesized item based at least on a level of confidence that the hypothesized item accurately represents the input data.
-
公开(公告)号:US20210249015A1
公开(公告)日:2021-08-12
申请号:US17242273
申请日:2021-04-27
申请人: GOOGLE LLC
发明人: Kenneth Mixter , Diego Melendo Casado , Alexander Houston Gruenstein , Terry Tai , Christopher Thaddeus Hughes , Matthew Nirvan Sharifi
摘要: The various implementations described herein include methods and systems for determining device leadership among voice interface devices. In one aspect, a method is performed at a first electronic device of a plurality of electronic devices, each having microphones, a speaker, processors, and memory storing programs for execution by the processors. The first device detects a voice input. It determines a device state and a relevance of the voice input. It identifies a subset of electronic devices from the plurality to which the voice input is relevant. In accordance with a determination that the subset includes the first device, the first device determines a first score of a criterion associated with the voice input and receives second scores of the criterion from other devices in the subset. In accordance with a determination that the first score is higher than the second scores, the first device responds to the detected input.
-
公开(公告)号:US10896672B2
公开(公告)日:2021-01-19
申请号:US15769023
申请日:2018-04-16
申请人: Google LLC
IPC分类号: G10L15/14 , G10L15/02 , G10L15/18 , G10L15/00 , G10L15/22 , G10L15/30 , G06F3/16 , G10L15/183 , G10L15/08
摘要: Implementations relate to determining a language for speech recognition of a spoken utterance, received via an automated assistant interface, for interacting with an automated assistant. Implementations can enable multilingual interaction with the automated assistant, without necessitating a user explicitly designate a language to be utilized for each interaction. Selection of a speech recognition model for a particular language can based on one or more interaction characteristics exhibited during a dialog session between a user and an automated assistant. Such interaction characteristics can include anticipated user input types, anticipated user input durations, a duration for monitoring for a user response, and/or an actual duration of a provided user response.
-
公开(公告)号:US20200342855A1
公开(公告)日:2020-10-29
申请号:US16618681
申请日:2018-08-21
申请人: Google LLC
发明人: Diego Melendo Casado
IPC分类号: G10L15/183
摘要: Techniques are described herein for enabling the use of “dynamic” or “context-specific” hot words to invoke an automated assistant. In various implementations, an automated assistant may be executed in a default listening state at least in part on a user's computing device(s). While in the default listening state, audio data captured by microphone(s) may be monitored for default hot words. Detection of the default hot word(s) transitions of the automated assistant into a speech recognition state. Sensor signal(s) generated by hardware sensor(s) integral with the computing device(s) may be detected and analyzed to determine an attribute of the user. Based on the analysis, the automated assistant may transition into an enhanced listening state in which the audio data may be monitored for enhanced hot word(s). Detection of enhanced hot word(s) triggers the automated assistant to perform a responsive action without requiring detection of default hot word(s).
-
-
-
-
-
-
-
-
-