-
公开(公告)号:US12223950B2
公开(公告)日:2025-02-11
申请号:US18144694
申请日:2023-05-08
Applicant: GOOGLE LLC
Inventor: Marcin Nowak-Przygodzki , Nathan David Howard , Gabor Simko , Andrei Giurgiu , Behshad Behzadi
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for detecting a continued conversation are disclosed. In one aspect, a method includes the actions of receiving first audio data of a first utterance. The actions further include obtaining a first transcription of the first utterance. The actions further include receiving second audio data of a second utterance. The actions further include obtaining a second transcription of the second utterance. The actions further include determining whether the second utterance includes a query directed to a query processing system based on analysis of the second transcription and the first transcription or a response to the first query. The actions further include configuring the data routing component to provide the second transcription of the second utterance to the query processing system as a second query or bypass routing the second transcription.
-
公开(公告)号:US20240257817A1
公开(公告)日:2024-08-01
申请号:US18104748
申请日:2023-02-01
Applicant: GOOGLE LLC
Inventor: Marcin Nowak-Przygodzki , Andrei Giurgiu , Mugurel-Ionut Andreica , Joseph Lange
Abstract: Techniques are described herein for delegation of request fulfillment, by an assistant, to other devices. A method includes: receiving, by a first device, a request from a first user; identifying, based on the request from the first user, (i) an action corresponding to the request and (ii) a first parameter corresponding to the action; determining that fulfillment of the action is to be delegated to a device other than the first device; in response: selecting, as the device other than the first device, a second device on which an application corresponding to the action is installed; identifying, by the first device, based on the first parameter and information associated with an account of the first user, a first disambiguated parameter corresponding to the action; and sending, to the second device, a command that specifies the action and the first disambiguated parameter, to cause the second device to fulfill the action.
-
公开(公告)号:US12032874B2
公开(公告)日:2024-07-09
申请号:US18366172
申请日:2023-08-07
Applicant: GOOGLE LLC
Inventor: Joseph Lange , Marcin Nowak-Przygodzki
CPC classification number: G06F3/167 , G10L15/22 , G10L15/28 , G10L2015/223
Abstract: Implementations set forth herein relate to an automated assistant that can provide a selectable action intent suggestion when a user is accessing a third party application that is controllable via the automated assistant. The action intent can be initialized by the user without explicitly invoking the automated assistant using, for example, an invocation phrase (e.g., “Assistant . . . ”). Rather, the user can initialize performance of the corresponding action by identifying one or more action parameters. In some implementations, the selectable suggestion can indicate that a microphone is active for the user to provide a spoken utterance that identifies a parameter(s). When the action intent is initialized in response to the spoken utterance from the user, the automated assistant can control the third party application according to the action intent and any identified parameter(s).
-
公开(公告)号:US11967321B2
公开(公告)日:2024-04-23
申请号:US17538641
申请日:2021-11-30
Applicant: GOOGLE LLC
Inventor: Joseph Lange , Abhanshu Sharma , Adam Coimbra , Gökhan Bakir , Gabriel Taubman , Ilya Firman , Jindong Chen , James Stout , Marcin Nowak-Przygodzki , Reed Enger , Thomas Weedon Hume , Vishwath Mohan , Jacek Szmigiel , Yunfan Jin , Kyle Pedersen , Gilles Baechler
IPC: G10L15/22 , G06F3/16 , G06F40/247 , G06F40/30 , G10L15/18
CPC classification number: G10L15/22 , G06F3/167 , G06F40/247 , G06F40/30 , G10L15/1815 , G10L15/1822 , G10L2015/223 , G10L2015/228
Abstract: Implementations set forth herein relate to an automated assistant that can interact with applications that may not have been pre-configured for interfacing with the automated assistant. The automated assistant can identify content of an application interface of the application to determine synonymous terms that a user may speak when commanding the automated assistant to perform certain tasks. Speech processing operations employed by the automated assistant can be biased towards these synonymous terms when the user is accessing an application interface of the application. In some implementations, the synonymous terms can be identified in a responsive language of the automated assistant when the content of the application interface is being rendered in a different language. This can allow the automated assistant to operate as an interface between the user and certain applications that may not be rendering content in a native language of the user.
-
公开(公告)号:US20230385022A1
公开(公告)日:2023-11-30
申请号:US18366172
申请日:2023-08-07
Applicant: GOOGLE LLC
Inventor: Joseph Lange , Marcin Nowak-Przygodzki
CPC classification number: G06F3/167 , G10L15/22 , G10L15/28 , G10L2015/223
Abstract: Implementations set forth herein relate to an automated assistant that can provide a selectable action intent suggestion when a user is accessing a third party application that is controllable via the automated assistant. The action intent can be initialized by the user without explicitly invoking the automated assistant using, for example, an invocation phrase (e.g., “Assistant . . . ”). Rather, the user can initialize performance of the corresponding action by identifying one or more action parameters. In some implementations, the selectable suggestion can indicate that a microphone is active for the user to provide a spoken utterance that identifies a parameter(s). When the action intent is initialized in response to the spoken utterance from the user, the automated assistant can control the third party application according to the action intent and any identified parameter(s).
-
6.
公开(公告)号:US11769502B2
公开(公告)日:2023-09-26
申请号:US17339114
申请日:2021-06-04
Applicant: Google LLC
Inventor: Mugurel Ionut Andreica , Vladimir Vuskovic , Joseph Lange , Sharon Stovezky , Marcin Nowak-Przygodzki
CPC classification number: G10L15/22 , G06N3/08 , G10L15/02 , G10L2015/223
Abstract: Implementations are set forth herein for creating an order of execution for actions that were requested by a user, via a spoken utterance to an automated assistant. The order of execution for the requested actions can be based on how each requested action can, or is predicted to, affect other requested actions. In some implementations, an order of execution for a series of actions can be determined based on an output of a machine learning model, such as a model that has been trained according to supervised learning. A particular order of execution can be selected to mitigate waste of processing, memory, and network resources—at least relative to other possible orders of execution. Using interaction data that characterizes past performances of automated assistants, certain orders of execution can be adapted over time, thereby allowing the automated assistant to learn from past interactions with one or more users.
-
公开(公告)号:US11720325B2
公开(公告)日:2023-08-08
申请号:US17233223
申请日:2021-04-16
Applicant: Google LLC
Inventor: Joseph Lange , Marcin Nowak-Przygodzki
CPC classification number: G06F3/167 , G10L15/22 , G10L15/28 , G10L2015/223
Abstract: Implementations set forth herein relate to an automated assistant that can provide a selectable action intent suggestion when a user is accessing a third party application that is controllable via the automated assistant. The action intent can be initialized by the user without explicitly invoking the automated assistant using, for example, an invocation phrase (e.g., “Assistant . . . ”). Rather, the user can initialize performance of the corresponding action by identifying one or more action parameters. In some implementations, the selectable suggestion can indicate that a microphone is active for the user to provide a spoken utterance that identifies a parameter(s). When the action intent is initialized in response to the spoken utterance from the user, the automated assistant can control the third party application according to the action intent and any identified parameter(s).
-
公开(公告)号:US20230177272A1
公开(公告)日:2023-06-08
申请号:US18103255
申请日:2023-01-30
Applicant: Google LLC
Inventor: Sharon Stovezky , Yariv Adan , Radu Voroneanu , Behshad Behzadi , Ragnar Groot Koerkamp , Marcin Nowak-Przygodzki
IPC: G06F40/295 , G06F16/9537
CPC classification number: G06F40/295 , G06F16/9537
Abstract: Implementations set forth herein relate to an automated assistant that operates according to a variety of different location-based biasing modes for rendering responsive content for a user and/or proactively suggesting content for the user. The user can provide condensed spoken utterances to the automated assistant, when the automated assistant is operating according to one or more location-based biasing modes, but nonetheless receive accurate responsive outputs from the automated assistant. A responsive output generated by biasing toward a subset of location characteristic data that has been prioritized over other subsets of location characteristic data. The biasing allows the automated assistant to compensate for any details that may be missing from a spoken utterance, but allows the user to provide shorter spoken utterances, thereby reducing an amount of language processing when processing inputs from the user.
-
9.
公开(公告)号:US11567980B2
公开(公告)日:2023-01-31
申请号:US16617360
申请日:2018-05-07
Applicant: Google LLC
Inventor: Joseph Lange , Mugurel Ionut Andreica , Marcin Nowak-Przygodzki
IPC: G06F16/33 , G06F16/215 , G06F16/332 , G06F16/338 , G10L15/22 , G10L15/26
Abstract: Implementations are directed to determining, based on a submitted query that is a compound query, that a set of multiple sub-queries are collectively an appropriate interpretation of the compound query. Those implementations are further directed to providing, in response to such a determination, a corresponding command for each of the sub-queries of the determined set. Each of the commands is to a corresponding agent (of one or more agents), and causes the agent to generate and provide corresponding responsive content. Those implementations are further directed to causing content to be rendered in response to the submitted query, where the content is based on the corresponding responsive content received in response to the commands.
-
公开(公告)号:US11470022B2
公开(公告)日:2022-10-11
申请号:US16832637
申请日:2020-03-27
Applicant: Google LLC
Inventor: Marcin Nowak-Przygodzki , Jan Lamecki , Behshad Behzadi
IPC: H04L51/02 , H04L12/18 , G10L15/18 , G10L15/22 , G06Q10/10 , H04M3/493 , G10L15/26 , H04M3/527 , G10L15/30
Abstract: Techniques are described related to enabling automated assistants to enter into a “conference mode” in which they can “participate” in meetings between multiple human participants and perform various functions described herein. In various implementations, an automated assistant implemented at least in part on conference computing device(s) may be set to a conference mode in which the automated assistant performs speech-to-text processing on multiple distinct spoken utterances, provided by multiple meeting participants, without requiring explicit invocation prior to each utterance. The automated assistant may perform semantic processing on first text generated from the speech-to-text processing of one or more of the spoken utterances, and generate, based on the semantic processing, data that is pertinent to the first text. The data may be output to the participants at conference computing device(s). The automated assistant may later determine that the meeting has concluded, and may be set to a non-conference mode.
-
-
-
-
-
-
-
-
-