-
公开(公告)号:US12217759B2
公开(公告)日:2025-02-04
申请号:US18434602
申请日:2024-02-06
Applicant: GOOGLE LLC
Inventor: Barnaby James , Bo Wang , Sunil Vemuri , David Schairer , Ulas Kirazci , Ertan Dogrultan , Petar Aleksic
IPC: G10L15/26 , G06F40/205 , G06F40/284 , G06F40/30 , G10L15/18 , G10L15/183 , G10L15/22 , G10L15/30
Abstract: Implementations relate to dynamically, and in a context-sensitive manner, biasing voice to text conversion. In some implementations, the biasing of voice to text conversions is performed by a voice to text engine of a local agent, and the biasing is based at least in part on content provided to the local agent by a third-party (3P) agent that is in network communication with the local agent. In some of those implementations, the content includes contextual parameters that are provided by the 3P agent in combination with responsive content generated by the 3P agent during a dialog that: is between the 3P agent, and a user of a voice-enabled electronic device; and is facilitated by the local agent. The contextual parameters indicate potential feature(s) of further voice input that is to be provided in response to the responsive content generated by the 3P agent.
-
公开(公告)号:US20240274133A1
公开(公告)日:2024-08-15
申请号:US18434602
申请日:2024-02-06
Applicant: GOOGLE LLC
Inventor: Barnaby James , Bo Wang , Sunil Vemuri , David Schairer , Ulas Kirazci , Ertan Dogrultan , Petar Aleksic
IPC: G10L15/26 , G06F40/205 , G06F40/284 , G06F40/30 , G10L15/18 , G10L15/183 , G10L15/22 , G10L15/30
CPC classification number: G10L15/26 , G06F40/205 , G06F40/284 , G06F40/30 , G10L15/1815 , G10L15/183 , G10L15/22 , G10L15/30 , G10L2015/223 , G10L2015/228
Abstract: Implementations relate to dynamically, and in a context-sensitive manner, biasing voice to text conversion. In some implementations, the biasing of voice to text conversions is performed by a voice to text engine of a local agent, and the biasing is based at least in part on content provided to the local agent by a third-party (3P) agent that is in network communication with the local agent. In some of those implementations, the content includes contextual parameters that are provided by the 3P agent in combination with responsive content generated by the 3P agent during a dialog that: is between the 3P agent, and a user of a voice-enabled electronic device; and is facilitated by the local agent. The contextual parameters indicate potential feature(s) of further voice input that is to be provided in response to the responsive content generated by the 3P agent.
-
公开(公告)号:US12002463B2
公开(公告)日:2024-06-04
申请号:US17728614
申请日:2022-04-25
Applicant: GOOGLE LLC
Inventor: Bo Wang , Venkat Kotla , Chad Yoshikawa , Chris Ramsdale , Pravir Gupta , Alfonso Gomez-Jordana , Kevin Yeun , Jae Won Seo , Lantian Zheng , Sang Soo Sung
CPC classification number: G10L15/22 , G06F3/167 , G10L15/1822 , G10L15/26 , G10L15/30
Abstract: Systems and methods for enabling voice-based interactions with electronic devices can include a data processing system maintaining a plurality of device action data sets and a respective identifier for each device action data set. The data processing system can receive, from an electronic device, an audio signal representing a voice query and an identifier. The data processing system can identify, using the identifier, a device action data set. The data processing system can identify a device action from device action data set based on content of the audio signal. The data processing system can then identify, from the device action dataset, a command associated with the device action and send the command to the for execution device for execution.
-
公开(公告)号:US11929075B2
公开(公告)日:2024-03-12
申请号:US16936935
申请日:2020-07-23
Applicant: Google LLC
Inventor: Bo Wang , Sunil Vemuri , Barnaby John James , Pravir Kumar Gupta , Nitin Mangesh Shetti
CPC classification number: G10L15/30 , G06F3/167 , G10L15/1822 , G10L25/72 , G10L2015/223 , G10L2015/225
Abstract: Methods, systems, and apparatus for receiving, by a voice action system, data specifying trigger terms that trigger an application to perform a voice action and a context that specifies a status of the application when the voice action can be triggered. The voice action system receives data defining a discoverability example for the voice action that comprises one or more of the trigger terms that trigger the application to perform the voice action when a status of the application satisfies the specified context. The voice action system receives a request for discoverability examples for the application from a user device having the application installed, and provides the data defining the discoverability examples to the user device in response to the request. The user device is configured to provide a notification of the one or more of the trigger terms when a status of the application satisfies the specified context.
-
公开(公告)号:US20230260517A1
公开(公告)日:2023-08-17
申请号:US18125606
申请日:2023-03-23
Applicant: GOOGLE LLC
Inventor: Barnaby James , Bo Wang , Sunil Vemuri , David Schairer , Ulas Kirazci , Ertan Dogrultan , Petar Aleksic
IPC: G10L15/26 , G10L15/22 , G06F40/284 , G06F40/205 , G06F40/30 , G10L15/183 , G10L15/18 , G10L15/30
CPC classification number: G10L15/26 , G06F40/30 , G06F40/205 , G06F40/284 , G10L15/22 , G10L15/30 , G10L15/183 , G10L15/1815 , G10L2015/223 , G10L2015/228
Abstract: Implementations relate to dynamically, and in a context-sensitive manner, biasing voice to text conversion. In some implementations, the biasing of voice to text conversions is performed by a voice to text engine of a local agent, and the biasing is based at least in part on content provided to the local agent by a third-party (3P) agent that is in network communication with the local agent. In some of those implementations, the content includes contextual parameters that are provided by the 3P agent in combination with responsive content generated by the 3P agent during a dialog that: is between the 3P agent, and a user of a voice-enabled electronic device; and is facilitated by the local agent. The contextual parameters indicate potential feature(s) of further voice input that is to be provided in response to the responsive content generated by the 3P agent.
-
公开(公告)号:US11232797B2
公开(公告)日:2022-01-25
申请号:US16791334
申请日:2020-02-14
Applicant: Google LLC
Inventor: Barnaby James , Bo Wang , Sunil Vemuri , David Schairer , Ulas Kirazci , Ertan Dogrultan , Petar Aleksic
Abstract: Implementations relate to dynamically, and in a context-sensitive manner, biasing voice to text conversion. In some implementations, the biasing of voice to text conversions is performed by a voice to text engine of a local agent, and the biasing is based at least in part on content provided to the local agent by a third-party (3P) agent that is in network communication with the local agent. In some of those implementations, the content includes contextual parameters that are provided by the 3P agent in combination with responsive content generated by the 3P agent during a dialog that: is between the 3P agent, and a user of a voice-enabled electronic device; and is facilitated by the local agent. The contextual parameters indicate potential feature(s) of further voice input that is to be provided in response to the responsive content generated by the 3P agent.
-
公开(公告)号:US11087752B2
公开(公告)日:2021-08-10
申请号:US16109229
申请日:2018-08-22
Applicant: Google LLC
Inventor: Bo Wang , Subbaiah Venkata , Chad Yoshikawa , Chris Ramsdale , Pravir Gupta , Alfonso Gomez-Jordana , Kevin Yeun , Jae Won Seo , Lantian Zheng , Sang Soo Sung
IPC: G10L15/22 , G10L15/30 , G10L15/18 , G06F16/632 , G10L15/08
Abstract: Systems and methods for enabling voice-based interactions with electronic devices can include a data processing system maintaining a plurality of device action data sets and a respective identifier for each device action data set. The data processing system can receive, from an electronic device, an audio signal representing a voice query and an identifier. The data processing system can identify, using the identifier, a device action data set. The data processing system can identify a device action from device action data set based on content of the audio signal. The data processing system can then identify, from the device action dataset, a command associated with the device action and send the command to the for execution device for execution.
-
公开(公告)号:US10831791B1
公开(公告)日:2020-11-10
申请号:US15964805
申请日:2018-04-27
Applicant: Google LLC
Inventor: Bo Wang , Omer Bar-or , Pravir K. Gupta , Yang Gao , Nitin Mangesh Shetti
IPC: G06F17/00 , G06F16/29 , G06F16/9537
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for using location aliases. In some implementations, a query is received, and a user that submitted the query is identified. It can be determined that one or more terms of the query represent an alias for a user-specific geographical location that has not been designated for the identified user. In response, a prompt can be provided to the user to specify a geographical location corresponding to the one or more query terms, data indicating a geographical location is received, and data is stored that indicates that, for the identified user, the one or more terms are an alias for the geographical location. One or more search results responsive to the query are provided, where the alias corresponds to the geographical location input in response to the prompt.
-
公开(公告)号:US20190147878A1
公开(公告)日:2019-05-16
申请号:US16244780
申请日:2019-01-10
Applicant: Google LLC
Inventor: Ulas Kirazci , Bo Wang , Steve Chen , Sunil Vemuri , Barnaby James , Valerie Nygaard
Abstract: Some implementations are directed to selective invocation of a particular third-party (3P) agent by an automated assistant to achieve an intended action determined by the automated assistant during a dynamic dialog between the automated assistant and a user. In some of those implementations, the particular 3P agent is invoked with value(s) for parameter(s) that are determined during the dynamic dialog; and/or the particular 3P agent is selected, from a plurality of candidate 3P agents, for invocation based on the determined value(s) for the parameter(s) and/or based on other criteria. In some of those implementations, the automated assistant invokes the particular 3P agent by transmitting, to the particular 3P agent, a 3P invocation request that includes the determined value(s) for the parameter(s).
-
公开(公告)号:US20240202235A1
公开(公告)日:2024-06-20
申请号:US18587482
申请日:2024-02-26
Applicant: Google LLC
Inventor: Bo Wang , Smita Rai , Max Ohlendorf , Subbaiah Venkata , Chad Yoshikawa , Abhinav Taneja , Amit Agarwal , Chris Ramsdale , Chris Turkstra
IPC: G06F16/632 , G06F16/638 , G06F21/62
CPC classification number: G06F16/634 , G06F16/638 , G06F21/6218
Abstract: Coordinating processing of audio queries is provided. A system receives a query. The system provides the query to a first digital assistant component and a second digital assistant component for processing. The system receives a first response to the query from the first digital assistant component, and a second response to the query from the second digital assistant component. The first digital assistant component can be authorized to access a database the second digital assistant component is prohibited from accessing. The system determines, based on a ranking decision function, to select the second response to the query from the second digital assistant component. The system provides, responsive to the selection, the second response from the second digital assistant to a computing device.
-
-
-
-
-
-
-
-
-