-
公开(公告)号:US20240119958A1
公开(公告)日:2024-04-11
申请号:US18488623
申请日:2023-10-17
Applicant: Google LLC
Inventor: Anshul Kothari , Gaurav Bhaya , Tarun Jain
CPC classification number: G10L25/60 , G06N20/00 , G10L25/03 , H04L12/282 , G10L2015/226
Abstract: Coordinating signal processing among computing devices in a voice-driven computing environment is provided. A first and second digital assistant can detect an input audio signal, perform a signal quality check, and provide indications that the first and second digital assistants are operational to process the input audio signal. A system can select the first digital assistant for further processing. The system can receive, from the first digital assistant, data packets including a command. The system can generate, for a network connected device selected from a plurality of network connected devices, an action data structure based on the data packets, and transmit the action data structure to the selected network connected device.
-
公开(公告)号:US11880442B2
公开(公告)日:2024-01-23
申请号:US17543371
申请日:2021-12-06
Applicant: GOOGLE LLC
Inventor: Ken Krieger , Andrew Joseph Alexander Gildfind , Nicholas Salvatore Arini , Simon Michael Rowe , Raimundo Mirisola , Gaurav Bhaya , Robert Stets
IPC: G06F21/35 , G06F21/34 , H04N21/422 , H04N21/4223 , H04N21/442 , G06F21/32 , H04L9/40 , G10L17/24 , G06F21/31 , G10L17/00 , G06V40/16
CPC classification number: G06F21/32 , G06F21/316 , G06F21/34 , G06F21/35 , G06V40/172 , G10L17/00 , G10L17/24 , H04L63/0861 , H04L63/107 , H04N21/4223 , H04N21/42203 , H04N21/44218
Abstract: The present disclosure is generally directed a data processing system for authenticating packetized audio signals in a voice activated computer network environment. The data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, disabling malicious transmissions prior to their transmission across the network. The present solution can also improve computational efficiency by disabling remote computer processes possibly affected by or caused by the malicious audio signal transmissions. By disabling the transmission of malicious audio signals, the system can reduce bandwidth utilization by not transmitting the data packets carrying the malicious audio signal across the networks.
-
公开(公告)号:US20230246936A1
公开(公告)日:2023-08-03
申请号:US18298034
申请日:2023-04-10
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets
IPC: H04L43/103 , G06F16/683 , G06F16/33 , G10L15/26 , G06F3/16 , G06F40/186 , G10L15/18 , G10L15/22 , H04L41/0813 , H04L67/12
CPC classification number: H04L43/103 , G06F16/683 , G06F16/3344 , G10L15/26 , G06F3/167 , G06F40/186 , G10L15/1822 , G10L15/22 , H04L41/0813 , H04L67/12 , G10L2015/088
Abstract: A selective sensor polling system for a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify trigger keyword and request. The system can select a template for an action data structure with a plurality of fields. The system can determine to poll a first sensor for data for the first field. The system can determine to obtain data in memory previously collected by the second sensor. The system can generate and transmit the action data structure with the data from the sensor and memory, and transmit the action data structure to a third party device.
-
公开(公告)号:US11381609B2
公开(公告)日:2022-07-05
申请号:US16909375
申请日:2020-06-23
Applicant: Google LLC
Inventor: Justin Lewis , Richard Rapp , Gaurav Bhaya , Robert Stets
IPC: H04L65/1066 , G06F9/50 , G06F9/451 , G10L15/18 , H04L45/00 , H04L65/60 , H04L65/80 , G01S5/02 , G01S5/18 , G06F3/16 , G10L15/08
Abstract: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
-
公开(公告)号:US11334713B2
公开(公告)日:2022-05-17
申请号:US16561997
申请日:2019-09-05
Applicant: GOOGLE LLC
Inventor: Gaurav Bhaya , Robert James Stets, Jr.
Abstract: Identifier dependent operation processing of packet based data communication is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. A content selector component can select, based on the request or trigger keyword, a content item. A link generation component can determine whether the client computing device has an account or a record in a database associated with the service provider device. In the absence of the record or account, the link generation device generates and sends a virtual identifier to the service provider device with instructions to generate an account in the database using the virtual identifier. Once the account is created, the service provider device can communicate with the client computing device.
-
公开(公告)号:US11093686B2
公开(公告)日:2021-08-17
申请号:US16696558
申请日:2019-11-26
Applicant: GOOGLE LLC
Inventor: Graeme John Rimmer , Lewis Jay Hemens , Gaurav Bhaya , Robert Stets
IPC: G06F40/103 , G06F40/131 , G06F40/186 , G06F40/189 , H04L29/08 , H04L29/06 , G10L15/22
Abstract: Systems and methods for automatically determining a content item size may be based on a size of a viewport and a width of a parent element. A script may be configured to determine a size of a viewport, determine a width of a parent element of a resource, and determine a content item size based, at least in part, on the size of the view port and the width of the parent element. A dimension of the determined content item size may be used by a content item selection system to determine a set of content items. A content item selection system may select a content item from the determined set of content items and serve data to effect display of the selected content item in the parent element with the resource.
-
公开(公告)号:US11087424B1
公开(公告)日:2021-08-10
申请号:US15638324
申请日:2017-06-29
Applicant: GOOGLE LLC
Inventor: Subhadip Sarkar , Ravi Jain , Gaurav Bhaya , Robert Stets
Abstract: Systems and methods of selecting content based on image data are provided. A system can receive an image captured by a camera of the computing device. The system can analyze the image to identify a pattern that matches a predetermined pattern of an object stored in an image pattern database comprising a plurality of predetermined patterns of objects. The system can identify one or more image keywords from the image based on the predetermined pattern of the object that matches the image. The system can select, based on a comparison of the one or more image keywords with one or more keywords of each of a plurality of content items, a content item. The system can provide, to the computing device, the content item to cause the computing device to present the content item.
-
公开(公告)号:US20200374265A1
公开(公告)日:2020-11-26
申请号:US16989737
申请日:2020-08-10
Applicant: Google LLC
Inventor: Anshul Kothari , Gaurav Bhaya , Amit Agarwal , Surojit Chatterjee , Vibhor Nanavati
Abstract: The disclosure is directed to maintaining an asynchronous communication via an intermediary. An agent executed on a device intermediary to clients and providers can receive a request from a client for an asynchronous communication between the client device and a provider device. The agent identifies a provider dynamic identifier identifying the provider. The agent uses a communication channel selection policy to determine a communication channel and a provider static identifier corresponding to the provider device. The agent maps a client dynamic identifier assigned by the server to a client static identifier of the client device. The agent generates a transmission data package including the content, a source address identifying the client dynamic identifier and a destination address identifying the provider static identifier. The agent provides the generated transmission data package for transmission via the communication channel determined using the communication channel selection policy to the provider device.
-
9.
公开(公告)号:US20200213382A1
公开(公告)日:2020-07-02
申请号:US16818736
申请日:2020-03-13
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert James Stets, JR.
IPC: H04L29/08 , G06F40/279 , G10L15/26 , G06F16/9032 , G06F16/951
Abstract: Systems and methods to combine multiple voice activated audio input data packets that indicate sequence dependent operations are provided. A natural language processor component can receive first and second input audio signal from a client computing device, and can identify respective requests and corresponding trigger keywords. A direct action application programming interface (“API”) can generate respective action data structures, and can construct respective data transmissions including the respective action data structures. A thread optimization component can obtain data packets of the first data transmission, and can obtain data packets of the second data transmission. The thread optimization component can determine, based on a heuristic technique applied to the data packets of the respective data transmissions a sequence dependency parameter. The thread optimization component can merge, based on a comparison of the sequence dependency parameter with a threshold, the first and second data transmissions into a single thread.
-
公开(公告)号:US12183347B2
公开(公告)日:2024-12-31
申请号:US18154448
申请日:2023-01-13
Applicant: Google LLC
Inventor: Anshul Kothari , Gaurav Bhaya , Tarun Jain
IPC: G10L15/22 , G06F3/16 , G06F9/44 , G06F16/903 , G06F21/32 , G10L15/08 , G10L17/00 , H04L67/306 , H04W4/021 , G10L17/22
Abstract: Processing stacked data structures is provided. A system receives an input audio signal detected by a sensor of a local computing device, identifies an acoustic signature, and identifies an account corresponding to the signature. The system establishes a session and a profile stack data structure including a first profile layer having policies configured by a third-party device. The system pushes, to the profile stack data structure, a second profile layer retrieved from the account. The system parses the input audio signal to identify a request and a trigger keyword. The system generates, based on the trigger keyword and the second profile layer, a first action data structure compatible with the first profile layer. The system provides the first action data structure for execution. The system disassembles the profile stack data structure to remove the first profile layer or the second profile layer from the profile stack data structure.
-
-
-
-
-
-
-
-
-