-
Publication No.: US20210366481A1
Publication Date: 2021-11-25
Application No.: US17393258
Filing Date: 2021-08-03
Applicant: Google LLC
Inventor: Gaurav Bhaya , Tarun Jain , Anshul Kothari
IPC: G10L15/22 , G10L15/18 , H04L12/801
Abstract: A system for reducing transmissions of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify candidate interfaces and determine whether prior instances of the packetized data were transmitted to the candidate interfaces. The interface management component can prevent the transmission of the packetized data if the transmission is determined to be redundant.
-
Publication No.: US20210365628A1
Publication Date: 2021-11-25
Application No.: US17393250
Filing Date: 2021-08-03
Applicant: Google LLC
Inventor: Boon-Lock Yeo , Xuemei Gu , Gangjiang Li , Gaurav Bhaya , Robert Stets
IPC: G06F40/134 , G06K9/00 , G06F16/432 , G06F16/583 , G06F40/279
Abstract: Systems and methods for extracting audiovisual features from images and other digital components are provided. A data processing system can extract image data and image features from an input image. The data processing system can match the image features to the image features of a plurality of images to identify candidate images. A second image can be selected from the candidate images based on a request that the data processing system received with the input image.
-
Publication No.: US20210326106A1
Publication Date: 2021-10-21
Application No.: US17363566
Filing Date: 2021-06-30
Applicant: Google LLC
Inventor: Anshul Kothari , Gaurav Bhaya , Tarun Jain
IPC: G06F3/16 , G10L15/18 , G06F16/23 , G06F16/9535 , G06F9/451 , G06F16/332 , G10L15/22 , G10L15/26
Abstract: Managing rendering of a graphical user interface is provided. A system receives data packets comprising an input audio signal. The system determines an application identifier and query. The system provides the query to the application to cause the application to generate a second query for transmission to a third-party server, and identify responses to the query. The system intercepts the responses, and generates a keyword based on the responses. The system selects a digital component using the keyword, executes a deduplication process, and determines to add the digital component to the responses. The system constructs a display output using a graphical user interface template that integrates the plurality of responses generated by the application with the digital component, and provides the display output to the computing device for rendering.
-
Publication No.: US11145300B2
Publication Date: 2021-10-12
Application No.: US16064961
Filing Date: 2018-05-07
Applicant: Google LLC
Inventor: Gaurav Bhaya , Ulas Kirazci , Bradley Abrams , Adam Coimbra , Ilya Firman , Carey Radebaugh
IPC: G10L15/22 , G10L15/18 , G10L15/30 , G06F40/216 , G06F40/30 , G06F16/901 , G06F40/35
Abstract: The present disclosure is generally directed to the generation of voice-activated data flows in an interconnected network. The voice-activated data flows can include input audio signals that include a request and are detected at a client device. The client device can transmit the input audio signal to a data processing system, where the input audio signal can be parsed and passed to the data processing system of a service provider to fulfill the request in the input audio signal. The present solution is configured to conserve network resources by reducing the number of network transmissions needed to fulfill a request.
-
Publication No.: US11087760B2
Publication Date: 2021-08-10
Application No.: US16696622
Filing Date: 2019-11-26
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets
IPC: G10L15/00 , G10L15/22 , H04L12/825 , G10L15/26 , G06F3/16 , G10L15/14 , G10L15/18 , G10L15/30 , G10L15/08
Abstract: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
-
Publication No.: US20210210115A1
Publication Date: 2021-07-08
Application No.: US17205895
Filing Date: 2021-03-18
Applicant: Google LLC
Inventor: Anshul Kothari , Gaurav Bhaya , Tarun Jain
Abstract: Coordinating signal processing among computing devices in a voice-driven computing environment is provided. A first and second digital assistant can detect an input audio signal, perform a signal quality check, and provide indications that the first and second digital assistants are operational to process the input audio signal. A system can select the first digital assistant for further processing. The system can receive, from the first digital assistant, data packets including a command. The system can generate, for a network connected device selected from a plurality of network connected devices, an action data structure based on the data packets, and transmit the action data structure to the selected network connected device.
-
Publication No.: US11037555B2
Publication Date: 2021-06-15
Application No.: US15764907
Filing Date: 2017-12-08
Applicant: Google LLC
Inventor: Anshul Kothari , Gaurav Bhaya , Tarun Jain
Abstract: Coordinating signal processing among computing devices in a voice-driven computing environment is provided. A first and second digital assistant can detect an input audio signal, perform a signal quality check, and provide indications that the first and second digital assistants are operational to process the input audio signal. A system can select the first digital assistant for further processing. The system can receive, from the first digital assistant, data packets including a command. The system can generate, for a network connected device selected from a plurality of network connected devices, an action data structure based on the data packets, and transmit the action data structure to the selected network connected device.
-
Publication No.: US20210097997A1
Publication Date: 2021-04-01
Application No.: US17104645
Filing Date: 2020-11-25
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets
IPC: G10L15/22 , G10L15/26 , G06F3/16 , G06F16/332 , G06F16/33 , G06F40/40 , G06F40/284 , G10L15/00 , G10L15/14 , G10L15/18
Abstract: Optimization of sequence dependent operations in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. A prediction component can determine a thread based on the trigger keyword and the request that includes a first action, a second action subsequent to the first action, and a third action subsequent to the second action. A content selector component can select, based on the third action and the trigger keyword, a content item. An audio signal generator component can generate an output signal comprising the content item. An interface can transmit the output signal to cause a client computing device to drive a speaker to generate an acoustic wave corresponding to the output signal prior to occurrence of at least one of the first action and the second action.
-
Publication No.: US10748541B2
Publication Date: 2020-08-18
Application No.: US16666780
Filing Date: 2019-10-29
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets , Umesh Patil
Abstract: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
-
Publication No.: US20200251103A1
Publication Date: 2020-08-06
Application No.: US16853251
Filing Date: 2020-04-20
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert James Stets, Jr.
IPC: G10L15/18 , G10L15/22 , G06F16/332 , G06F16/33 , H04M3/22 , H04M3/493 , H04L29/06 , H04L29/08 , G06F16/9032 , G06F40/205 , H04B17/309 , G10L25/69 , G10L25/87 , G10L25/90
Abstract: A feedback control system for data transmissions in a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signals to identify a trigger keyword and a request. The system can select a content item using the trigger keyword or the request. The content item can be configured to establish a communication session between the device and a third party device. The system can monitor the communication session to measure a characteristic of the communication session. The system can generate a quality signal based on the measured characteristic.