-
公开(公告)号:US20210312903A1
公开(公告)日:2021-10-07
申请号:US17353764
申请日:2021-06-21
Applicant: Google LLC
Inventor: Siddhi Tadpatrikar , Michael Buchanan , Pravir Kumar Gupta
IPC: G10L15/05 , G06F16/683 , G10L15/04 , G10L15/065 , G10L15/22 , G10L15/26 , G10L25/78
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing are described. In one aspect, a method includes the action of accessing voice query log data that includes voice queries spoken by a particular user. The actions further include based on the voice query log data that includes voice queries spoken by a particular user, determining a pause threshold from the voice query log data that includes voice queries spoken by the particular user. The actions further include receiving, from the particular user, an utterance. The actions further include determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold. The actions further include based on determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold, processing the utterance as a voice query.
-
公开(公告)号:US11062696B2
公开(公告)日:2021-07-13
申请号:US16377767
申请日:2019-04-08
Applicant: Google LLC
Inventor: Siddhi Tadpatrikar , Michael Buchanan , Pravir Kumar Gupta
IPC: G10L15/04 , G10L15/05 , G06F16/683 , G10L15/065 , G10L15/22 , G10L15/26 , G10L25/78 , G10L15/07
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing are described. In one aspect, a method includes the action of accessing voice query log data that includes voice queries spoken by a particular user. The actions further include based on the voice query log data that includes voice queries spoken by a particular user, determining a pause threshold from the voice query log data that includes voice queries spoken by the particular user. The actions further include receiving, from the particular user, an utterance. The actions further include determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold. The actions further include based on determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold, processing the utterance as a voice query.
-
公开(公告)号:US20200027448A1
公开(公告)日:2020-01-23
申请号:US16586612
申请日:2019-09-27
Applicant: Google LLC
Inventor: Vikram Aggarwal , Pravir Kumar Gupta
IPC: G10L15/18 , G06F16/332 , G10L15/26 , G06F3/16
Abstract: Technology of the disclosure may facilitate user discovery of various voice-based action queries that can be spoken to initiate computer-based actions, such as voice-based action queries that can be provided as spoken input to a computing device to initiate computer-based actions that are particularized to content being viewed or otherwise consumed by the user on the computing device. Some implementations are generally directed to determining, in view of content recently viewed by a user on a computing device, at least one suggested voice-based action query for presentation via the computing device. Some implementations are additionally or alternatively generally directed to receiving at least one suggested voice-based action query at a computing device and providing the suggested voice-based action query as a suggestion in response to input to initiate providing of a voice-based query via the computing device.
-
公开(公告)号:US10269341B2
公开(公告)日:2019-04-23
申请号:US15196663
申请日:2016-06-29
Applicant: Google LLC
Inventor: Siddhi Tadpatrikar , Michael Buchanan , Pravir Kumar Gupta
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing are described. In one aspect, a method includes the action of accessing voice query log data that includes voice queries spoken by a particular user. The actions further include based on the voice query log data that includes voice queries spoken by a particular user, determining a pause threshold from the voice query log data that includes voice queries spoken by the particular user. The actions further include receiving, from the particular user, an utterance. The actions further include determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold. The actions further include based on determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold, processing the utterance as a voice query.
-
公开(公告)号:US20240144924A1
公开(公告)日:2024-05-02
申请号:US18406484
申请日:2024-01-08
Applicant: GOOGLE LLC
Inventor: Vikram Aggarwal , Pravir Kumar Gupta
IPC: G10L15/18 , G06F3/16 , G06F16/332 , G10L15/26
CPC classification number: G10L15/1822 , G06F3/167 , G06F16/3322 , G06F16/3323 , G10L15/26 , G10L2015/223
Abstract: Technology of the disclosure may facilitate user discovery of various voice-based action queries that can be spoken to initiate computer-based actions, such as voice-based action queries that can be provided as spoken input to a computing device to initiate computer-based actions that are particularized to content being viewed or otherwise consumed by the user on the computing device. Some implementations are generally directed to determining, in view of content recently viewed by a user on a computing device, at least one suggested voice-based action query for presentation via the computing device. Some implementations are additionally or alternatively generally directed to receiving at least one suggested voice-based action query at a computing device and providing the suggested voice-based action query as a suggestion in response to input to initiate providing of a voice-based query via the computing device.
-
公开(公告)号:US20230237988A1
公开(公告)日:2023-07-27
申请号:US18189270
申请日:2023-03-24
Applicant: Google LLC
Inventor: Michael BUCHANAN , Pravir Kumar Gupta , Christopher Bo Tandiono
CPC classification number: G10L15/05 , G10L17/06 , G10L15/04 , G10L15/22 , G10L15/26 , G10L25/87 , G10L25/51 , G10L25/78
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing based on word comparisons are described. In one aspect, a method includes the actions of obtaining a transcription of an utterance. The actions further include determining, as a first value, a quantity of text samples in a collection of text samples that (i) include terms that match the transcription, and (ii) do not include any additional terms. The actions further include determining, as a second value, a quantity of text samples in the collection of text samples that (i) include terms that match the transcription, and (ii) include one or more additional terms. The actions further include classifying the utterance as a likely incomplete utterance or not a likely incomplete utterance based at least on comparing the first value and the second value.
-
公开(公告)号:US11657816B2
公开(公告)日:2023-05-23
申请号:US17099130
申请日:2020-11-16
Applicant: Google LLC
Inventor: Bo Wang , Sunil Vemuri , Nitin Mangesh Shetti , Pravir Kumar Gupta , Scott B. Huffman , Javier Alejandro Rey , Jeffrey A. Boortz
CPC classification number: G10L15/22 , G06F3/167 , G10L15/1815 , G10L15/19 , G10L2015/0638 , G10L2015/088 , G10L2015/223
Abstract: Methods, systems, and apparatus, for defining and monitoring an event for a physical entity and the performance of an action in response to the occurrence of the event. A method includes receiving data indicating an event for a physical entity, the event specified in part by a physical environment feature for which the occurrence of the event is to be monitored by the data processing apparatus; receiving data indicating an action associated with the event and to be taken in response to the occurrence of the event; monitoring for the occurrence of the event for the physical entity; and in response to the occurrence of the event, causing the action associated with the event to be performed.
-
公开(公告)号:US20220310110A1
公开(公告)日:2022-09-29
申请号:US17841458
申请日:2022-06-15
Applicant: Google LLC
Inventor: Omer Bar-or , Scott B. Huffman , Ida Mayer , Arthur E. Blume , Pravir Kumar Gupta
Abstract: Systems, methods and apparatus for invoking actions at a second user device from a first user device. A method includes determining that a first user device has an associated second user device; accessing specification data that specifies a set of user device actions that the second user device is configured to perform; receiving command inputs for the first user device; for each command input, determining whether the command input resolves to one of the user device actions; for each command input not determined to resolve any of the user device actions, causing the command input to be processed at the first user device; and for each command input determined to resolve one of the user device actions causing the first user device to display in a user interface a dialog by which a user may either accept or deny invoking the user device action at the second user device.
-
公开(公告)号:US20210248995A1
公开(公告)日:2021-08-12
申请号:US17245019
申请日:2021-04-30
Applicant: Google LLC
Inventor: Michael Buchanan , Pravir Kumar Gupta , Christopher Bo Tandiono
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing based on word comparisons are described. In one aspect, a method includes the actions of obtaining a transcription of an utterance. The actions further include determining, as a first value, a quantity of text samples in a collection of text samples that (i) include terms that match the transcription, and (ii) do not include any additional terms. The actions further include determining, as a second value, a quantity of text samples in the collection of text samples that (i) include terms that match the transcription, and (ii) include one or more additional terms. The actions further include classifying the utterance as a likely incomplete utterance or not a likely incomplete utterance based at least on comparing the first value and the second value.
-
公开(公告)号:US10741183B2
公开(公告)日:2020-08-11
申请号:US16101940
申请日:2018-08-13
Applicant: Google LLC
Inventor: Bo Wang , Sunil Vemuri , Barnaby John James , Pravir Kumar Gupta , Nitin Mangesh Shetti
Abstract: Methods, systems, and apparatus for receiving, by a voice action system, data specifying trigger terms that trigger an application to perform a voice action and a context that specifies a status of the application when the voice action can be triggered. The voice action system receives data defining a discoverability example for the voice action that comprises one or more of the trigger terms that trigger the application to perform the voice action when a status of the application satisfies the specified context. The voice action system receives a request for discoverability examples for the application from a user device having the application installed, and provides the data defining the discoverability examples to the user device in response to the request. The user device is configured to provide a notification of the one or more of the trigger terms when a status of the application satisfies the specified context.
-
-
-
-
-
-
-
-
-