-
1.
公开(公告)号:US20240296843A1
公开(公告)日:2024-09-05
申请号:US18657405
申请日:2024-05-07
申请人: GOOGLE LLC
IPC分类号: G10L15/22 , G10L15/065 , G10L15/10 , G10L15/30
CPC分类号: G10L15/22 , G10L15/065 , G10L15/10 , G10L15/30
摘要: Processor(s) of a client device can: receive sensor data that captures environmental attributes of an environment of the client device; process the sensor data using a machine learning model to generate a predicted output that dictates whether one or more currently dormant automated assistant functions are activated; making a decision as to whether to trigger the one or more currently dormant automated assistant functions; subsequent to making the decision, determining that the decision was incorrect; and in response to determining that the determination was incorrect, generating a gradient based on comparing the predicted output to ground truth output. In some implementations, the generated gradient is used, by processor(s) of the client device, to update weights of the on-device speech recognition model. In some implementations, the generated gradient is additionally or alternatively transmitted to a remote system for use in remote updating of global weights of a global speech recognition model.
-
公开(公告)号:US20240112672A1
公开(公告)日:2024-04-04
申请号:US17959637
申请日:2022-10-04
申请人: GOOGLE LLC
发明人: Rajiv Mathews , Dragan Zivkovic , Khe Chai Sim
CPC分类号: G10L15/19 , G10L15/063 , G10L15/22 , G10L15/30 , G10L2015/0635
摘要: On-device processor(s) of a client device may store, in on-device storage and in association with a time to live (TTL) in the on-device storage, a correction directed to ASR processing of audio data. The correction may include a portion of a given speech hypothesis that was modified to an alternate speech hypothesis. Further, the on-device processor(s) may cause an on-device ASR model to be personalized based on the correction. Moreover, and based on additional ASR processing of additional audio data, the on-device processor(s) may store, in the on-device storage and in association with an additional TTL in the on-device storage, a pseudo-correction directed to the additional ASR processing. Accordingly, the on-device processor(s) may cause the on-device ASR model to be personalized based on the pseudo-correction to prevent forgetting by the on-device ASR model.
-
公开(公告)号:US20210327421A1
公开(公告)日:2021-10-21
申请号:US16973572
申请日:2019-11-08
申请人: Google LLC
IPC分类号: G10L15/22 , G10L15/065 , G10L15/10 , G10L15/30
摘要: Processor(s) of a client device can: receive sensor data that captures environmental attributes of an environment of the client device; process the sensor data using a machine learning model to generate a predicted output that dictates whether one or more currently dormant automated assistant functions are activated; making a decision as to whether to trigger the one or more currently dormant automated assistant functions; subsequent to making the decision, determining that the decision was incorrect; and in response to determining that the determination was incorrect, generating a gradient based on comparing the predicted output to ground truth output. In some implementations, the generated gradient is used, by processor(s) of the client device, to update weights of the on-device speech recognition model. In some implementations, the generated gradient is additionally or alternatively transmitted to a remote system for use in remote updating of global weights of a global speech recognition model.
-
公开(公告)号:US11955134B2
公开(公告)日:2024-04-09
申请号:US17643848
申请日:2021-12-13
申请人: Google LLC
发明人: Ehsan Amid , Om Thakkar , Rajiv Mathews , Francoise Beaufays
IPC分类号: G10L21/0332 , G10L15/06 , G10L15/08 , G10L21/10
CPC分类号: G10L21/0332 , G10L15/063 , G10L15/08 , G10L21/10
摘要: A method of phrase extraction for ASR models includes obtaining audio data characterizing an utterance and a corresponding ground-truth transcription of the utterance and modifying the audio data to obfuscate a particular phrase recited in the utterance. The method also includes processing, using a trained ASR model, the modified audio data to generate a predicted transcription of the utterance, and determining whether the predicted transcription includes the particular phrase by comparing the predicted transcription of the utterance to the ground-truth transcription of the utterance. When the predicted transcription includes the particular phrase, the method includes generating an output indicating that the trained ASR model leaked the particular phrase from a training data set used to train the ASR model.
-
5.
公开(公告)号:US20240112673A1
公开(公告)日:2024-04-04
申请号:US17958887
申请日:2022-10-03
申请人: GOOGLE LLC
发明人: Rajiv Mathews , Rohit Prabhavalkar , Giovanni Motta , Mingqing Chen , Lillian Zhou , Dhruv Guliani , Harry Zhang , Trevor Strohman , Françoise Beaufays
IPC分类号: G10L15/197 , G10L15/06 , G10L15/22 , G10L15/30
CPC分类号: G10L15/197 , G10L15/063 , G10L15/22 , G10L15/30 , G10L2015/0635
摘要: Implementations described herein identify and correct automatic speech recognition (ASR) misrecognitions. For example, on-device processor(s) of a client device may generate a predicted textual segment that is predicted to correspond to spoken utterance of a user of the client device, and may receive further input that modifies the predicted textual segment to an alternate textual segment. Further, the on-device processor(s) may store these textual segments in on-device storage as a candidate correction pair, and transmit the candidate correction pair to a remote system. Moreover, remote processor(s) of the remote system may determine that the candidate correction pair is an actual correction pair, and may cause client devices to generate updates for a global ASR model for the candidate correction pair. Additionally, the remote processor(s) may distribute the global ASR model to the client devices and/or additional client devices.
-
6.
公开(公告)号:US20230352019A1
公开(公告)日:2023-11-02
申请号:US18218818
申请日:2023-07-06
申请人: GOOGLE LLC
IPC分类号: G10L15/22 , G10L15/065 , G10L15/10 , G10L15/30
CPC分类号: G10L15/22 , G10L15/065 , G10L15/10 , G10L15/30
摘要: Processor(s) of a client device can: receive sensor data that captures environmental attributes of an environment of the client device; process the sensor data using a machine learning model to generate a predicted output that dictates whether one or more currently dormant automated assistant functions are activated; making a decision as to whether to trigger the one or more currently dormant automated assistant functions; subsequent to making the decision, determining that the decision was incorrect; and in response to determining that the determination was incorrect, generating a gradient based on comparing the predicted output to ground truth output. In some implementations, the generated gradient is used, by processor(s) of the client device, to update weights of the on-device speech recognition model. In some implementations, the generated gradient is additionally or alternatively transmitted to a remote system for use in remote updating of global weights of a global speech recognition model.
-
公开(公告)号:US20230352004A1
公开(公告)日:2023-11-02
申请号:US18218319
申请日:2023-07-05
申请人: GOOGLE LLC
发明人: Françoise Beaufays , Andrew Hard , Swaroop Indra Ramaswamy , Om Dipakbhai Thakkar , Rajiv Mathews
IPC分类号: G10L15/065 , G10L13/04 , G10L15/26 , G10L15/30
CPC分类号: G10L15/065 , G10L13/04 , G10L15/26 , G10L15/30
摘要: Implementations disclosed herein are directed to federated learning of machine learning (“ML”) model(s) based on gradient(s) generated at corresponding client devices and a remote system. Processor(s) of the corresponding client devices can process client data generated locally at the corresponding client devices using corresponding on-device ML model(s) to generate corresponding predicted outputs, generate corresponding client gradients based on the corresponding predicted outputs, and transmit the corresponding client gradients to the remote system. Processor(s) of the remote system can process remote data obtained from remote database(s) using global ML model(s) to generate additional corresponding predicted outputs, generate corresponding remote gradients based on the additional corresponding predicted outputs. Further, the remote system can utilize the corresponding client gradients and the corresponding remote gradients to update the global ML model(s) or weights thereof. The updated global ML model(s) and/or the updated weights thereof can be transmitted back to the corresponding client devices.
-
公开(公告)号:US20220383204A1
公开(公告)日:2022-12-01
申请号:US17535405
申请日:2021-11-24
申请人: GOOGLE LLC
发明人: Om Dipakbhai Thakkar , Trung Dang , Swaroop Indra Ramaswamy , Rajiv Mathews , Françoise Beaufays
IPC分类号: G06N20/20
摘要: Implementations relate to ascertaining to what extent predictions, generated using a machine learning model, can be effectively reconstructed from model updates, where the model updates are generated based on those predictions and based on applying a particular loss technique (e.g., a particular cross-entropy loss technique). Some implementations disclosed generate measures that each indicate a degree of conformity between a corresponding reconstruction, generated using a corresponding model update, and a corresponding prediction. In some of those implementations, the measures are utilized in determining whether to utilize the particular loss technique (utilized in generating the model updates) in federated learning of the machine learning model and/or of additional machine learning model(s).
-
公开(公告)号:US20240330767A1
公开(公告)日:2024-10-03
申请号:US18611628
申请日:2024-03-20
申请人: Google LLC
发明人: Andrew Hard , Rajiv Mathews
IPC分类号: G06N20/00
CPC分类号: G06N20/00
摘要: A method includes training a client machine learning (ML) model on client training data at a client device. While training the client ML model, the method also includes obtaining, from a server, server model weights of a server ML model trained on server training data, the server training data different that the client training data. While training the client ML model, the method also includes: transmitting, to the server, client model weights of the client ML model; updating the client ML model using the server model weights; obtaining, from the server, updated server model weights of the server ML model, the updated server model weights updated based on the transmitted client model weights; and further updating the client ML model using the updated server model weights.
-
10.
公开(公告)号:US20240265269A1
公开(公告)日:2024-08-08
申请号:US18125613
申请日:2023-03-23
申请人: GOOGLE LLC
摘要: Implementations disclosed herein are directed to techniques for enabling decentralized learning of global language models (LMs). Remote processor(s) of a remote system can obtain a global LM that includes a global embedding matrix, generate a global embedding mask for the global embedding matrix using a masking technique, apply the global embedding mask to global embedding matrix to generate a sparsified global LM that includes a masked global embedding matrix that is a masked version of the global embedding matrix, transmit the sparsified global LM to computing device(s) that are participating in a given round of decentralized learning for the global language model, receive corresponding updates from the computing device(s), and cause the global LM to be updated based on the corresponding updates. By generating the global embedding mask and applying it to the global embedding matrix, the transferable size of the global LM is reduced thereby enabling decentralized learning thereof.
-
-
-
-
-
-
-
-
-