-
公开(公告)号:US20220067534A1
公开(公告)日:2022-03-03
申请号:US17006570
申请日:2020-08-28
Applicant: salesforce.com, inc.
Inventor: Junwen Bai , Weiran Wang , Yingbo Zhou , Caiming Xiong
Abstract: Embodiments described herein combine both masked reconstruction and predictive coding. Specifically, unlike contrastive learning, the mutual information between past states and future states are directly estimated. The context information can also be directly captured via shifted masked reconstruction—unlike standard masked reconstruction, the target reconstructed observations are shifted slightly towards the future to incorporate more predictability. The estimated mutual information and shifted masked reconstruction loss can then be combined as the loss function to update the neural model.
-
公开(公告)号:US12198060B2
公开(公告)日:2025-01-14
申请号:US17006570
申请日:2020-08-28
Applicant: Salesforce.com, Inc.
Inventor: Junwen Bai , Weiran Wang , Yingbo Zhou , Caiming Xiong
IPC: G06N3/088 , G06F18/21 , G06F18/214 , G06N3/049
Abstract: Embodiments described herein combine both masked reconstruction and predictive coding. Specifically, unlike contrastive learning, the mutual information between past states and future states are directly estimated. The context information can also be directly captured via shifted masked reconstruction—unlike standard masked reconstruction, the target reconstructed observations are shifted slightly towards the future to incorporate more predictability. The estimated mutual information and shifted masked reconstruction loss can then be combined as the loss function to update the neural model.
-
公开(公告)号:US11328731B2
公开(公告)日:2022-05-10
申请号:US16903964
申请日:2020-06-17
Applicant: salesforce.com, inc.
Inventor: Weiran Wang , Yingbo Zhou , Caiming Xiong
IPC: G10L15/26
Abstract: System and methods for identifying a text word from a spoken utterance are provided. An ensemble BPE system that includes a phone BPE system and a character BPE system receives a spoken utterance. Both BPE systems include a multi-level language model (LM) and an acoustic model. The phone BPE system identifies first words from the spoken utterance and determine a first score for each first word. The first words are converted into character sequences. The character BPE model converts the character sequences into second words and determines a second score for each second word. For each word from the first words that matches a word in the second words the first and second scores are combined. The text word is the word with a highest score.
-
公开(公告)号:US20210319796A1
公开(公告)日:2021-10-14
申请号:US16903964
申请日:2020-06-17
Applicant: salesforce.com, inc.
Inventor: Weiran Wang , Yingbo Zhou , Caiming Xiong
IPC: G10L15/26
Abstract: System and methods for identifying a text word from a spoken utterance are provided. An ensemble BPE system that includes a phone BPE system and a character BPE system receives a spoken utterance. Both BPE systems include a multi-level language model (LM) and an acoustic model. The phone BPE system identifies first words from the spoken utterance and determine a first score for each first word. The first words are converted into character sequences. The character BPE model converts the character sequences into second words and determines a second score for each second word. For each word from the first words that matches a word in the second words the first and second scores are combined. The text word is the word with a highest score.
-
-
-