-
公开(公告)号:US11782686B2
公开(公告)日:2023-10-10
申请号:US17459968
申请日:2021-08-27
Applicant: salesforce.com, inc.
Inventor: Yue Wang , Weishi Wang , Shafiq Rayhan Joty , Chu Hong Hoi
CPC classification number: G06F8/427 , G06F18/214 , G06F40/20 , G06N3/047 , G06N3/084
Abstract: Embodiments described herein a code generation and understanding model that builds on a Transformer-based encoder-decoder framework. The code generation and understanding model is configured to derive generic representations for programming language (PL) and natural language (NL) in code domain via pre-training on unlabeled code corpus, and then to benefit many code-related downstream tasks with fine-tuning. Apart from the denoising sequence-to-sequence objectives widely adopted for pre-training on natural language, identifier tagging and prediction pre-training objective is adopted to enable the model to better leverage the crucial token type information from PL, which specifically are the identifiers assigned by developers.
-
公开(公告)号:US20230154146A1
公开(公告)日:2023-05-18
申请号:US17566061
申请日:2021-12-30
Applicant: salesforce.com, inc.
Inventor: Dongxu Li , Junnan Li , Chu Hong Hoi
IPC: G06V10/74 , G06V10/774 , G06F40/279 , G06V20/40 , G06V10/776
CPC classification number: G06V10/761 , G06V10/774 , G06F40/279 , G06V20/47 , G06V20/41 , G06V10/776 , G06V20/46
Abstract: Embodiments described a method of video-text pre-learning to effectively learn cross-modal representations from sparse video frames and text. Specifically, an align and prompt framework provides a video and language pre-training framework that encodes the frames and text independently using a transformer-based video encoder and a text encoder. A multi-modal encoder is then employed to capture cross-modal interaction between a plurality of video frames and a plurality of texts. The pre-training includes a prompting entity modeling that enables the model to capture fine-grained region-entity alignment.
-
公开(公告)号:US20220382856A1
公开(公告)日:2022-12-01
申请号:US17514487
申请日:2021-10-29
Applicant: salesforce.com, inc.
Inventor: Wenzhuo Yang , Chu Hong Hoi , Kun Zhang
Abstract: Embodiments described herein provide a causality-based anomaly detection mechanism that formulates multivariate time series as instances that do not follow the regular causal mechanism. Specifically, the causality-based anomaly detection mechanism leverages the causal structure discovered from data so that the joint distribution of multivariate time series is factorized into simpler modules where each module corresponds to a local causal mechanism, reflected by the corresponding conditional distribution. Those local mechanisms are modular or autonomous and can then be handled separately. In light of this modularity property, the anomaly detection problem then naturally decomposed into a series of low-dimensional anomaly detection problems. Each sub-problem is concerned with a local mechanism.
-
44.
公开(公告)号:US11487999B2
公开(公告)日:2022-11-01
申请号:US16860977
申请日:2020-04-28
Applicant: salesforce.com, inc.
Inventor: Hung Le , Chu Hong Hoi
Abstract: A system and method for generating a response in a video grounded dialogue are provided. A video-grounded dialogue neural network language model receives video input and text input. The text input includes a dialogue history between the model and a human user and a current utterance by the user. Encoded video input is generated using video encoding layers. Encoded text input is generated using text encoding layers. The encoded video input and the encoded text input are concatenated in to a single input sequence. A generative pre-trained transformer model generates the response to the current utterance from the singe input sequence.
-
公开(公告)号:US20220269946A1
公开(公告)日:2022-08-25
申请号:US17375728
申请日:2021-07-14
Applicant: salesforce.com, inc.
Inventor: Pan Zhou , Caiming Xiong , Chu Hong Hoi
Abstract: Embodiments described herein provide a contrastive learning mechanism with self-labeling refinement, which iteratively employs the network and data themselves to generate more accurate and informative soft labels for contrastive learning. Specifically, the contrastive learning framework includes a self-labeling refinery module to explicitly generate accurate labels, and a momentum mix-up module to increase similarity between a query and its positive, which in turn implicitly improves label accuracy.
-
公开(公告)号:US11416688B2
公开(公告)日:2022-08-16
申请号:US16870571
申请日:2020-05-08
Applicant: salesforce.com, inc.
Inventor: Chien-Sheng Wu , Chu Hong Hoi , Caiming Xiong
Abstract: Embodiments described in this disclosure illustrate the use of self-/semi supervised approaches for label-efficient DST in task-oriented dialogue systems. Conversational behavior is modeled by next response generation and turn utterance generation tasks. Prediction consistency is strengthened by augmenting data with stochastic word dropout and label guessing. Experimental results show that by exploiting self-supervision the joint goal accuracy can be boosted with limited labeled data.
-
公开(公告)号:US20220139384A1
公开(公告)日:2022-05-05
申请号:US17088206
申请日:2020-11-03
Applicant: salesforce.com, inc.
Inventor: Chien-Sheng Wu , Chu Hong Hoi , Richard Socher , Caiming Xiong
Abstract: Embodiments described herein provide methods and systems for training task-oriented dialogue (TOD) language models. In some embodiments, a TOD language model may receive a TOD dataset including a plurality of dialogues and a model input sequence may be generated from the dialogues using a first token prefixed to each user utterance and a second token prefixed to each system response of the dialogues. In some embodiments, the first token or the second token may be randomly replaced with a mask token to generate a masked training sequence and a masked language modeling (MLM) loss may be computed using the masked training sequence. In some embodiments, the TOD language model may be updated based on the MLM loss.
-
公开(公告)号:US20220114481A1
公开(公告)日:2022-04-14
申请号:US17162931
申请日:2021-01-29
Applicant: salesforce.com, inc.
Inventor: Wenzhuo Yang , Jia Li , Chu Hong Hoi , Caiming Xiong
Abstract: Embodiments described herein provide a two-stage model-agnostic approach for generating counterfactual explanation via counterfactual feature selection and counterfactual feature optimization. Given a query instance, counterfactual feature selection picks a subset of feature columns and values that can potentially change the prediction and then counterfactual feature optimization determines the best feature value for the selected feature as a counterfactual example.
-
公开(公告)号:US11263476B2
公开(公告)日:2022-03-01
申请号:US16870621
申请日:2020-05-08
Applicant: salesforce.com, inc.
Inventor: Junnan Li , Chu Hong Hoi
Abstract: The system and method are directed to a prototypical contrastive learning (PCL). The PCL explicitly encodes the hierarchical semantic structure of the dataset into the learned embedding space and prevents the network from exploiting low-level cues for solving the unsupervised learning task. The PCL includes prototypes as the latent variables to help find the maximum-likelihood estimation of the network parameters in an expectation-maximization framework. The PCL iteratively performs an E-step for finding prototypes with clustering and M-step for optimizing the network on a contrastive loss.
-
50.
公开(公告)号:US20210375280A1
公开(公告)日:2021-12-02
申请号:US17014458
申请日:2020-09-08
Applicant: salesforce.com, inc.
Inventor: Weishi Wang , Shafiq Rayhan Joty , Chu Hong Hoi
Abstract: Embodiments described herein provide a dynamic topic tracking mechanism that tracks how the conversation topics change from one utterance to another and use the tracking information to rank candidate responses. A pre-trained language model may be used for response selection in the multi-party conversations, which consists of two steps: (1) a topic-based pre-training to embed topic information into the language model with self-supervised learning, and (2) a multi-task learning on the pretrained model by jointly training response selection and dynamic topic prediction and disentanglement tasks.
-
-
-
-
-
-
-
-
-