ZERO-SHOT DOMAIN GENERALIZATION WITH PRIOR KNOWLEDGE

    公开(公告)号:US20240062043A1

    公开(公告)日:2024-02-22

    申请号:US18364746

    申请日:2023-08-03

    IPC分类号: G06N3/0455 G06N3/08

    CPC分类号: G06N3/0455 G06N3/08

    摘要: A computer-implemented method for employing a graph-based adaptive domain generation framework is provided. The method includes, in a training phase, performing domain prototypical network training on source domains, constructing an autoencoding domain relation graph by applying a graph autoencoder to produce domain node embeddings, and performing, via a domain-adaptive classifier, domain-adaptive classifier training to make an informed decision. The method further includes, in a testing phase, given testing samples from a new source domain, computing a prototype by using a pretrained domain prototypical network, inferring node embedding, and making a prediction by the domain-adaptive classifier based on the domain node embeddings.

    DYNAMIC CAUSAL DISCOVERY IN IMITATION LEARNING

    公开(公告)号:US20240054373A1

    公开(公告)日:2024-02-15

    申请号:US18471570

    申请日:2023-09-21

    IPC分类号: G06N7/01 G06N20/00

    CPC分类号: G06N7/01 G06N20/00

    摘要: A method for learning a self-explainable imitator by discovering causal relationships between states and actions is presented. The method includes obtaining, via an acquisition component, demonstrations of a target task from experts for training a model to generate a learned policy, training the model, via a learning component, the learning component computing actions to be taken with respect to states, generating, via a dynamic causal discovery component, dynamic causal graphs for each environment state, encoding, via a causal encoding component, discovered causal relationships by updating state variable embeddings, and outputting, via an output component, the learned policy including trajectories similar to the demonstrations from the experts.

    META IMITATION LEARNING WITH STRUCTURED SKILL DISCOVERY

    公开(公告)号:US20240046091A1

    公开(公告)日:2024-02-08

    申请号:US18484793

    申请日:2023-10-11

    IPC分类号: G06N3/08 G06N20/00

    CPC分类号: G06N3/08 G06N20/00

    摘要: A method for acquiring skills through imitation learning by employing a meta imitation learning framework with structured skill discovery (MILD) is presented. The method includes learning behaviors or tasks, by an agent, from demonstrations: by learning to decompose the demonstrations into segments, via a segmentation component, the segments corresponding to skills that are transferrable across different tasks, learning relationships between the skills that are transferrable across the different tasks, employing, via a graph generator, a graph neural network for learning implicit structures of the skills from the demonstrations to define structured skills, and generating policies from the structured skills to allow the agent to acquire the structured skills for application to one or more target tasks.

    INTERPRETING CONVOLUTIONAL SEQUENCE MODEL BY LEARNING LOCAL AND RESOLUTION-CONTROLLABLE PROTOTYPES

    公开(公告)号:US20240037397A1

    公开(公告)日:2024-02-01

    申请号:US18479385

    申请日:2023-10-02

    IPC分类号: G06N3/08 G06N3/04

    CPC分类号: G06N3/08 G06N3/04

    摘要: A method interprets a convolutional sequence model. The method converts an input data sequence having input segments into output features. The method clusters the input segments into clusters using respective resolution-controllable class prototypes allocated to each of classes. Each respective class prototype includes a respective output feature subset characterizing a respective associated class. The method calculates, using the clusters, similarity scores that indicate a similarity of an output feature to a respective class prototypes responsive to distances between the output feature and the respective class prototypes. The method concatenates the similarity scores to obtain a similarity vector. The method performs a prediction and prediction support operation that provides a value of prediction and an interpretation for the value responsive to the input segments and similarity vector. The interpretation for the value of prediction is provided using only non-negative weights and lacking a weight bias in the fully connected layer.

    INTERPRETING CONVOLUTIONAL SEQUENCE MODEL BY LEARNING LOCAL AND RESOLUTION-CONTROLLABLE PROTOTYPES

    公开(公告)号:US20240028897A1

    公开(公告)日:2024-01-25

    申请号:US18479326

    申请日:2023-10-02

    IPC分类号: G06N3/08 G06N3/04

    CPC分类号: G06N3/08 G06N3/04

    摘要: A method interprets a convolutional sequence model. The method converts an input data sequence having input segments into output features. The method clusters the input segments into clusters using respective resolution-controllable class prototypes allocated to each of classes. Each respective class prototype includes a respective output feature subset characterizing a respective associated class. The method calculates, using the clusters, similarity scores that indicate a similarity of an output feature to a respective class prototypes responsive to distances between the output feature and the respective class prototypes. The method concatenates the similarity scores to obtain a similarity vector. The method performs a prediction and prediction support operation that provides a value of prediction and an interpretation for the value responsive to the input segments and similarity vector. The interpretation for the value of prediction is provided using only non-negative weights and lacking a weight bias in the fully connected layer.

    MEDICAL EVENT PREDICTION USING A PERSONALIZED DUAL-CHANNEL COMBINER NETWORK

    公开(公告)号:US20240013920A1

    公开(公告)日:2024-01-11

    申请号:US18370074

    申请日:2023-09-19

    摘要: Systems and methods for predicting an occurrence of a medical event for a patient using a trained neural network. Historical patient data is preprocessed to generate normalized training samples, and the normalized training samples are sent to a personalized deep convolutional neural network for model pretraining and updating of model parameters. The pretrained model is stored in a remote server for utilization by a local machine for personalization during a preparation time period for a medical treatment. A normalized finetuning set is generated as output, and the model parameters are iteratively finetuned. A personal prediction score for future medical events is generated, and an operation of a medical treatment device is controlled responsive to the prediction score.

    Medical Event Prediction Using a Personalized Dual-Channel Combiner Network

    公开(公告)号:US20240006069A1

    公开(公告)日:2024-01-04

    申请号:US18370049

    申请日:2023-09-19

    摘要: Systems and methods for predicting an occurrence of a medical event for a patient using a trained neural network. Historical patient data is preprocessed to generate normalized training samples, and the normalized training samples are sent to a personalized deep convolutional neural network for model pretraining and updating of model parameters. The pretrained model is stored in a remote server for utilization by a local machine for personalization during a preparation time period for a medical treatment. A normalized finetuning set is generated as output, and the model parameters are iteratively finetuned. A personal prediction score for future medical events is generated, and an operation of a medical treatment device is controlled responsive to the prediction score.

    Multi-scale multi-granularity spatial-temporal traffic volume prediction

    公开(公告)号:US11842271B2

    公开(公告)日:2023-12-12

    申请号:US17003112

    申请日:2020-08-26

    IPC分类号: G06N3/04 G06N3/08 G06N3/049

    CPC分类号: G06N3/08 G06N3/049

    摘要: Methods and systems for allocating network resources responsive to network traffic include modeling spatial correlations between fine spatial granularity traffic and coarse spatial granularity traffic for different sites and regions to determine spatial feature vectors for one or more sites in a network. Temporal correlations at a fine spatial granularity are modeled across multiple temporal scales, based on the spatial feature vectors. Temporal correlations at a coarse spatial granularity are modeled across multiple temporal scales, based on the spatial feature vectors. A traffic flow prediction is determined for the one or more sites in the network, based on the temporal correlations at the fine spatial granularity and the temporal correlations at the coarse spatial granularity. Network resources are provisioned at the one or more sites in accordance with the traffic flow prediction.