VISION-LANGUAGE MODEL WITH AN ENSEMBLE OF EXPERTS

    公开(公告)号:US20240265690A1

    公开(公告)日:2024-08-08

    申请号:US18544840

    申请日:2023-12-19

    CPC classification number: G06V10/82 G06V10/811

    Abstract: A vision-language model learns skills and domain knowledge via distinct and separate task-specific neural networks, referred to as experts. Each expert is independently optimized for a specific task, facilitating the use of domain-specific data and architectures that are not feasible with a single large neural network trained for multiple tasks. The vision-language model implemented as an ensemble of pre-trained experts and is more efficiently trained compared with the single large neural network. During training, the vision-language model integrates specialized skills and domain knowledge, rather than trying to simultaneously learn multiple tasks, resulting in effective multi-modal learning.

    FAIRNESS-BASED NEURAL NETWORK MODEL TRAINING USING REAL AND GENERATED DATA

    公开(公告)号:US20240144000A1

    公开(公告)日:2024-05-02

    申请号:US18307227

    申请日:2023-04-26

    CPC classification number: G06N3/08

    Abstract: A neural network model is trained for fairness and accuracy using both real and synthesized training data, such as images. During training a first sampling ratio between the real and synthesized training data is optimized. The first sampling ratio may comprise a value for each group (or attribute), where each value is optimized. A second sampling ratio defines relative amounts of training data that are used for each one of the groups. Furthermore, a neural network model accuracy and a fairness metric are both used for updating the first and second sampling ratios during training iterations. The neural network model may be trained using different classes of training data. The second sampling ratio may vary for each class.

Patent Agency Ranking