TECHNIQUES FOR MACHINE LEARNING MODEL SELECTION FOR DOMAIN GENERALIZATION

    公开(公告)号:US20230368078A1

    公开(公告)日:2023-11-16

    申请号:US17663595

    申请日:2022-05-16

    CPC classification number: G06N20/20 G06K9/6227 G06K9/6262

    Abstract: A computing device may perform training of a set of machine learning models on a first data set associated with a first domain. In some examples, the training may include, for each machine learning model of the set of machine learning models, inputting, as values for a set of parameters of the respective sets of parameters and for an iteration of a set of iterations, a moving average of the set of parameters calculated over a threshold number of previous iterations. The computing device may select a set of model states that are generated during the training of the plurality of machine learning models based on a validation performance of the set of model states performed during the training. The computing device may then generate an ensembled machine learning model by aggregating the set of machine learning models corresponding to the set of selected model states.

    SYSTEMS AND METHODS FOR EDITING A LARGE LANGUAGE MODEL

    公开(公告)号:US20250124233A1

    公开(公告)日:2025-04-17

    申请号:US18428530

    申请日:2024-01-31

    Abstract: Systems and methods for editing a large language model are provided. The large language model generates a sequence of tokens, a first probability of a pre-edit output based on the sequence of tokens, and a second probability of a target output based on the sequence of tokens. A loss function is provided based on the first probability and the second probability. A plurality of gradients of the large language model with respect to the loss function is computed. An edit location of the large language model is determined based on the plurality of gradients. The large language model is edited by editing weights at the edit location of the large language model, such that the updated large language model generates the target output for an input including the sequence of words.

Patent Agency Ranking