Training neural networks using data augmentation policies

    公开(公告)号:US11847541B2

    公开(公告)日:2023-12-19

    申请号:US17556871

    申请日:2021-12-20

    Applicant: Google LLC

    CPC classification number: G06N20/00 G06F18/214 G06F18/217 G06N3/08 G06N3/04

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a machine learning model. One of the methods includes obtaining a training data set for training a machine learning model, the training data set comprising a plurality of training inputs; determining a plurality of data augmentation policies, wherein each data augmentation policy defines a procedure for processing a training input to generate a transformed training input; for each data augmentation policy, training the machine learning model using the data augmentation policy; determining, for each data augmentation policy, a quality measure of the machine learning model that has been trained using the data augmentation policy; and selecting a final data augmentation policy based using the quality measures of the machine learning models.

    NEURAL ARCHITECTURE SEARCH
    3.
    发明公开

    公开(公告)号:US20230368024A1

    公开(公告)日:2023-11-16

    申请号:US18226527

    申请日:2023-07-26

    Applicant: Google LLC

    CPC classification number: G06N3/08 G06F18/217 G06N3/044 G06N3/045

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining neural network architectures. One of the methods includes generating, using a controller neural network, a batch of output sequences, each output sequence in the batch defining a respective architecture of a child neural network that is configured to perform a particular neural network task; for each output sequence in the batch: training a respective instance of the child neural network having the architecture defined by the output sequence; evaluating a performance of the trained instance of the child neural network on the particular neural network task to determine a performance metric for the trained instance of the child neural network on the particular neural network task; and using the performance metrics for the trained instances of the child neural network to adjust the current values of the controller parameters of the controller neural network.

    NEURAL ARCHITECTURE SEARCH FOR CONVOLUTIONAL NEURAL NETWORKS

    公开(公告)号:US20230252327A1

    公开(公告)日:2023-08-10

    申请号:US18137398

    申请日:2023-04-20

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining neural network architectures. One of the methods includes generating, using a controller neural network having controller parameters and in accordance with current values of the controller parameters, a batch of output sequences. The method includes, for each output sequence in the batch: generating an instance of a child convolutional neural network (CNN) that includes multiple instances of a first convolutional cell having an architecture defined by the output sequence; training the instance of the child CNN to perform an image processing task; and evaluating a performance of the trained instance of the child CNN on the task to determine a performance metric for the trained instance of the child CNN; and using the performance metrics for the trained instances of the child CNN to adjust current values of the controller parameters of the controller neural network.

    NEURAL ARCHITECTURE SEARCH
    5.
    发明申请

    公开(公告)号:US20210295163A1

    公开(公告)日:2021-09-23

    申请号:US17340959

    申请日:2021-06-07

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining neural network architectures. One of the methods includes generating, using a controller neural network, a batch of output sequences, each output sequence in the batch defining a respective architecture of a child neural network that is configured to perform a particular neural network task; for each output sequence in the batch: training a respective instance of the child neural network having the architecture defined by the output sequence; evaluating a performance of the trained instance of the child neural network on the particular neural network task to determine a performance metric for the trained instance of the child neural network on the particular neural network task; and using the performance metrics for the trained instances of the child neural network to adjust the current values of the controller parameters of the controller neural network.

    LEARNING DATA AUGMENTATION POLICIES

    公开(公告)号:US20210019658A1

    公开(公告)日:2021-01-21

    申请号:US17061103

    申请日:2020-10-01

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for learning a data augmentation policy for training a machine learning model. In one aspect, a method includes: receiving training data for training a machine learning model to perform a particular machine learning task; determining multiple data augmentation policies, comprising, at each of multiple time steps: generating a current data augmentation policy based on quality measures of data augmentation policies generated at previous time steps; training a machine learning model on the training data using the current data augmentation policy; and determining a quality measure of the current data augmentation policy using the machine learning model after it has been trained using the current data augmentation policy; and selecting a final data augmentation policy based on the quality measures of the determined data augmentation policies.

    TRAINING NEURAL NETWORKS USING DATA AUGMENTATION POLICIES

    公开(公告)号:US20240273410A1

    公开(公告)日:2024-08-15

    申请号:US18544347

    申请日:2023-12-18

    Applicant: Google LLC

    CPC classification number: G06N20/00 G06F18/214 G06F18/217 G06N3/08 G06N3/04

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a machine learning model. One of the methods includes obtaining a training data set for training a machine learning model, the training data set comprising a plurality of training inputs; determining a plurality of data augmentation policies, wherein each data augmentation policy defines a procedure for processing a training input to generate a transformed training input; for each data augmentation policy, training the machine learning model using the data augmentation policy; determining, for each data augmentation policy, a quality measure of the machine learning model that has been trained using the data augmentation policy; and selecting a final data augmentation policy based using the quality measures of the machine learning models.

    TRAINING NEURAL NETWORKS USING DATA AUGMENTATION POLICIES

    公开(公告)号:US20220114400A1

    公开(公告)日:2022-04-14

    申请号:US17556871

    申请日:2021-12-20

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a machine learning model. One of the methods includes obtaining a training data set for training a machine learning model, the training data set comprising a plurality of training inputs; determining a plurality of data augmentation policies, wherein each data augmentation policy defines a procedure for processing a training input to generate a transformed training input; for each data augmentation policy, training the machine learning model using the data augmentation policy; determining, for each data augmentation policy, a quality measure of the machine learning model that has been trained using the data augmentation policy; and selecting a final data augmentation policy based using the quality measures of the machine learning models.

Patent Agency Ranking