PERFORMING MACHINE LEARNING TASKS USING INSTRUCTION-TUNED NEURAL NETWORKS

    公开(公告)号:US20230205994A1

    公开(公告)日:2023-06-29

    申请号:US17561581

    申请日:2021-12-23

    Applicant: Google LLC

    CPC classification number: G06F40/284 G06F40/30 G06N3/10 G06N5/04

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing a machine learning task on an input to generate an output. In one aspect, one of the method includes receiving input data that describes an input of a machine learning task; receiving candidate output data that describes a set of candidate classification outputs of the machine learning task for the input; generating an input sequence that includes the input and the set of candidate classification outputs; processing the input sequence using a neural network to generate a network output that specifies a respective score for each candidate classification output in the set of candidate classification outputs; and generating an output of the machine learning task for the input, comprising selecting, as the output, a selected candidate classification output from the set of candidate classification outputs using the respective scores.

    Instruction Fine-Tuning Machine-Learned Models Using Intermediate Reasoning Steps

    公开(公告)号:US20240256965A1

    公开(公告)日:2024-08-01

    申请号:US18424624

    申请日:2024-01-26

    Applicant: Google LLC

    CPC classification number: G06N20/00

    Abstract: An example method for training a machine-learned sequence processing model includes obtaining a plurality of training examples for training the machine-learned sequence processing model. For each respective training example of the plurality of training examples, the example method includes: obtaining a respective query associated with the respective training example; inputting the respective query to the machine-learned sequence processing model; obtaining, from the machine-learned sequence processing model a response to the respective query and a trace of intermediate states from the respective query to the response; evaluating the response using a ground truth response associated with the respective training example; evaluating the trace using a ground truth trace associated with the respective training example; and updating one or more parameters of the machine-learned sequence processing model based on the evaluation of the response and based on the evaluation of the trace.

Patent Agency Ranking