-
公开(公告)号:US20230359895A1
公开(公告)日:2023-11-09
申请号:US18313291
申请日:2023-05-05
Applicant: Google LLC
Inventor: Xiangning Chen , Chen Liang , Da Huang , Esteban Alberto Real , Yao Liu , Kaiyuan Wang , Yifeng Lu , Quoc V. Le
IPC: G06N3/08
CPC classification number: G06N3/08
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to perform a machine learning task using a momentum and sign based optimizer.
-
公开(公告)号:US20220383195A1
公开(公告)日:2022-12-01
申请号:US17795087
申请日:2021-02-08
Applicant: Google LLC
Inventor: Chen Liang , David Richard So , Esteban Alberto Real , Quoc V. Le
Abstract: A method for searching for an output machine learning (ML) algorithm to perform an ML task is described. The method includes: receiving a set of training examples and a set of validation examples, and generating a sequence of candidate ML algorithms to perform the task. For each candidate ML algorithm in the sequence, the method includes: setting up one or more training parameters for the candidate ML algorithm by executing a respective candidate setup function, training the candidate ML algorithm by processing the set of training examples using a respective candidate predict function and a respective candidate learn function, and evaluating a performance of the trained candidate ML algorithm by executing the respective candidate predict function on the set of validation examples to determine a performance metric. The method includes selecting a trained candidate ML algorithm with the best performance metric as the output ML algorithm for the task.
-
公开(公告)号:US10997503B2
公开(公告)日:2021-05-04
申请号:US16447866
申请日:2019-06-20
Applicant: Google LLC
Inventor: David Martin Dohan , David Richard So , Chen Liang , Quoc V. Le
Abstract: A method for receiving training data for training a neural network to perform a machine learning task and for searching for, using the training data, an optimized neural network architecture for performing the machine learning task is described. Searching for the optimized neural network architecture includes: maintaining population data; maintaining threshold data; and repeatedly performing the following operations: selecting one or more candidate architectures from the population data; generating a new architecture from the one or more selected candidate architectures; for the new architecture: training a neural network having the new architecture until termination criteria for the training are satisfied; and determining a final measure of fitness of the neural network having the new architecture after the training; and adding data defining the new architecture and the final measure of fitness for the neural network having the new architecture to the population data.
-
公开(公告)号:US20190130251A1
公开(公告)日:2019-05-02
申请号:US16176961
申请日:2018-10-31
Applicant: Google LLC
Inventor: Ni Lao , Chen Liang , Quoc V. Le , John Blitzer
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a system output from a system input using a neural network system comprising an encoder neural network configured to, for each of a plurality of encoder time steps, receive an input sequence comprising a respective question token, and process the question token at the encoder time step to generate an encoded representation of the question token, and a decoder neural network configured to, for each of a plurality of decoder time steps, receive a decoder input, and process the decoder input and a preceding decoder hidden state to generate an updated decoder hidden state.
-
公开(公告)号:US20220180207A1
公开(公告)日:2022-06-09
申请号:US17211200
申请日:2021-03-24
Applicant: Google LLC
Inventor: Chen Liang , Da Huang , Yifeng Lu
Abstract: Provided is an end-to-end pipeline (e.g., which may be implemented in TensorFlow) which leverages a specialized search space to generate custom models which provide improved time series prediction.
-
公开(公告)号:US20210256390A1
公开(公告)日:2021-08-19
申请号:US17306813
申请日:2021-05-03
Applicant: Google LLC
Inventor: David Martin Dohan , David Richard So , Chen Liang , Quoc V. Le
Abstract: A method for receiving training data for training a neural network to perform a machine learning task and for searching for, using the training data, an optimized neural network architecture for performing the machine learning task is described. Searching for the optimized neural network architecture includes: maintaining population data; maintaining threshold data; and repeatedly performing the following operations: selecting one or more candidate architectures from the population data; generating a new architecture from the one or more selected candidate architectures; for the new architecture: training a neural network having the new architecture until termination criteria for the training are satisfied; and determining a final measure of fitness of the neural network having the new architecture after the training; and adding data defining the new architecture and the final measure of fitness for the neural network having the new architecture to the population data.
-
公开(公告)号:US20210256313A1
公开(公告)日:2021-08-19
申请号:US17180682
申请日:2021-02-19
Applicant: Google LLC
Inventor: Rishabh Agarwal , Chen Liang , Dale Eric Schuurmans , Mohammad Norouzi
Abstract: Methods and systems for learning policies using sparse and underspecified rewards. One of the methods includes training the policy jointly with an auxiliary reward function having a plurality of auxiliary reward parameters, the auxiliary reward function being configured to map, in accordance with the auxiliary reward parameters, trajectory features of at least a trajectory to an auxiliary reward value that indicates how well the trajectory performed a task in response to a context input.
-
公开(公告)号:US11954442B2
公开(公告)日:2024-04-09
申请号:US16986534
申请日:2020-08-06
Applicant: Google LLC
Inventor: Chen Liang , Wei Yu , Quoc V. Le , Xinyun Chen , Dengyong Zhou
IPC: G06F40/30 , G06F16/33 , G06F40/20 , G06N3/045 , G06N3/08 , G06N20/00 , G06F40/216 , G06F40/284
CPC classification number: G06F40/30 , G06F16/3347 , G06F40/20 , G06N3/045 , G06N3/08 , G06N20/00 , G06F40/216 , G06F40/284
Abstract: The present disclosure is directed to systems and methods for performing reading comprehension with machine learning. More specifically, the present disclosure is directed to a Neural Symbolic Reader (example implementations of which may be referred to as NeRd), which includes a reader to encode the passage and question, and a programmer to generate a program for multi-step reasoning. By using operators like span selection, the program can be executed over a natural language text passage to generate an answer to a natural language text question. NeRd is domain-agnostic such that the same neural architecture works for different domains. Further, NeRd is compositional such that complex programs can be generated by compositionally applying the symbolic operators.
-
-
-
-
-
-
-