-
公开(公告)号:US10417557B2
公开(公告)日:2019-09-17
申请号:US15856453
申请日:2017-12-28
Applicant: Google LLC
Inventor: Samy Bengio , Oriol Vinyals , Alexander Toshkov Toshev , Dumitru Erhan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating descriptions of input images. One of the methods includes obtaining an input image; processing the input image using a first neural network to generate an alternative representation for the input image; and processing the alternative representation for the input image using a second neural network to generate a sequence of a plurality of words in a target natural language that describes the input image.
-
公开(公告)号:US10409908B2
公开(公告)日:2019-09-10
申请号:US14976121
申请日:2015-12-21
Applicant: Google LLC
Inventor: Oriol Vinyals , Lukasz Mieczyslaw Kaiser
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating parse trees for input text segments. One of the methods includes obtaining an input text segment, processing the input text segment using a first long short term memory (LSTM) neural network to convert the input text segment into an alternative representation for the input text segment, and processing the alternative representation for the input text segment using a second LSTM neural network to generate a linearized representation of a parse tree for the input text segment.
-
公开(公告)号:US10133739B2
公开(公告)日:2018-11-20
申请号:US14921925
申请日:2015-10-23
Applicant: GOOGLE LLC
Inventor: Quoc V. Le , Minh-Thang Luong , Ilya Sutskever , Oriol Vinyals , Wojciech Zaremba
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for neural translation systems with rare word processing. One of the methods is a method training a neural network translation system to track the source in source sentences of unknown words in target sentences, in a source language and a target language, respectively and includes deriving alignment data from a parallel corpus, the alignment data identifying, in each pair of source and target language sentences in the parallel corpus, aligned source and target words; annotating the sentences in the parallel corpus according to the alignment data and a rare word model to generate a training dataset of paired source and target language sentences; and training a neural network translation model on the training dataset.
-
公开(公告)号:US20180204112A1
公开(公告)日:2018-07-19
申请号:US15856453
申请日:2017-12-28
Applicant: Google LLC
Inventor: Samy Bengio , Oriol Vinyals , Alexander Toshkov Toshev , Dumitru Erhan
CPC classification number: G06N3/0472 , G06F17/28 , G06N3/0454
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating descriptions of input images. One of the methods includes obtaining an input image; processing the input image using a first neural network to generate an alternative representation for the input image; and processing the alternative representation for the input image using a second neural network to generate a sequence of a plurality of words in a target natural language that describes the input image.
-
公开(公告)号:US20240296313A1
公开(公告)日:2024-09-05
申请号:US18662584
申请日:2024-05-13
Applicant: Google LLC
Inventor: Samy Bengio , Oriol Vinyals , Alexander Toshkov Toshev , Dumitru Erhan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating descriptions of input images. One of the methods includes obtaining an input image; processing the input image using a first neural network to generate an alternative representation for the input image; and processing the alternative representation for the input image using a second neural network to generate a sequence of a plurality of words in a target natural language that describes the input image.
-
公开(公告)号:US20240144109A1
公开(公告)日:2024-05-02
申请号:US18399358
申请日:2023-12-28
Applicant: Google LLC
Inventor: Oriol Vinyals , Jeffrey Adgate Dean , Geoffrey E. Hinton
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a distilled machine learning model. One of the methods includes training a cumbersome machine learning model, wherein the cumbersome machine learning model is configured to receive an input and generate a respective score for each of a plurality of classes; and training a distilled machine learning model on a plurality of training inputs, wherein the distilled machine learning model is also configured to receive inputs and generate scores for the plurality of classes, comprising: processing each training input using the cumbersome machine learning model to generate a cumbersome target soft output for the training input; and training the distilled machine learning model to, for each of the training inputs, generate a soft output that matches the cumbersome target soft output for the training input.
-
公开(公告)号:US11900232B2
公开(公告)日:2024-02-13
申请号:US17863733
申请日:2022-07-13
Applicant: Google LLC
Inventor: Oriol Vinyals , Jeffrey Adgate Dean , Geoffrey E. Hinton
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a distilled machine learning model. One of the methods includes training a cumbersome machine learning model, wherein the cumbersome machine learning model is configured to receive an input and generate a respective score for each of a plurality of classes; and training a distilled machine learning model on a plurality of training inputs, wherein the distilled machine learning model is also configured to receive inputs and generate scores for the plurality of classes, comprising: processing each training input using the cumbersome machine learning model to generate a cumbersome target soft output for the training input; and training the distilled machine learning model to, for each of the training inputs, generate a soft output that matches the cumbersome target soft output for the training input.
-
公开(公告)号:US11151985B2
公开(公告)日:2021-10-19
申请号:US16713298
申请日:2019-12-13
Applicant: Google LLC
Inventor: William Chan , Navdeep Jaitly , Quoc V. Le , Oriol Vinyals , Noam M. Shazeer
IPC: G10L15/16 , G06N3/04 , G06F40/12 , G06F40/197 , G10L15/183 , G10L15/26 , G10L25/30
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for speech recognition. One method includes obtaining an input acoustic sequence, the input acoustic sequence representing an utterance, and the input acoustic sequence comprising a respective acoustic feature representation at each of a first number of time steps, processing the input acoustic sequence using a first neural network to convert the input acoustic sequence into an alternative representation for the input acoustic sequence, processing the alternative representation for the input acoustic sequence using an attention-based Recurrent Neural Network (RNN) to generate, for each position in an output sequence order, a set of substring scores that includes a respective substring score for each substring in a set of substrings; and generating a sequence of substrings that represent a transcription of the utterance.
-
公开(公告)号:US11074454B1
公开(公告)日:2021-07-27
申请号:US16410863
申请日:2019-05-13
Applicant: Google LLC
Inventor: Sudheendra Vijayanarasimhan , George Dan Toderici , Yue Hei Ng , Matthew John Hausknecht , Oriol Vinyals , Rajat Monga
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for classifying videos using neural networks. One of the methods includes obtaining a temporal sequence of video frames, wherein the temporal sequence comprises a respective video frame from a particular video at each of a plurality time steps; for each time step of the plurality of time steps: processing the video frame at the time step using a convolutional neural network to generate features of the video frame; and processing the features of the video frame using an LSTM neural network to generate a set of label scores for the time step and classifying the video as relating to one or more of the topics represented by labels in the set of labels from the label scores for each of the plurality of time steps.
-
公开(公告)号:US20210056162A1
公开(公告)日:2021-02-25
申请号:US16989455
申请日:2020-08-10
Applicant: Google LLC
Inventor: Mostafa Dehghani , Stephan Gouws , Oriol Vinyals , Jakob D. Uszkoreit , Lukasz Mieczyslaw Kaiser
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for implementing a sequence to sequence model that is recurrent in depth while employing self-attention to combine information from different parts of sequences.
-
-
-
-
-
-
-
-
-