-
公开(公告)号:US20240143691A1
公开(公告)日:2024-05-02
申请号:US18544245
申请日:2023-12-18
Applicant: Google LLC
Inventor: Mostafa Dehghani , Stephan Gouws , Oriol Vinyals , Jakob D. Uszkoreit , Lukasz Mieczyslaw Kaiser
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for implementing a sequence-to-sequence model that is recurrent in depth while employing self-attention to combine information from different parts of sequences.
-
公开(公告)号:US11954594B1
公开(公告)日:2024-04-09
申请号:US17315695
申请日:2021-05-10
Applicant: Google LLC
Inventor: Samy Bengio , Oriol Vinyals , Navdeep Jaitly , Noam M. Shazeer
IPC: G06N3/08
CPC classification number: G06N3/08
Abstract: This document generally describes a neural network training system, including one or more computers, that trains a recurrent neural network (RNN) to receive an input, e.g., an input sequence, and to generate a sequence of outputs from the input sequence. In some implementations, training can include, for each position after an initial position in a training target sequence, selecting a preceding output of the RNN to provide as input to the RNN at the position, including determining whether to select as the preceding output (i) a true output in a preceding position in the output order or (ii) a value derived from an output of the RNN for the preceding position in an output order generated in accordance with current values of the parameters of the recurrent neural network.
-
公开(公告)号:US20220180151A1
公开(公告)日:2022-06-09
申请号:US17679625
申请日:2022-02-24
Applicant: Google LLC
Inventor: Oriol Vinyals , Samuel Bengio
IPC: G06N3/04
Abstract: In one aspect, this specification describes a recurrent neural network system implemented by one or more computers that is configured to process input sets to generate neural network outputs for each input set. The input set can be a collection of multiple inputs for which the recurrent neural network should generate the same neural network output regardless of the order in which the inputs are arranged in the collection. The recurrent neural network system can include a read neural network, a process neural network, and a write neural network. In another aspect, this specification describes a system implemented as computer programs on one or more computers in one or more locations that is configured to train a recurrent neural network that receives a neural network input and sequentially emits outputs to generate an output sequence for the neural network input.
-
公开(公告)号:US11195521B2
公开(公告)日:2021-12-07
申请号:US16781273
申请日:2020-02-04
Applicant: Google LLC
Inventor: Navdeep Jaitly , Quoc V. Le , Oriol Vinyals , Samuel Bengio , Ilya Sutskever
IPC: G10L15/00 , G10L15/16 , G06N3/04 , G10L15/26 , G06F40/58 , G06F40/274 , G06F40/55 , G10L15/02 , G05B13/02
Abstract: A system can be configured to perform tasks such as converting recorded speech to a sequence of phonemes that represent the speech, converting an input sequence of graphemes into a target sequence of phonemes, translating an input sequence of words in one language into a corresponding sequence of words in another language, or predicting a target sequence of words that follow an input sequence of words in a language (e.g., a language model). In a speech recognizer, the RNN system may be used to convert speech to a target sequence of phonemes in real-time so that a transcription of the speech can be generated and presented to a user, even before the user has completed uttering the entire speech input.
-
公开(公告)号:US20200251099A1
公开(公告)日:2020-08-06
申请号:US16781273
申请日:2020-02-04
Applicant: Google LLC
Inventor: Navdeep Jaitly , Quoc V. Le , Oriol Vinyals , Samuel Bengio , Ilya Sutskever
Abstract: A system can be configured to perform tasks such as converting recorded speech to a sequence of phonemes that represent the speech, converting an input sequence of graphemes into a target sequence of phonemes, translating an input sequence of words in one language into a corresponding sequence of words in another language, or predicting a target sequence of words that follow an input sequence of words in a language (e.g., a language model). In a speech recognizer, the RNN system may be used to convert speech to a target sequence of phonemes in real-time so that a transcription of the speech can be generated and presented to a user, even before the user has completed uttering the entire speech input.
-
公开(公告)号:US10650328B2
公开(公告)日:2020-05-12
申请号:US16368526
申请日:2019-03-28
Applicant: Google LLC
Inventor: Oriol Vinyals , Jeffrey Adgate Dean , Geoffrey E. Hinton
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a distilled machine learning model. One of the methods includes training a cumbersome machine learning model, wherein the cumbersome machine learning model is configured to receive an input and generate a respective score for each of a plurality of classes; and training a distilled machine learning model on a plurality of training inputs, wherein the distilled machine learning model is also configured to receive inputs and generate scores for the plurality of classes, comprising: processing each training input using the cumbersome machine learning model to generate a cumbersome target soft output for the training input; and training the distilled machine learning model to, for each of the training inputs, generate a soft output that matches the cumbersome target soft output for the training input.
-
公开(公告)号:US10559300B2
公开(公告)日:2020-02-11
申请号:US16055414
申请日:2018-08-06
Applicant: Google LLC
Inventor: Navdeep Jaitly , Quoc V. Le , Oriol Vinyals , Samuel Bengio , Ilya Sutskever
Abstract: A system can be configured to perform tasks such as converting recorded speech to a sequence of phonemes that represent the speech, converting an input sequence of graphemes into a target sequence of phonemes, translating an input sequence of words in one language into a corresponding sequence of words in another language, or predicting a target sequence of words that follow an input sequence of words in a language (e.g., a language model). In a speech recognizer, the RNN system may be used to convert speech to a target sequence of phonemes in real-time so that a transcription of the speech can be generated and presented to a user, even before the user has completed uttering the entire speech input.
-
公开(公告)号:US10289962B2
公开(公告)日:2019-05-14
申请号:US14731349
申请日:2015-06-04
Applicant: Google LLC
Inventor: Oriol Vinyals , Jeffrey A. Dean , Geoffrey E. Hinton
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a distilled machine learning model. One of the methods includes training a cumbersome machine learning model, wherein the cumbersome machine learning model is configured to receive an input and generate a respective score for each of a plurality of classes; and training a distilled machine learning model on a plurality of training inputs, wherein the distilled machine learning model is also configured to receive inputs and generate scores for the plurality of classes, comprising: processing each training input using the cumbersome machine learning model to generate a cumbersome target soft output for the training input; and training the distilled machine learning model to, for each of the training inputs, generate a soft output that matches the cumbersome target soft output for the training input.
-
公开(公告)号:US10229111B1
公开(公告)日:2019-03-12
申请号:US15423852
申请日:2017-02-03
Applicant: Google LLC
Inventor: Ekaterina Filippova , Enrique Alfonseca , Carlos Alberto Colmenares Rojas , Lukasz Mieczyslaw Kaiser , Oriol Vinyals
Abstract: Methods, systems, apparatus, including computer programs encoded on computer storage medium, for generating a sentence summary. In one aspect, the method includes actions of tokenizing the sentence into a plurality of tokens, processing data representative of each token in a first order using an LSTM neural network to initialize an internal state of a second LSTM neural network, processing data representative of each token in a second order using the second LSTM neural network, comprising, for each token in the sentence: processing the data representative of the token using the second LSTM neural network in accordance with a current internal state of the second LSTM neural network to (i) generate an LSTM output for the token, and (ii) to update the current internal state of the second LSTM neural network, and generating the summarized version of the sentence using the outputs of the second LSTM neural network for the tokens.
-
公开(公告)号:US12100391B2
公开(公告)日:2024-09-24
申请号:US17450235
申请日:2021-10-07
Applicant: Google LLC
Inventor: William Chan , Navdeep Jaitly , Quoc V. Le , Oriol Vinyals , Noam M. Shazeer
IPC: G10L15/16 , G06F40/12 , G06F40/197 , G06N3/044 , G06N3/045 , G10L15/183 , G10L15/26 , G10L25/30
CPC classification number: G10L15/16 , G06F40/12 , G06F40/197 , G06N3/044 , G06N3/045 , G10L15/183 , G10L15/26 , G10L25/30
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for speech recognition. One method includes obtaining an input acoustic sequence, the input acoustic sequence representing an utterance, and the input acoustic sequence comprising a respective acoustic feature representation at each of a first number of time steps; processing the input acoustic sequence using a first neural network to convert the input acoustic sequence into an alternative representation for the input acoustic sequence; processing the alternative representation for the input acoustic sequence using an attention-based Recurrent Neural Network (RNN) to generate, for each position in an output sequence order, a set of substring scores that includes a respective substring score for each substring in a set of substrings; and generating a sequence of substrings that represent a transcription of the utterance.
-
-
-
-
-
-
-
-
-