ASYNCHRONOUS OPTIMIZATION FOR SEQUENCE TRAINING OF NEURAL NETWORKS
    1.
    发明申请
    ASYNCHRONOUS OPTIMIZATION FOR SEQUENCE TRAINING OF NEURAL NETWORKS 有权
    神经网络序列训练的异步优化

    公开(公告)号:US20150127337A1

    公开(公告)日:2015-05-07

    申请号:US14258139

    申请日:2014-04-22

    Applicant: Google Inc.

    CPC classification number: G10L15/063 G06N3/0454 G10L15/16 G10L15/183

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for obtaining, by a first sequence-training speech model, a first batch of training frames that represent speech features of first training utterances; obtaining, by the first sequence-training speech model, one or more first neural network parameters; determining, by the first sequence-training speech model, one or more optimized first neural network parameters based on (i) the first batch of training frames and (ii) the one or more first neural network parameters; obtaining, by a second sequence-training speech model, a second batch of training frames that represent speech features of second training utterances; obtaining one or more second neural network parameters; and determining, by the second sequence-training speech model, one or more optimized second neural network parameters based on (i) the second batch of training frames and (ii) the one or more second neural network parameters.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于通过第一序列训练语音模型获得表示第一训练话语的语音特征的第一批训练帧; 通过所述第一序列训练语音模型获得一个或多个第一神经网络参数; 基于(i)第一批训练帧和(ii)所述一个或多个第一神经网络参数,通过所述第一序列训练语音模型确定一个或多个优化的第一神经网络参数; 通过第二序列训练语音模型获得表示第二训练语音的语音特征的第二批训练帧; 获得一个或多个第二神经网络参数; 以及通过所述第二序列训练语音模型,基于(i)第二批训练帧和(ii)所述一个或多个第二神经网络参数来确定一个或多个优化的第二神经网络参数。

    Speaker verification using neural networks
    2.
    发明授权
    Speaker verification using neural networks 有权
    使用神经网络的扬声器验证

    公开(公告)号:US09401148B2

    公开(公告)日:2016-07-26

    申请号:US14228469

    申请日:2014-03-28

    Applicant: Google Inc.

    CPC classification number: G10L17/18

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for inputting speech data that corresponds to a particular utterance to a neural network; determining an evaluation vector based on output at a hidden layer of the neural network; comparing the evaluation vector with a reference vector that corresponds to a past utterance of a particular speaker; and based on comparing the evaluation vector and the reference vector, determining whether the particular utterance was likely spoken by the particular speaker.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于将对应于特定话语的语音数据输入到神经网络; 基于所述神经网络的隐藏层的输出确定评估向量; 将评估向量与对应于特定说话者的过去发音的参考向量进行比较; 并且基于比较评估向量和参考向量,确定特定发音是否可能由特定说话者说出。

    Asynchronous optimization for sequence training of neural networks

    公开(公告)号:US10019985B2

    公开(公告)日:2018-07-10

    申请号:US14258139

    申请日:2014-04-22

    Applicant: Google Inc.

    CPC classification number: G10L15/063 G06N3/0454 G10L15/16 G10L15/183

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for obtaining, by a first sequence-training speech model, a first batch of training frames that represent speech features of first training utterances; obtaining, by the first sequence-training speech model, one or more first neural network parameters; determining, by the first sequence-training speech model, one or more optimized first neural network parameters based on (i) the first batch of training frames and (ii) the one or more first neural network parameters; obtaining, by a second sequence-training speech model, a second batch of training frames that represent speech features of second training utterances; obtaining one or more second neural network parameters; and determining, by the second sequence-training speech model, one or more optimized second neural network parameters based on (i) the second batch of training frames and (ii) the one or more second neural network parameters.

    SPEAKER VERIFICATION USING NEURAL NETWORKS
    4.
    发明申请
    SPEAKER VERIFICATION USING NEURAL NETWORKS 有权
    使用神经网络的扬声器验证

    公开(公告)号:US20150127336A1

    公开(公告)日:2015-05-07

    申请号:US14228469

    申请日:2014-03-28

    Applicant: Google Inc.

    CPC classification number: G10L17/18

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for inputting speech data that corresponds to a particular utterance to a neural network; determining an evaluation vector based on output at a hidden layer of the neural network; comparing the evaluation vector with a reference vector that corresponds to a past utterance of a particular speaker; and based on comparing the evaluation vector and the reference vector, determining whether the particular utterance was likely spoken by the particular speaker.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于将对应于特定话语的语音数据输入到神经网络; 基于所述神经网络的隐藏层的输出确定评估向量; 将评估向量与对应于特定说话者的过去发音的参考向量进行比较; 并且基于比较评估向量和参考向量,确定特定发音是否可能由特定说话者说出。

Patent Agency Ranking