Patent search ap:("Google Inc.") AND inv:"Javier Gonzalvo Fructuoso" Page 1

1.

发明申请
Methods and Systems for Automated Generation of Nativized Multi-Lingual Lexicons 有权
Title translation: 自动生成多语言词汇的方法和系统

公开(公告)号：US20150095018A1

公开(公告)日：2015-04-02

申请号：US14283586

申请日：2014-05-21

Applicant: Google Inc.

Inventor： Javier Gonzalvo Fructuoso , Ioannis Agiomyrgiannakis

IPC: G06F17/27

CPC classification number: G10L13/086 , G06F17/277 , G10L13/08 , G10L15/063 , G10L15/187 , G10L2015/0633

Abstract: An input signal that includes linguistic content in a first language may be received by a computing device. The linguistic content may include text or speech. The computing device may associate the linguistic content in the first language with one or more phonemes from a second language. The computing device may also determine a phonemic representation of the linguistic content in the first language based on use of the one or more phonemes from the second language. The phonemic representation may be indicative of a pronunciation of the linguistic content in the first language according to speech sounds of the second language.

Abstract translation: 包括第一语言的语言内容的输入信号可以被计算设备接收。语言内容可能包括文字或言语。计算设备可将第一语言中的语言内容与来自第二语言的一个或多个音素相关联。计算设备还可以基于来自第二语言的一个或多个音素的使用来确定第一语言中的语言内容的音位表示。根据第二语言的语音，音素表示可以指示第一语言中的语言内容的发音。

2.

发明授权
Methods and systems for automated generation of nativized multi-lingual lexicons 有权
Title translation: 自动生成本土化多语言词典的方法和系统

公开(公告)号：US08768704B1

公开(公告)日：2014-07-01

申请号：US14053052

申请日：2013-10-14

Applicant: Google Inc.

Inventor： Javier Gonzalvo Fructuoso , Ioannis Agiomyrgiannakis

IPC: G10L13/00 , G10L13/08

CPC classification number: G10L13/086 , G06F17/277 , G10L13/08 , G10L15/063 , G10L15/187 , G10L2015/0633

Abstract: An input signal that includes linguistic content in a first language may be received by a computing device. The linguistic content may include text or speech. Based on an acoustic feature comparison between a plurality of first-language speech sounds and a plurality of second-language speech sounds, the computing device may associate the linguistic content in the first language with one or more phonemes from a second language. The computing device may also determine a phonemic representation of the linguistic content in the first language based on use of the one or more phonemes from the second language. The phonemic representation may be indicative of a pronunciation of the linguistic content in the first language according to speech sounds of the second language.

Abstract translation: 包括第一语言的语言内容的输入信号可以被计算设备接收。语言内容可能包括文字或言语。基于多个第一语言语音和多个第二语言语音之间的声学特征比较，计算设备可将第一语言中的语言内容与来自第二语言的一个或多个音素相关联。计算设备还可以基于来自第二语言的一个或多个音素的使用来确定第一语言中的语言内容的音位表示。根据第二语言的语音，音素表示可以指示第一语言中的语言内容的发音。

3.

发明授权
Deep networks for unit selection speech synthesis 有权
Title translation: 深层网络单元选择语音合成

公开(公告)号：US09460704B2

公开(公告)日：2016-10-04

申请号：US14019967

申请日：2013-09-06

Applicant: Google Inc.

Inventor： Andrew W. Senior , Javier Gonzalvo Fructuoso

IPC: G10L13/00 , G10L13/06 , G10L25/30

CPC classification number: G10L13/06 , G10L25/30

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for providing a representation based on structured data in resources. The methods, systems, and apparatus include actions of receiving target acoustic features output from a neural network that has been trained to predict acoustic features given linguistic features. Additional actions include determining a distance between the target acoustic features and acoustic features of a stored acoustic sample. Further actions include selecting the acoustic sample to be used in speech synthesis based at least on the determined distance and synthesizing speech based on the selected acoustic sample.

Abstract translation: 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于基于资源中的结构化数据提供表示。方法，系统和装置包括接收从神经网络输出的目标声学特征的动作，所述神经网络已被训练以预测具有语言特征的声学特征。附加动作包括确定目标声学特征与存储的声学样本的声学特征之间的距离。进一步的动作包括至少基于所确定的距离来选择要在语音合成中使用的声学样本，并且基于所选择的声学样本来合成语音。

4.

发明授权
Methods and systems for sharing of adapted voice profiles 有权

公开(公告)号：US09318104B1

公开(公告)日：2016-04-19

申请号：US14796245

申请日：2015-07-10

Applicant: Google Inc.

Inventor： Javier Gonzalvo Fructuoso , Johan Schalkwyk

IPC: G10L17/00 , G10L15/02 , G10L13/00

CPC classification number: G10L15/28 , G10L13/033 , G10L13/04 , G10L13/10 , G10L15/07 , H04L67/306 , H04M1/00 , H04M1/578

Abstract: Methods and systems for sharing of adapted voice profiles are provided. The method may comprise receiving, at a computing system, one or more speech samples, and the one or more speech samples may include a plurality of spoken utterances. The method may further comprise determining, at the computing system, a voice profile associated with a speaker of the plurality of spoken utterances, and including an adapted voice of the speaker. Still further, the method may comprise receiving, at the computing system, an authorization profile associated with the determined voice profile, and the authorization profile may include one or more user identifiers associated with one or more respective users. Yet still further, the method may comprise the computing system providing the voice profile to at least one computing device associated with the one or more respective users, based at least in part on the authorization profile.

5.

发明申请
SPEECH SYNTHESIS MODEL SELECTION 审中-公开
Title translation: 语音合成模型选择

公开(公告)号：US20160343366A1

公开(公告)日：2016-11-24

申请号：US14716063

申请日：2015-05-19

Applicant: Google Inc.

Inventor： Javier Gonzalvo Fructuoso , Byungha Chun

IPC: G10L13/027 , G10L13/08 , G10L13/047

CPC classification number: G10L13/08 , G10L13/047

Abstract: In some implementations, a text-to-speech system may perform a mapping of acoustic frames to linguistic model clusters in a pre-selection process for unit selection synthesis. An architecture may leverage data-driven models, such as neural networks that are trained using recorded speech samples, to effectively map acoustic frames to linguistic model clusters during synthesis. This architecture may allow for improved handling and synthesis of combinations of unseen linguistic features.

Abstract translation: 在一些实现中，文本到语音系统可以在用于单元选择合成的预选过程中执行声音帧到语言模型集群的映射。架构可以利用数据驱动的模型，例如使用记录的语音样本训练的神经网络，以在合成期间将声学帧有效地映射到语言模型簇。这种架构可以允许改进未被看见的语言特征组合的处理和综合。

6.

发明申请
STATISTICAL UNIT SELECTION LANGUAGE MODELS BASED ON ACOUSTIC FINGERPRINTING 有权
Title translation: 基于声音指纹的统计单位选择语言模型

公开(公告)号：US20160093295A1

公开(公告)日：2016-03-31

申请号：US14850249

申请日：2015-09-10

Applicant: Google Inc.

Inventor： Alexander Gutkin , Javier Gonzalvo Fructuoso , Cyril Georges Luc Allauzen

IPC: G10L15/06 , G10L13/08 , G10L19/018

CPC classification number: G10L15/063 , G10L13/08 , G10L19/018

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for providing statistical unit selection language modeling based on acoustic fingerprinting. The methods, systems and apparatus include the actions of obtaining a unit database of acoustic units and, for each acoustic unit, linguistic data corresponding to the acoustic unit; obtaining stored data associating each acoustic unit with (i) a corresponding acoustic fingerprint and (ii) a probability of the linguistic data corresponding to the acoustic unit occurring in a text corpus; determining that the unit database of acoustic units has been updated to include one or more new acoustic units; for each new acoustic unit in the updated unit database: generating an acoustic fingerprint for the new acoustic unit; identifying an acoustic unit that (i) has an acoustic fingerprint that is indicated as similar to the fingerprint of the new acoustic unit, and (ii) has a stored associated probability.

Abstract translation: 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于提供基于声学指纹识别的统计单位选择语言建模。方法，系统和装置包括获得单元数据库的动作，对于每个声学单元，对应于声学单元的语言数据; 获得将每个声学单元与（i）对应的声学指纹相关联的存储数据和（ii）与在文本语料库中发生的声学单元相对应的语言数据的概率; 确定声学单元的单元数据库已经被更新为包括一个或多个新的声学单元; 对于更新的单元数据库中的每个新的声学单元：为新的声学单元产生声学指纹; 识别（i）具有与新声学单元的指纹相似的声音指纹的声学单元，以及（ii）具有存储的相关概率。

7.

发明申请
MULTILINGUAL PROSODY GENERATION 有权
Title translation: 多重预测生成

公开(公告)号：US20150186359A1

公开(公告)日：2015-07-02

申请号：US14143627

申请日：2013-12-30

Applicant: Google Inc.

Inventor： Javier Gonzalvo Fructuoso , Andrew W. Senior , Byungha Chun

IPC: G06F17/28

CPC classification number: G10L13/10 , G06F17/289 , G10L13/07 , G10L13/08 , G10L13/086 , G10L25/30

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for multilingual prosody generation. In some implementations, data indicating a set of linguistic features corresponding to a text is obtained. Data indicating the linguistic features and data indicating the language of the text are provided as input to a neural network that has been trained to provide output indicating prosody information for multiple languages. The neural network can be a neural network having been trained using speech in multiple languages. Output indicating prosody information for the linguistic features is received from the neural network. Audio data representing the text is generated using the output of the neural network.

Abstract translation: 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于多语言韵律生成。在一些实现中，获得指示与文本相对应的一组语言特征的数据。指示语言特征的数据和指示文本语言的数据被提供给已经被训练以提供指示多种语言的韵律信息的输出的神经网络的输入。神经网络可以是已经使用多种语言的语音训练的神经网络。从神经网络接收到表示语言特征的韵律信息的输出。使用神经网络的输出生成表示文本的音频数据。

8.

发明授权
Statistical unit selection language models based on acoustic fingerprinting 有权
Title translation: 基于声指纹的统计单位选择语言模型

公开(公告)号：US09424835B2

公开(公告)日：2016-08-23

申请号：US14850249

申请日：2015-09-10

Applicant: Google Inc.

Inventor： Alexander Gutkin , Javier Gonzalvo Fructuoso , Cyril Georges Luc Allauzen

IPC: G10L15/08 , G10L15/06 , G10L19/018 , G10L13/08

CPC classification number: G10L15/063 , G10L13/08 , G10L19/018

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for providing statistical unit selection language modeling based on acoustic fingerprinting. The methods, systems and apparatus include the actions of obtaining a unit database of acoustic units and, for each acoustic unit, linguistic data corresponding to the acoustic unit; obtaining stored data associating each acoustic unit with (i) a corresponding acoustic fingerprint and (ii) a probability of the linguistic data corresponding to the acoustic unit occurring in a text corpus; determining that the unit database of acoustic units has been updated to include one or more new acoustic units; for each new acoustic unit in the updated unit database: generating an acoustic fingerprint for the new acoustic unit; identifying an acoustic unit that (i) has an acoustic fingerprint that is indicated as similar to the fingerprint of the new acoustic unit, and (ii) has a stored associated probability.

Abstract translation: 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于提供基于声学指纹识别的统计单位选择语言建模。方法，系统和装置包括获得单元数据库的动作，对于每个声学单元，对应于声学单元的语言数据; 获得将每个声学单元与（i）对应的声学指纹相关联的存储数据和（ii）与在文本语料库中发生的声学单元相对应的语言数据的概率; 确定声学单元的单元数据库已经被更新为包括一个或多个新的声学单元; 对于更新的单元数据库中的每个新的声学单元：为新的声学单元产生声学指纹; 识别（i）具有与新声学单元的指纹相似的声音指纹的声学单元，以及（ii）具有存储的相关概率。

9.

发明申请
MULTILINGUAL PROSODY GENERATION 有权

公开(公告)号：US20160071512A1

公开(公告)日：2016-03-10

申请号：US14942300

申请日：2015-11-16

Applicant: Google Inc.

Inventor： Javier Gonzalvo Fructuoso , Andrew W. Senior , Byungha Chun

IPC: G10L13/10 , G10L13/07 , G10L25/30 , G10L13/08

CPC classification number: G10L13/10 , G06F17/289 , G10L13/07 , G10L13/08 , G10L13/086 , G10L25/30

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for multilingual prosody generation. In some implementations, data indicating a set of linguistic features corresponding to a text is obtained. Data indicating the linguistic features and data indicating the language of the text are provided as input to a neural network that has been trained to provide output indicating prosody information for multiple languages. The neural network can be a neural network having been trained using speech in multiple languages. Output indicating prosody information for the linguistic features is received from the neural network. Audio data representing the text is generated using the output of the neural network.

10.

发明授权
Multilingual prosody generation 有权
Title translation: 多语言韵律一代

公开(公告)号：US09195656B2

公开(公告)日：2015-11-24

申请号：US14143627

申请日：2013-12-30

Applicant: Google Inc.

Inventor： Javier Gonzalvo Fructuoso , Andrew W. Senior , Byungha Chun

IPC: G10L13/08 , G06F17/28 , G10L13/10

CPC classification number: G10L13/10 , G06F17/289 , G10L13/07 , G10L13/08 , G10L13/086 , G10L25/30

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for multilingual prosody generation. In some implementations, data indicating a set of linguistic features corresponding to a text is obtained. Data indicating the linguistic features and data indicating the language of the text are provided as input to a neural network that has been trained to provide output indicating prosody information for multiple languages. The neural network can be a neural network having been trained using speech in multiple languages. Output indicating prosody information for the linguistic features is received from the neural network. Audio data representing the text is generated using the output of the neural network.

Abstract translation: 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于多语言韵律生成。在一些实现中，获得指示与文本相对应的一组语言特征的数据。指示语言特征的数据和指示文本语言的数据被提供给已经被训练以提供指示多种语言的韵律信息的输出的神经网络的输入。神经网络可以是已经使用多种语言的语音训练的神经网络。从神经网络接收到表示语言特征的韵律信息的输出。使用神经网络的输出生成表示文本的音频数据。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification