Patent search ap:("TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED") AND inv:"Fuzhang Wu" Page 1

1.

发明授权
Identity vector generation method, computer device, and computer-readable storage medium 有权

公开(公告)号：US10909989B2

公开(公告)日：2021-02-02

申请号：US16213421

申请日：2018-12-07

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor： Wei Li , Binghua Qian , Xingming Jin , Ke Li , Fuzhang Wu , Yongjian Wu , Feiyue Huang

IPC: G10L15/00 , G10L15/02 , G10L17/02 , G10L17/14 , G10L17/20

Abstract: An identity vector generation method is provided. The method includes obtaining to-be-processed speech data. Corresponding acoustic features are extracted from the to-be-processed speech data. A posterior probability that each of the acoustic features belongs to each Gaussian distribution component in a speaker background model is calculated to obtain a statistic. The statistic is mapped to a statistic space to obtain a reference statistic, the statistic space built according to a statistic corresponding to a speech sample exceeding a threshold speech duration. A corrected statistic is determined according to the calculated statistic and the reference statistic; and an identity vector is generated according to the corrected statistic.

2.

发明授权
Model generating method, and speech synthesis method and apparatus 有权

公开(公告)号：US10832652B2

公开(公告)日：2020-11-10

申请号：US16318889

申请日：2017-08-14

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor： Haolei Yuan , Fuzhang Wu , Binghua Qian

IPC: G10L13/00 , G10L13/06 , G10L13/047 , G10L13/08 , G06F17/16

Abstract: A method is performed by at least one processor, and includes acquiring training speech data by concatenating speech segments having a lowest target cost among candidate concatenation solutions, and extracting training speech segments of a first annotation type, from the training speech data, the first annotation type being used for annotating that a speech continuity of a respective one of the training speech segments is superior to a preset condition. The method further includes calculating a mean dissimilarity matrix, based on neighboring candidate speech segments corresponding to the training speech segments before concatenation, the mean dissimilarity matrix representing a mean dissimilarity in acoustic features of groups of the neighboring candidate speech segments belonging to a same type of concatenation combination relationship, and generating a concatenation cost model having a target concatenation weight, based on the mean dissimilarity matrix, the concatenation cost model corresponding to the same type of concatenation combination relationship.

3.

发明申请
IDENTITY VECTOR PROCESSING METHOD AND COMPUTER DEVICE 审中-公开

公开(公告)号：US20180233151A1

公开(公告)日：2018-08-16

申请号：US15954416

申请日：2018-04-16

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor： Wei Li , Binghua Qian , Xingming Jin , Ke Li , Fuzhang Wu , Yongjian Wu , Feiyue Huang

IPC: G10L17/20 , G10L17/02 , G10L17/08

CPC classification number: G10L17/20 , G10L17/02 , G10L17/08

Abstract: Processing circuitry of an information processing apparatus obtains a set of identity vectors that are calculated according to voice samples from speakers. The identity vectors are classified into speaker classes respectively corresponding to the speakers. The processing circuitry selects, from the identity vectors, first subsets of interclass neighboring identity vectors respectively corresponding to the identity vectors and second subsets of intraclass neighboring identity vectors respectively corresponding to the identity vectors. The processing circuitry determines an interclass difference based on the first subsets of interclass neighboring identity vectors and the corresponding identity vectors; and determines an intraclass difference based on the second subsets of intraclass neighboring identify vectors and the corresponding identity vectors. Further, the processing circuitry determines a set of basis vectors to maximize a projection of the interclass difference on the basis vectors and to minimize a projection of the intraclass difference on the basis vectors.

4.

发明授权
Method and apparatus for training voiceprint recognition system 有权

公开(公告)号：US10854207B2

公开(公告)日：2020-12-01

申请号：US16231913

申请日：2018-12-24

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor： Wei Li , Binghua Qian , Xingming Jin , Ke Li , Fuzhang Wu , Yongjian Wu , Feiyue Huang

IPC: G10L17/22 , G10L17/04 , G10L17/02 , G10L17/00 , G10L17/08

Abstract: A method and an apparatus for training a voiceprint recognition system are provided. The method includes obtaining a voice training data set comprising voice segments of users; determining identity vectors of all the voice segments; identifying identity vectors of voice segments of a same user in the determined identity vectors; placing the recognized identity vectors of the same user in the users into one of user categories; and determining an identity vector in the user category as a first identity vector. The method further includes normalizing the first identity vector by using a normalization matrix, a first value being a sum of similarity degrees between the first identity vector in the corresponding category and other identity vectors in the corresponding category; training the normalization matrix, and outputting a training value of the normalization matrix when the normalization matrix maximizes a sum of first values of all the user categories.

5.

发明授权
Voice data processing method, apparatus and storage medium 有权

公开(公告)号：US10692503B2

公开(公告)日：2020-06-23

申请号：US15764573

申请日：2017-03-03

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor： Xingming Jin , Wei Li , Fangmai Zheng , Fuzhang Wu , Bilei Zhu , Binghua Qian , Ke Li , Yongjian Wu , Feiyue Huang

IPC: G10L17/20 , G10L17/08 , G10L25/21 , G10L17/04

Abstract: A voice data processing method and apparatus are provided. The method includes obtaining an I-Vector vector of each of voice samples, and determining a target seed sample in the voice samples. A first cosine distance is calculated between an I-Vector vector of the target seed sample and an I-Vector vector of a target remaining voice sample, where the target remaining voice sample is a voice sample other than the target seed sample in the voice samples. A target voice sample is filtered from the voice samples or the target remaining voice sample according to the first cosine distance, to obtain a target voice sample whose first cosine distance is greater than a first threshold.

6.

发明授权
Constructing speech decoding network for numeric speech recognition 有权

公开(公告)号：US10699699B2

公开(公告)日：2020-06-30

申请号：US15993332

申请日：2018-05-30

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor： Fuzhang Wu , Binghua Qian , Wei Li , Ke Li , Yongjian Wu , Feiyue Huang

IPC: G10L15/06 , G10L15/14 , G10L15/02 , G10L15/04 , G10L25/24 , G10L25/90 , G10L15/187

Abstract: The embodiments of the present disclosure disclose a method for constructing a speech decoding network in digital speech recognition. The method comprises acquiring training data obtained by digital speech recording, the training data comprising a plurality of speech segments, and each speech segment comprising a plurality of digital speeches; performing acoustic feature extraction on the training data to obtain a feature sequence corresponding to each speech segment; performing progressive training starting from a mono-phoneme acoustic model to obtain an acoustic model; acquiring a language model, and constructing a speech decoding network by the language model and the acoustic model obtained by training.

7.

发明授权
Identity vector processing method and computer device 有权

公开(公告)号：US10650830B2

公开(公告)日：2020-05-12

申请号：US15954416

申请日：2018-04-16

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor： Wei Li , Binghua Qian , Xingming Jin , Ke Li , Fuzhang Wu , Yongjian Wu , Feiyue Huang

IPC: G10L17/00 , G10L17/20 , G10L17/08 , G10L17/02

Abstract: Processing circuitry of an information processing apparatus obtains a set of identity vectors that are calculated according to voice samples from speakers. The identity vectors are classified into speaker classes respectively corresponding to the speakers. The processing circuitry selects, from the identity vectors, first subsets of interclass neighboring identity vectors respectively corresponding to the identity vectors and second subsets of intraclass neighboring identity vectors respectively corresponding to the identity vectors. The processing circuitry determines an interclass difference based on the first subsets of interclass neighboring identity vectors and the corresponding identity vectors; and determines an intraclass difference based on the second subsets of intraclass neighboring identify vectors and the corresponding identity vectors. Further, the processing circuitry determines a set of basis vectors to maximize a projection of the interclass difference on the basis vectors and to minimize a projection of the intraclass difference on the basis vectors.

8.

发明申请
IDENTITY VECTOR GENERATION METHOD, COMPUTER DEVICE, AND COMPUTER-READABLE STORAGE MEDIUM 审中-公开

公开(公告)号：US20190115031A1

公开(公告)日：2019-04-18

申请号：US16213421

申请日：2018-12-07

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor： Wei Li , Binghua Qian , Xingming Jin , Ke Li , Fuzhang Wu , Yongjian Wu , Feiyue Huang

IPC: G10L17/02 , G10L17/20 , G10L17/14

Abstract: An identity vector generation method is provided. The method includes obtaining to-be-processed speech data. Corresponding acoustic features are extracted from the to-be-processed speech data. A posterior probability that each of the acoustic features belongs to each Gaussian distribution component in a speaker background model is calculated to obtain a statistic. The statistic is mapped to a statistic space to obtain a reference statistic, the statistic space built according to a statistic corresponding to a speech sample exceeding a threshold speech duration. A corrected statistic is determined according to the calculated statistic and the reference statistic; and an identity vector is generated according to the corrected statistic.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification