Patent search ap:("TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED") AND inv:"Eryu Wang" Page 2

11.

发明授权
Method and apparatus for performing speech keyword retrieval 有权
Title translation: 执行语音关键词检索的方法和装置

公开(公告)号：US09355637B2

公开(公告)日：2016-05-31

申请号：US14620000

申请日：2015-02-11

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor： Jianxiong Ma , Lu Li , Li Lu , Xiang Zhang , Shuai Yue , Feng Rao , Eryu Wang , Linghui Kong

IPC: G10L15/18 , G10L15/28 , G10L15/08 , G10L15/32

CPC classification number: G10L15/18 , G10L15/08 , G10L15/28 , G10L15/32 , G10L2015/088

Abstract: A method and an apparatus are provided for retrieving keyword. The apparatus configures at least two types of language models in a model file, where each type of language model includes a recognition model and a corresponding decoding model; the apparatus extracts a speech feature from the to-be-processed speech data; performs language matching on the extracted speech feature by using recognition models in the model file one by one, and determines a recognition model based on a language matching rate; and determines a decoding model corresponding to the recognition model; decoding the extracted speech feature by using the determined decoding model, and obtains a word recognition result after the decoding; and matches a keyword in a keyword dictionary and the word recognition result, and outputs a matched keyword.

Abstract translation: 提供了一种用于检索关键字的方法和装置。该装置在模型文件中配置至少两种类型的语言模型，其中每种类型的语言模型包括识别模型和相应的解码模型; 该设备从待处理语音数据中提取语音特征; 通过在模型文件中逐一使用识别模型对提取出的语音特征进行语言匹配，并根据语言匹配率确定识别模型; 并确定与识别模型相对应的解码模型; 通过使用所确定的解码模型来解码所提取的语音特征，并且在解码之后获得字识别结果; 并且将关键词字典中的关键字与单词识别结果进行匹配，并输出匹配的关键字。

12.

发明授权
Method and apparatus for performing speech keyword retrieval 有权

公开(公告)号：US09257118B2

公开(公告)日：2016-02-09

申请号：US14620000

申请日：2015-02-11

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor： Jianxiong Ma , Lu Li , Li Lu , Xiang Zhang , Shuai Yue , Feng Rao , Eryu Wang , Linghui Kong

IPC: G10L15/18 , G10L15/28 , G10L15/08 , G10L15/32

Abstract: A method and an apparatus are provided for retrieving keyword. The apparatus configures at least two types of language models in a model file, where each type of language model includes a recognition model and a corresponding decoding model; the apparatus extracts a speech feature from the to-be-processed speech data; performs language matching on the extracted speech feature by using recognition models in the model file one by one, and determines a recognition model based on a language matching rate; and determines a decoding model corresponding to the recognition model; decoding the extracted speech feature by using the determined decoding model, and obtains a word recognition result after the decoding; and matches a keyword in a keyword dictionary and the word recognition result, and outputs a matched keyword.

13.

发明申请
Systems and Methods for Adding Punctuations 有权
Title translation: 添加标点的系统和方法

公开(公告)号：US20140350939A1

公开(公告)日：2014-11-27

申请号：US14160808

申请日：2014-01-22

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor： Haibo Liu , Eryu Wang , Xiang Zhang , Shuai Yue , Lu Li , Li Lu , Jian Liu , Bo Chen

IPC: G10L15/18 , G10L15/04

CPC classification number: G10L15/1815 , G06F17/27 , G06F17/2725 , G10L15/04 , G10L15/183 , G10L15/187 , G10L15/26 , G10L15/265

Abstract: Systems and methods are provided for adding punctuations. For example, one or more first feature units are identified in a voice file taken as a whole; the voice file is divided into multiple segments: one or more second feature units are identified in the voice file; a first aggregate weight of first punctuation states of the voice file and a second aggregate weight of second punctuation states of the voice file are determined, using a language model established based on word separation and third semantic features; a weighted calculation is performed to generate a third aggregate weight based on at least information associated with the first aggregate weight and the second aggregate weight; and one or more final punctuations are added to the voice file based on at least information associated with the third aggregate weight.

Abstract translation: 提供了系统和方法来添加标点符号。例如，一个或多个第一特征单元在作为整体而言的语音文件中被识别; 语音文件被分成多个段：在语音文件中识别一个或多个第二特征单元; 使用基于词分离和第三语义特征建立的语言模型来确定语音文件的第一标点状态的第一聚合权重和语音文件的第二标点状态的第二聚合权重; 基于至少与第一聚集权重和第二聚集权重相关联的信息来执行加权计算以产生第三聚集权重; 并且基于至少与第三聚合权重相关联的信息将一个或多个最终标点符号添加到语音文件。

14.

发明申请
METHOD AND SYSTEM FOR AUTOMATIC SPEECH RECOGNITION 有权
Title translation: 自动语音识别方法与系统

公开(公告)号：US20140214419A1

公开(公告)日：2014-07-31

申请号：US14108223

申请日：2013-12-16

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor： Feng Rao , Li Lu , Bo Chen , Shuai Yue , Xiang Zhang , Eryu Wang , Dadong Xie , Lou Li , Duling Lu

IPC: G10L15/06

CPC classification number: G10L15/063 , G10L15/183 , G10L15/197 , G10L15/26

Abstract: An automatic speech recognition method includes at a computer having one or more processors and memory for storing one or more programs to be executed by the processors, obtaining a plurality of speech corpus categories through classifying and calculating raw speech corpus; obtaining a plurality of classified language models that respectively correspond to the plurality of speech corpus categories through a language model training applied on each speech corpus category; obtaining an interpolation language model through implementing a weighted interpolation on each classified language model and merging the interpolated plurality of classified language models; constructing a decoding resource in accordance with an acoustic model and the interpolation language model; and decoding input speech using the decoding resource, and outputting a character string with a highest probability as a recognition result of the input speech.

Abstract translation: 自动语音识别方法包括在具有一个或多个处理器的计算机和用于存储要由处理器执行的一个或多个程序的存储器，通过分类和计算原始语音语料库获得多个语音语料库类别; 通过应用于每个语音语料库类别的语言模型训练获得分别对应于所述多个语音语料库类别的多个分类语言模型; 通过对每个分类语言模型实施加权插值并合并内插的多个分类语言模型来获得内插语言模型; 根据声学模型和内插语言模型构建解码资源; 并使用解码资源解码输入语音，并输出具有最高概率的字符串作为输入语音的识别结果。

15.

发明授权
Data parallel processing method and apparatus based on multiple graphic processing units 有权

公开(公告)号：US10282809B2

公开(公告)日：2019-05-07

申请号：US15210278

申请日：2016-07-14

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor： Xing Jin , Yi Li , Yongqiang Zou , Zhimao Guo , Eryu Wang , Wei Xue , Bo Chen , Yong Li , Chunjian Bao , Lei Xiao

IPC: G06T1/20 , G06F9/50 , G06F9/52 , G06T1/60 , G06N3/04 , G06N3/063 , G06N3/08

Abstract: A parallel data processing method based on multiple graphic processing units (GPUs) is provided, including: creating, in a central processing unit (CPU), a plurality of worker threads for controlling a plurality of worker groups respectively, the worker groups including one or more GPUs; binding each worker thread to a corresponding GPU; loading a plurality of batches of training data from a nonvolatile memory to GPU video memories in the plurality of worker groups; and controlling the plurality of GPUs to perform data processing in parallel through the worker threads. The method can enhance efficiency of multi-GPU parallel data processing. In addition, a parallel data processing apparatus is further provided.

16.

发明授权
Method and system of adding punctuation and establishing language model using a punctuation weighting applied to chinese speech recognized text 有权

公开(公告)号：US09811517B2

公开(公告)日：2017-11-07

申请号：US14148579

申请日：2014-01-06

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor： Haibo Liu , Eryu Wang , Xiang Zhang , Li Lu , Shuai Yue , Qiuge Liu , Bo Chen , Jian Liu , Lu Li

IPC: G06F17/27 , G06F17/28 , G10L15/00 , G10L15/26

CPC classification number: G06F17/273 , G06F17/2775 , G06F17/2785 , G06F17/289 , G10L15/265

Abstract: A method of processing information content based on a Chinese language model is performed at a computer, the method including: identifying a plurality of expressions in the information content extracted from a speech input through speech recognition that is queued to be processed; dividing the expressions into a plurality of characteristic units according to semantic features and predetermined characteristics associated with each characteristic unit, each including a subset of the expressions and the predetermined characteristics at least including a respective integer number of expressions that are included in the characteristic unit; extracting, from the Chinese language model, a plurality of probabilities for punctuation marks associated with each characteristic unit; and in accordance with the probabilities, associating a respective punctuation mark with each characteristic unit included in the information content. The method further comprises adding punctuation marks based on a weight determined for each punctuation mark.

17.

发明授权
Method and system for building a topic specific language model for use in automatic speech recognition 有权

公开(公告)号：US09697821B2

公开(公告)日：2017-07-04

申请号：US14108223

申请日：2013-12-16

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor： Feng Rao , Li Lu , Bo Chen , Shuai Yue , Xiang Zhang , Eryu Wang , Dadong Xie , Lou Li , Duling Lu

IPC: G10L15/06 , G10L15/183 , G10L15/197 , G10L15/26

CPC classification number: G10L15/063 , G10L15/183 , G10L15/197 , G10L15/26

Abstract: An automatic speech recognition method includes at a computer having one or more processors and memory for storing one or more programs to be executed by the processors, obtaining a plurality of speech corpus categories through classifying and calculating raw speech corpus; obtaining a plurality of classified language models that respectively correspond to the plurality of speech corpus categories through a language model training applied on each speech corpus category; obtaining an interpolation language model through implementing a weighted interpolation on each classified language model and merging the interpolated plurality of classified language models; constructing a decoding resource in accordance with an acoustic model and the interpolation language model; and decoding input speech using the decoding resource, and outputting a character string with a highest probability as a recognition result of the input speech.

18.

发明申请
SYSTEMS AND METHODS FOR AUDIO COMMAND RECOGNITION 有权
Title translation: 用于音频命令识别的系统和方法

公开(公告)号：US20160086609A1

公开(公告)日：2016-03-24

申请号：US14958606

申请日：2015-12-03

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor： Shuai Yue , Xiang Zhang , Li Lu , Feng Rao , Eryu Wang , Haibo Liu , Bo Chen , Jian Liu , Lu Li

IPC: G10L17/24 , G10L15/22 , G10L17/16 , G06F3/16 , G10L17/26

CPC classification number: G10L17/24 , G06F3/167 , G10L15/22 , G10L17/02 , G10L17/16 , G10L17/26 , G10L2015/223

Abstract: The present application discloses a method, an electronic system and a non-transitory computer readable storage medium for recognizing audio commands in an electronic device. The electronic device obtains audio data based on an audio signal provided by a user and extracts characteristic audio fingerprint features from the audio data. The electronic device further determines whether the corresponding audio signal is generated by an authorized user by comparing the characteristic audio fingerprint features with an audio fingerprint model for the authorized user and with a universal background model that represents user-independent audio fingerprint features, respectively. When the corresponding audio signal is generated by the authorized user of the electronic device, an audio command is extracted from the audio data, and an operation is performed according to the audio command.

Abstract translation: 本申请公开了一种用于识别电子设备中的音频命令的方法，电子系统和非暂时性计算机可读存储介质。电子设备基于由用户提供的音频信号获得音频数据，并从音频数据中提取特征音频指纹特征。电子设备还通过将特征音频指纹特征与用于授权用户的音频指纹模型进行比较，以及分别表示用户独立的音频指纹特征的通用背景模型来确定对应的音频信号是否由授权用户产生。当由电子设备的授权用户产生相应的音频信号时，从音频数据中提取音频命令，并根据音频命令进行操作。

19.

发明授权
User authentication method and apparatus based on audio and video data 有权
Title translation: 基于音频和视频数据的用户认证方法和设备

公开(公告)号：US09177131B2

公开(公告)日：2015-11-03

申请号：US14262665

申请日：2014-04-25

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor： Xiang Zhang , Li Lu , Eryu Wang , Shuai Yue , Feng Rao , Haibo Liu , Lou Li , Duling Lu , Bo Chen

IPC: H04L29/06 , G06F21/32

CPC classification number: G06F21/32 , G06F2221/2117

Abstract: A computer-implemented method is performed at a server having one or more processors and memory storing programs executed by the one or more processors for authenticating a user from video and audio data. The method includes: receiving a login request from a mobile device, the login request including video data and audio data; extracting a group of facial features from the video data; extracting a group of audio features from the audio data and recognizing a sequence of words in the audio data; identifying a first user account whose respective facial features match the group of facial features and a second user account whose respective audio features match the group of audio features. If the first user account is the same as the second user account, retrieve the sequence of words associated with the user account and compare the sequences of words for authentication purpose.

Abstract translation: 在具有一个或多个处理器的服务器和由一个或多个处理器执行的用于从视频和音频数据认证用户的存储器存储程序的服务器执行计算机实现的方法。该方法包括：从移动设备接收登录请求，登录请求包括视频数据和音频数据; 从视频数据中提取一组面部特征; 从音频数据提取一组音频特征并识别音频数据中的单词序列; 识别其各自的面部特征与该组面部特征相匹配的第一用户帐户和其各个音频特征与该组音频特征相匹配的第二用户帐户。如果第一个用户帐户与第二个用户帐户相同，则检索与用户帐户相关联的单词序列，并比较用于验证目的的单词序列。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification