Method and device for voiceprint recognition
    22.
    发明授权
    Method and device for voiceprint recognition 有权
    用于声纹识别的方法和装置

    公开(公告)号:US09502038B2

    公开(公告)日:2016-11-22

    申请号:US14105110

    申请日:2013-12-12

    CPC classification number: G10L17/18 G10L17/02 G10L17/04 G10L17/08

    Abstract: A method and device for voiceprint recognition, include: establishing a first-level Deep Neural Network (DNN) model based on unlabeled speech data, the unlabeled speech data containing no speaker labels and the first-level DNN model specifying a plurality of basic voiceprint features for the unlabeled speech data; obtaining a plurality of high-level voiceprint features by tuning the first-level DNN model based on labeled speech data, the labeled speech data containing speech samples with respective speaker labels, and the tuning producing a second-level DNN model specifying the plurality of high-level voiceprint features; based on the second-level DNN model, registering a respective high-level voiceprint feature sequence for a user based on a registration speech sample received from the user; and performing speaker verification for the user based on the respective high-level voiceprint feature sequence registered for the user.

    Abstract translation: 用于声纹识别的方法和装置包括:基于未标记的语音数据建立第一级深神经网络(DNN)模型,不包含扬声器标签的未标记语音数据和指定多个基本声纹特征的第一级DNN模型 对于未标记的语音数据; 通过基于标记的语音数据调整第一级DNN模型来获得多个高级声纹特征,所述标记语音数据包含具有相应扬声器标签的语音样本,并且调谐产生指定多个高的DNN模型 级的声纹特征; 基于第二级DNN模型,基于从用户接收到的注册语音样本,为用户注册相应的高级声纹特征序列; 以及基于为用户注册的各个高级声纹特征序列,为用户执行说话人验证。

    DATA PARALLEL PROCESSING METHOD AND APPARATUS BASED ON MULTIPLE GRAPHIC PROCESSING UNITS
    23.
    发明申请
    DATA PARALLEL PROCESSING METHOD AND APPARATUS BASED ON MULTIPLE GRAPHIC PROCESSING UNITS 审中-公开
    基于多个图形处理单元的数据并行处理方法和装置

    公开(公告)号:US20160321777A1

    公开(公告)日:2016-11-03

    申请号:US15210278

    申请日:2016-07-14

    Abstract: A parallel data processing method based on multiple graphic processing units (GPUs) is provided, including: creating, in a central processing unit (CPU), a plurality of worker threads for controlling a plurality of worker groups respectively, the worker groups including one or more GPUs; binding each worker thread to a corresponding GPU; loading a plurality of batches of training data from a nonvolatile memory to GPU video memories in the plurality of worker groups; and controlling the plurality of GPUs to perform data processing in parallel through the worker threads. The method can enhance efficiency of multi-GPU parallel data processing. In addition, a parallel data processing apparatus is further provided.

    Abstract translation: 提供了一种基于多个图形处理单元(GPU)的并行数据处理方法,包括:在中央处理单元(CPU)中分别创建多个用于控制多个工作者组的工作线程,所述工作人员组包括一个或 更多GPU 将每个工作线程绑定到相应的GPU; 将多批培训数据从非易失性存储器加载到多个工作者组中的GPU视频存储器; 并且通过工作线程并行地控制多个GPU来执行数据处理。 该方法可以提高多GPU并行数据处理的效率。 另外,还提供并行数据处理装置。

    Method, system and computer storage medium for visual searching based on cloud service
    24.
    发明授权
    Method, system and computer storage medium for visual searching based on cloud service 有权
    基于云服务的视觉搜索的方法,系统和计算机存储介质

    公开(公告)号:US09411849B2

    公开(公告)日:2016-08-09

    申请号:US14241863

    申请日:2013-04-09

    Abstract: A method, system and computer storage medium for visual searching based on cloud service is disclosed. The method includes: receiving, from a client, an image recognition request of cloud service, the request containing image data; forwarding, according to a set classified forwarding rule, the image data to a corresponding classified visual search service; recognizing, by the respective corresponding classified visual search services, corresponding classified type information in the image data, and determining a corresponding name of the image data in accordance with the respective classified type information, and obtaining a classified visual search result; summarizing and sending, to a client, the classified visual search result of the corresponding classified visual search service. By detection and recognition of the classified type information of the image data, the comprehensive feature information of a picture is obtained, based on which further applications are allowed, and thus the user experience is improved.

    Abstract translation: 公开了一种基于云服务的视觉搜索的方法,系统和计算机存储介质。 该方法包括:从客户端接收云服务的图像识别请求,该请求包含图像数据; 根据设置的分类转发规则,将图像数据转发到相应的分类视觉搜索服务; 通过各自相应的分类视觉搜索服务识别图像数据中的相应的分类类型信息,并且根据各个分类的类型信息确定图像数据的相应名称,并获得分类的视觉搜索结果; 总结并向客户发送相应的分类视觉搜索服务的分类视觉搜索结果。 通过对图像数据的分类型信息的检测和识别,获得图像的综合特征信息,基于允许进一步的应用,从而提高了用户体验。

    Method and device for acoustic language model training
    25.
    发明授权
    Method and device for acoustic language model training 有权
    声学语言模型训练的方法和装置

    公开(公告)号:US09396723B2

    公开(公告)日:2016-07-19

    申请号:US14109845

    申请日:2013-12-17

    CPC classification number: G10L15/063 G06F17/28 G10L15/183

    Abstract: A method and a device for training an acoustic language model, include: conducting word segmentation for training samples in a training corpus using an initial language model containing no word class labels, to obtain initial word segmentation data containing no word class labels; performing word class replacement for the initial word segmentation data containing no word class labels, to obtain first word segmentation data containing word class labels; using the first word segmentation data containing word class labels to train a first language model containing word class labels; using the first language model containing word class labels to conduct word segmentation for the training samples in the training corpus, to obtain second word segmentation data containing word class labels; and in accordance with the second word segmentation data meeting one or more predetermined criteria, using the second word segmentation data containing word class labels to train the acoustic language model.

    Abstract translation: 一种用于训练声学语言模型的方法和装置,包括:使用不含词类标签的初始语言模型,在训练语料库中训练样本的词分割,以获得不包含词类标签的初始分词数据; 对不包含词类标签的初始分词数据执行单词类替换,以获得包含单词分类标签的第一分词数据; 使用包含词类标签的第一词分割数据来训练包含词类标签的第一语言模型; 使用包含词类标签的第一语言模型对训练语料库中的训练样本进行词分割,以获得包含词类标签的第二词分割数据; 并且根据满足一个或多个预定标准的第二字分割数据,使用包含词类标签的第二词分割数据来训练声学语言模型。

    Methods and devices for obtaining card information
    26.
    发明授权
    Methods and devices for obtaining card information 有权
    用于获取卡片信息的方法和装置

    公开(公告)号:US09330310B2

    公开(公告)日:2016-05-03

    申请号:US14484066

    申请日:2014-09-11

    Abstract: A server system with one or more processors and memory obtains, from a client device, a card image which includes an image of a card, and identifies a card configuration type corresponding to the card in the card image based on a database of stored card configuration types. Each stored card configuration type in the database is associated with layout information regarding respective features and information regions for the stored card configuration type. In accordance with the identified card configuration type, the server system determines one or more information regions of the card image containing respective card information of the card. The server system extracts at least a portion of the card information of the card from the one or more information regions of the card image and transmits, to the client device, at least the extracted portion of the card information.

    Abstract translation: 具有一个或多个处理器和存储器的服务器系统从客户端设备获得包括卡的图像的卡片图像,并且基于存储的卡配置的数据库来识别与卡片图像中的卡相对应的卡配置类型 类型。 数据库中存储的所有卡配置类型与存储的卡配置类型的关于各个特征和信息区域的布局信息相关联。 根据所识别的卡配置类型,服务器系统确定包含卡的相应卡信息的卡片图像的一个或多个信息区域。 服务器系统从卡片图像的一个或多个信息区域提取卡的卡片信息的至少一部分,并至少提取卡片信息的部分。

    Keyword Detection For Speech Recognition
    27.
    发明申请
    Keyword Detection For Speech Recognition 有权
    语音识别的关键字检测

    公开(公告)号:US20150095032A1

    公开(公告)日:2015-04-02

    申请号:US14567969

    申请日:2014-12-11

    CPC classification number: G10L15/08 G10L15/083 G10L2015/088

    Abstract: This application discloses a method implemented of recognizing a keyword in a speech that includes a sequence of audio frames further including a current frame and a subsequent frame. A candidate keyword is determined for the current frame using a decoding network that includes keywords and filler words of multiple languages, and used to determine a confidence score for the audio frame sequence. A word option is also determined for the subsequent frame based on the decoding network, and when the candidate keyword and the word option are associated with two distinct types of languages, the confidence score of the audio frame sequence is updated at least based on a penalty factor associated with the two distinct types of languages. The audio frame sequence is then determined to include both the candidate keyword and the word option by evaluating the updated confidence score according to a keyword determination criterion.

    Abstract translation: 本申请公开了一种实现的方法,其中识别语音中的关键字,其中包括进一步包括当前帧和后续帧的音频帧序列。 使用包括多种语言的关键词和填充词的解码网络为当前帧确定候选关键字,并且用于确定音频帧序列的置信度分数。 还基于解码网络为后续帧确定字选项,并且当候选关键词和词选项与两种不同类型的语言相关联时,至少基于惩罚来更新音频帧序列的置信度得分 与两种不同类型语言相关联的因素。 然后通过根据关键字确定标准评估更新的可信度得分,确定音频帧序列以包括候选关键词和词选项。

    BIOMETRIC-BASED AUTHENTICATION METHOD, APPARATUS AND SYSTEM
    28.
    发明申请
    BIOMETRIC-BASED AUTHENTICATION METHOD, APPARATUS AND SYSTEM 审中-公开
    基于生物量的认证方法,装置和系统

    公开(公告)号:US20150007295A1

    公开(公告)日:2015-01-01

    申请号:US14478024

    申请日:2014-09-05

    Abstract: A biometric-based authentication method, an apparatus, and a system are described. The method includes: receiving a biometric image to be authenticated sent from a client; performing feature extraction to the biometric image to be authenticated to obtain a biometric template to be authenticated; comparing the biometric template to be authenticated with a locally-stored biometric template; and returning an authentication result. In this case, the feature extraction process may be implemented at a cloud server side, as such, the complexity of the client may be reduced, the expandability of the client may be increased, a limitation that the biometric recognition may only be implemented on the client may be eliminated, and diversified utilization may be supported.

    Abstract translation: 描述了基于生物特征的认证方法,装置和系统。 该方法包括:从客户端接收要认证的生物体图像; 对生物体图像执行特征提取以进行认证,以获得要认证的生物特征模板; 将要认证的生物特征模板与本地存储的生物特征模板进行比较; 并返回认证结果。 在这种情况下,特征提取处理可以在云服务器侧实现,因此,可以减少客户端的复杂性,可以增加客户端的可扩展性,只能在 客户可能被淘汰,并且可以支持多样化的利用。

    COMMUNICATION METHOD AND DEVICE FOR VIDEO SIMULATION IMAGE
    29.
    发明申请
    COMMUNICATION METHOD AND DEVICE FOR VIDEO SIMULATION IMAGE 有权
    用于视频模拟图像的通信方法和装置

    公开(公告)号:US20140139619A1

    公开(公告)日:2014-05-22

    申请号:US14165117

    申请日:2014-01-27

    Abstract: A method and device for communicating a video with a simulation image is provided. The method includes: acquiring, by a sender, video data, transforming the acquired video data into vector data in image recognition algorithm, and sending the vector data to a receiver; and calling, by the receiver, a cartoon rendering model and rendering the received vector data in the video with a corresponding cartoon simulation image according to the cartoon rendering model. By using the present invention, the amount of data transmitted in a network may be reduced, and network bandwidth resources are saved.

    Abstract translation: 提供了一种用于将视频与模拟图像通信的方法和装置。 该方法包括:通过发送方获取视频数据,将获取的视频数据变换为图像识别算法中的矢量数据,并将矢量数据发送给接收机; 并通过接收机呼叫卡通渲染模型,并根据卡通渲染模型,使用相应的卡通模拟图像,在视频中呈现接收到的矢量数据。 通过使用本发明,可以减少在网络中发送的数据量,并节省网络带宽资源。

Patent Agency Ranking