METHOD, APPARATUS AND SYSTEM FOR GENERATING AUGMENTED REALITY MODULE AND STORAGE MEDIUM

    Publication No.: US20190129715A1

    Publication date: 2019-05-02

    Application No.: US16202196

    Filing date: 2018-11-28

    Abstract: A method for generating an augmented reality module by an apparatus is described. Processing circuitry of the apparatus obtains preset third party software development interface information, the third party software development interface information being uniformly encapsulated with an AR core engine and an AR rendering engine, and a system parameter and pose information that are associated with the AR core engine being passed into the AR rendering engine. The processing circuitry generates, according to the third party software development interface information and a document configured corresponding to the third party software development interface information, an AR module of a mobile client, the correspondingly configured document comprising interface use information of the AR core engine and the AR rendering engine.

    BIOMETRIC-BASED AUTHENTICATION METHOD, APPARATUS AND SYSTEM

    Publication No.: US20190012450A1

    Publication date: 2019-01-10

    Application No.: US16131844

    Filing date: 2018-09-14

    Abstract: A biometric-based authentication method, apparatus, and system are described. The method includes: receiving, from a client, a biometric image to be authenticated; performing feature extraction on the biometric image to obtain a biometric template to be authenticated; comparing that template with a locally stored biometric template; and returning an authentication result. Because feature extraction may be performed on the cloud server side, the complexity of the client may be reduced, its expandability increased, the limitation that biometric recognition can only be performed on the client eliminated, and diversified utilization supported.
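
The server-side flow in this abstract (extract a template, compare it with a stored one, return a result) can be illustrated with a minimal Python sketch. Everything named here is hypothetical: the byte-histogram `extract_template`, the cosine comparison, the `AuthServer` class, and the 0.95 threshold are stand-ins chosen for illustration, since the abstract does not specify a feature extractor or a matching rule.

```python
import math
from typing import Dict, List


def extract_template(image_bytes: bytes, bins: int = 16) -> List[float]:
    """Placeholder feature extractor (assumption): normalized histogram of byte values."""
    hist = [0.0] * bins
    for b in image_bytes:
        hist[b * bins // 256] += 1.0
    total = sum(hist) or 1.0
    return [h / total for h in hist]


def cosine_similarity(a: List[float], b: List[float]) -> float:
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


class AuthServer:
    """Hypothetical cloud-side service: templates are stored and compared on the server."""

    def __init__(self, threshold: float = 0.95) -> None:
        self.templates: Dict[str, List[float]] = {}
        self.threshold = threshold  # assumed acceptance threshold

    def register(self, user_id: str, image_bytes: bytes) -> None:
        # Store the template extracted from the enrollment image.
        self.templates[user_id] = extract_template(image_bytes)

    def authenticate(self, user_id: str, image_bytes: bytes) -> bool:
        # Extract a template from the submitted image and compare it
        # with the locally stored template for this user.
        stored = self.templates.get(user_id)
        if stored is None:
            return False
        candidate = extract_template(image_bytes)
        return cosine_similarity(candidate, stored) >= self.threshold
```

In this arrangement the client only uploads the raw image, which is one way to read the abstract's claim that moving feature extraction to the server reduces client complexity.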

    Method and device for voiceprint recognition

    Publication No.: US09940935B2

    Publication date: 2018-04-10

    Application No.: US15240696

    Filing date: 2016-08-18

    CPC classification number: G10L17/18 G10L17/02 G10L17/04 G10L17/08

    Abstract: A method is performed at a device having one or more processors and memory. The device establishes a first-level Deep Neural Network (DNN) model based on unlabeled speech data, the unlabeled speech data containing no speaker labels and the first-level DNN model specifying a plurality of basic voiceprint features for the unlabeled speech data. The device establishes a second-level DNN model by tuning the first-level DNN model based on labeled speech data, the labeled speech data containing speech samples with respective speaker labels, wherein the second-level DNN model specifies a plurality of high-level voiceprint features. Using the second-level DNN model, the device registers a first high-level voiceprint feature sequence for a user based on a registration speech sample received from the user. The device performs speaker verification for the user based on the first high-level voiceprint feature sequence registered for the user.

    METHOD AND SYSTEM FOR TESTING AND MONITORING A REAL-TIME STREAMING MEDIA RECOGNITION SERVICE PROVIDER
    37.
    Patent application (status: active)

    Publication No.: US20160308737A1

    Publication date: 2016-10-20

    Application No.: US15191351

    Filing date: 2016-06-23

    Abstract: A method of testing and monitoring a real-time streaming media recognition service provider is performed at a computer system. The computer system obtains a streaming media signal source, selects a testing sample from the streaming media signal source, records characteristics of the testing sample, and obtains an expected output according to the characteristics of the testing sample. Next, the computer system converts the testing sample into a digital streaming format preset by the service provider and initiates a media recognition request according to the testing sample in the digital streaming format to the service provider. After receiving a media recognition result of the testing sample returned by the service provider according to the media recognition request, the computer system compares the media recognition result with the expected output and indicates whether the service provider is normal in accordance with the comparison result.
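
The check described above (convert a sample, send it, compare the returned result with the expected output) can be sketched in Python as follows. This is an illustrative sketch only: `Sample`, `to_provider_format`, `provider_is_normal`, the `STRM` prefix, and the case-insensitive comparison are all assumptions, and the provider is modeled as a plain callable rather than a real network service.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Sample:
    """A testing sample cut from the streaming media signal source."""
    audio: bytes
    expected_output: str  # derived from the recorded characteristics of the sample


def to_provider_format(audio: bytes) -> bytes:
    """Stand-in for conversion into the provider's preset digital streaming format."""
    return b"STRM" + audio


def provider_is_normal(sample: Sample, recognize: Callable[[bytes], str]) -> bool:
    """Send one recognition request and compare the result with the expected output.

    `recognize` is a hypothetical callable representing the service provider.
    """
    payload = to_provider_format(sample.audio)
    try:
        result = recognize(payload)
    except Exception:
        # A failed or timed-out request is treated as an abnormal provider.
        return False
    # Assumed comparison rule: exact match, ignoring case and surrounding whitespace.
    return result.strip().lower() == sample.expected_output.strip().lower()
```

A monitoring job could call `provider_is_normal` on a schedule with fresh samples and raise an alert when it returns `False`.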

    Method, device and system for providing language service
    38.
    Granted patent (status: active)

    Publication No.: US09128930B2

    Publication date: 2015-09-08

    Application No.: US14563939

    Filing date: 2014-12-08

    CPC classification number: G06F17/289 G10L13/086 G10L15/005 G10L15/26

    Abstract: A method, device and system for providing a language service are disclosed. In some embodiments, the method is performed at a computer system having one or more processors and memory for storing programs to be executed by the one or more processors. The method includes receiving a first message from a client device. The method includes determining if the first message is in a first language or a second language different than the first language. The method includes translating the first message into a second message in the second language if the first message is in the first language. The method includes, alternatively, generating a third message in the second language if the first message is in the second language, where the third message includes a conversational response to the first message. The method further includes returning one of the second message and the third message to the client device.
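
The branching in this abstract (translate if the message is in the first language, otherwise generate a conversational response) can be sketched in Python. The sketch is hypothetical throughout: it assumes the first language is Chinese and the second is English, detects Chinese by the presence of CJK characters, and uses a two-entry `PHRASEBOOK` and an echo-style reply in place of real translation and dialogue components.

```python
from typing import Dict

# Toy phrasebook standing in for a real translation component (assumption).
PHRASEBOOK: Dict[str, str] = {"你好": "hello", "谢谢": "thank you"}


def is_first_language(message: str) -> bool:
    """Assumed detector: the message is in the first language if it contains CJK characters."""
    return any("\u4e00" <= ch <= "\u9fff" for ch in message)


def translate(message: str) -> str:
    """Produce the 'second message': the first message rendered in the second language."""
    key = message.strip()
    return PHRASEBOOK.get(key, "[untranslated] " + key)


def converse(message: str) -> str:
    """Produce the 'third message': a conversational response in the second language."""
    return "You said: " + message.strip()


def handle_message(message: str) -> str:
    """Return either the translation or the conversational response to the client."""
    if is_first_language(message):
        return translate(message)
    return converse(message)
```

For example, `handle_message("你好")` takes the translation branch, while `handle_message("hello")` takes the conversational branch.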

    METHOD AND APPARATUS FOR BUILDING A LANGUAGE MODEL
    39.
    Patent application (status: active)

    Publication No.: US20140358539A1

    Publication date: 2014-12-04

    Application No.: US14181263

    Filing date: 2014-02-14

    CPC classification number: G10L15/063 G10L15/183 G10L15/197

    Abstract: A method includes: acquiring data samples; performing categorized sentence mining in the acquired data samples to obtain categorized training samples for multiple categories; building a text classifier based on the categorized training samples; classifying the data samples using the text classifier to obtain a class vocabulary and a corpus for each category; mining the corpus for each category according to the class vocabulary for the category to obtain a respective set of high-frequency language templates; training on the templates for each category to obtain a template-based language model for the category; training on the corpus for each category to obtain a class-based language model for the category; training on the class vocabulary for each category to obtain a lexicon-based language model for the category; and building a speech decoder according to an acoustic model, the class-based language model and the lexicon-based language model for any given field, and the data samples.

    Systems and Methods for Adding Punctuations
    40.
    Patent application (status: active)

    Publication No.: US20140350939A1

    Publication date: 2014-11-27

    Application No.: US14160808

    Filing date: 2014-01-22

    Abstract: Systems and methods are provided for adding punctuations. For example, one or more first feature units are identified in a voice file taken as a whole; the voice file is divided into multiple segments; one or more second feature units are identified in the voice file; a first aggregate weight of first punctuation states of the voice file and a second aggregate weight of second punctuation states of the voice file are determined, using a language model established based on word separation and third semantic features; a weighted calculation is performed to generate a third aggregate weight based on at least information associated with the first aggregate weight and the second aggregate weight; and one or more final punctuations are added to the voice file based on at least information associated with the third aggregate weight.
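
The weighted calculation that merges the first and second aggregate weights into a third can be illustrated with a small Python sketch. The abstract does not say how the weights are combined, so the linear interpolation with a factor `alpha`, the dictionary representation of punctuation states, and the function names `combine_weights` and `pick_punctuation` are all assumptions made for illustration.

```python
from typing import Dict


def combine_weights(first: Dict[str, float],
                    second: Dict[str, float],
                    alpha: float = 0.5) -> Dict[str, float]:
    """Assumed weighted calculation: linear interpolation of the two aggregate weights.

    `first` holds weights from the whole-file pass, `second` from the segmented pass;
    each maps a candidate punctuation state to its aggregate weight.
    """
    states = set(first) | set(second)
    return {
        state: alpha * first.get(state, 0.0) + (1.0 - alpha) * second.get(state, 0.0)
        for state in states
    }


def pick_punctuation(first: Dict[str, float],
                     second: Dict[str, float],
                     alpha: float = 0.5) -> str:
    """Choose the punctuation state with the highest combined (third) weight."""
    combined = combine_weights(first, second, alpha)
    # Iterate over sorted states so that ties are broken deterministically.
    return max(sorted(combined), key=lambda state: combined[state])
```

Raising `alpha` makes the whole-file pass dominate, while lowering it favors the segment-level pass; the right balance would have to be tuned on real data.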
