Patent search ap:("Mazin GILBERT") AND inv:"Mazin GILBERT" Page 1

1.

发明申请
TRANSPARENT VOICE REGISTRATION AND VERIFICATION METHOD AND SYSTEM 有权
Title translation: 透明语音注册和验证方法与系统

公开(公告)号：US20100027767A1

公开(公告)日：2010-02-04

申请号：US12182182

申请日：2008-07-30

Applicant: Mazin GILBERT

Inventor： Mazin GILBERT

IPC: H04M1/64 , G10L11/00 , G10L15/00

CPC classification number: H04M3/4936 , G10L17/04 , G10L17/14 , H04M3/385 , H04M3/42221 , H04M3/51 , H04M2201/36 , H04M2201/41 , H04M2203/6045

Abstract: Transparent voice registration of a party is provided in order to provide voice verification for communications with a service center. Verbal communication spoken by a party during interaction between the party and an agent of the service center is captured. A voice model associated with the captured communication is created and stored in order to provide voice verification during a subsequent call to the service center. When a requester contacts the service center, a comparison of the voice of the requester and a voice model of the person that the requester claims to be is performed, in order to verify the identity of the requester. Additionally, a voice model associated with a party is automatically updated after a subsequent communication between the party and the service center.

Abstract translation: 提供方的透明语音注册，以便为与服务中心的通信提供语音验证。抓捕一方当事人与服务中心代理人进行交流时口头沟通。创建并存储与所捕获的通信相关联的语音模型，以便在后续呼叫服务中心期间提供语音验证。当请求者联系服务中心时，执行请求者的语音与请求者所声称的人的语音模型的比较，以便验证请求者的身份。此外，与一方相关联的语音模型在该方与服务中心之间的后续通信之后自动更新。

2.

发明申请
SYSTEM AND METHOD FOR IMPROVED AUTOMATIC SPEECH RECOGNITION PERFORMANCE 有权
Title translation: 用于改进自动语音识别性能的系统和方法

公开(公告)号：US20110137648A1

公开(公告)日：2011-06-09

申请号：US12631131

申请日：2009-12-04

Applicant: Andrej LJOLJE , Mazin GILBERT

Inventor： Andrej LJOLJE , Mazin GILBERT

IPC: G10L15/00

CPC classification number: G10L15/00 , G10L15/285 , G10L15/32

Abstract: Disclosed herein are systems, methods, and computer-readable storage media for improving automatic speech recognition performance. A system practicing the method identifies idle speech recognition resources and establishes a supplemental speech recognizer on the idle resources based on overall speech recognition demand. The supplemental speech recognizer can differ from a main speech recognizer, and, along with the main speech recognizer, can be associated with a particular speaker. The system performs speech recognition on speech received from the particular speaker in parallel with the main speech recognizer and the supplemental speech recognizer and combines results from the main and supplemental speech recognizer. The system recognizes the received speech based on the combined results. The system can use beam adjustment in place of or in combination with a supplemental speech recognizer. A scheduling algorithm can tailor a particular combination of speech recognition resources and release the supplemental speech recognizer based on increased demand.

Abstract translation: 本文公开了用于改善自动语音识别性能的系统，方法和计算机可读存储介质。实施该方法的系统识别空闲语音识别资源，并且基于总体语音识别需求在空闲资源上建立补充语音识别器。补充语音识别器可以与主语音识别器不同，并且与主语音识别器一起可以与特定扬声器相关联。该系统与主语音识别器和辅助语音识别器并行地执行从特定扬声器接收的语音的语音识别，并且组合来自主语音识别器和补充语音识别器的结果。系统基于组合的结果识别接收到的语音。该系统可以使用波束调整来代替或与补充语音识别器组合。调度算法可以定制语音识别资源的特定组合，并且基于增加的需求来释放补充语音识别器。

3.

发明申请
SYSTEM AND METHOD FOR IMPROVING ROBUSTNESS OF SPEECH RECOGNITION USING VOCAL TRACT LENGTH NORMALIZATION CODEBOOKS 有权
Title translation: 使用VOCAL TRACT LENGTH NORMALIZATION CODEBOOKS来提高语音识别的鲁棒性的系统和方法

公开(公告)号：US20100324893A1

公开(公告)日：2010-12-23

申请号：US12869039

申请日：2010-08-26

Applicant: Mazin GILBERT

Inventor： Mazin GILBERT

IPC: G10L15/00

CPC classification number: G10L15/07

Abstract: Disclosed are systems, methods, and computer readable media for performing speech recognition. The method embodiment comprises selecting a codebook from a plurality of codebooks with a minimal acoustic distance to a received speech sample, the plurality of codebooks generated by a process of (a) computing a vocal tract length for a each of a plurality of speakers, (b) for each of the plurality of speakers, clustering speech vectors, and (c) creating a codebook for each speaker, the codebook containing entries for the respective speaker's vocal tract length, speech vectors, and an optional vector weight for each speech vector, (2) applying the respective vocal tract length associated with the selected codebook to normalize the received speech sample for use in speech recognition, and (3) recognizing the received speech sample based on the respective vocal tract length associated with the selected codebook.

Abstract translation: 公开了用于执行语音识别的系统，方法和计算机可读介质。方法实施例包括从具有对接收到的语音样本的最小声距离的多个码本中选择码本，所述多个码本通过（a）计算多个扬声器中的每一个的声道长度的处理而生成（ b）对于所述多个扬声器中的每一个，聚类语音向量，以及（c）为每个说话者创建码本，所述码本包含用于每个语音向量的相应说话者声道长度，语音向量和可选矢量权重的条目，（2）应用与所选码本相关联的相应声道长度，以规范化用于语音识别的接收到的语音样本，以及（3）基于与所选码本相关联的相应声道长度来识别所接收的语音样本。

4.

发明申请
SYSTEM AND METHOD FOR CREATING A SPEECH SEARCH PLATFORM FOR COUPONS 审中-公开
Title translation: 用于创建语音搜索平台的系统和方法

公开(公告)号：US20100070360A1

公开(公告)日：2010-03-18

申请号：US12339981

申请日：2008-12-19

Applicant: Mazin GILBERT , Jay Wilpon

Inventor： Mazin GILBERT , Jay Wilpon

IPC: G06Q30/00 , G06F17/30 , G06F17/40 , G06F7/08

CPC classification number: G06Q30/02 , G06Q30/0217

Abstract: Disclosed herein are systems, methods, and computer readable-media for creating a speech search platform for coupons. The method includes receiving coupons from vendors, generating indexing information about the received coupons for use with speech searches, integrating the received coupons and respective indexing information into a database accessible through a Representational State Transfer (REST) Application Programming Interface (API) as part of a speech search platform for coupons, receiving from a user a natural language query through the speech search platform for coupons, identifying coupons in the database which match the natural language query based on location and a user profile, and transmitting the identified coupons to the user. The method can further include modifying the REST API to include coupon-specific parameters. Identified coupons can be transmitted to the consumer by notifying a coupon issuer that the user is entitled to a discount.

Abstract translation: 本文公开了用于创建优惠券的语音搜索平台的系统，方法和计算机可读介质。该方法包括从供应商接收优惠券，产生关于所接收的优惠券的索引信息，以便与语音搜索一起使用，将接收的优惠券和相应的索引信息集成到可通过表征状态转移（REST）应用编程接口（API）访问的数据库中，作为用于优惠券的语音搜索平台，通过用于优惠券的语音搜索平台从用户接收自然语言查询，基于位置和用户简档识别与自然语言查询匹配的数据库中的优惠券，以及将所识别的优惠券发送给用户。该方法还可以包括修改REST API以包括优惠券特定的参数。可以通过通知发行人证明用户有权享受折扣，可以将识别的优惠券传送给消费者。

5.

发明申请
SYSTEM AND METHOD FOR OPEN SPEECH RECOGNITION 有权
Title translation: 用于开放语音识别的系统和方法

公开(公告)号：US20120084086A1

公开(公告)日：2012-04-05

申请号：US12895359

申请日：2010-09-30

Applicant: Mazin GILBERT , Srinivas Bangalore , Patrick Haffner , Robert Bell

Inventor： Mazin GILBERT , Srinivas Bangalore , Patrick Haffner , Robert Bell

IPC: G10L15/26

CPC classification number: G10L15/32 , G10L15/063 , G10L15/26 , G10L2015/0638

Abstract: Disclosed herein are systems, methods and non-transitory computer-readable media for performing speech recognition across different applications or environments without model customization or prior knowledge of the domain of the received speech. The disclosure includes recognizing received speech with a collection of domain-specific speech recognizers, determining a speech recognition confidence for each of the speech recognition outputs, selecting speech recognition candidates based on a respective speech recognition confidence for each speech recognition output, and combining selected speech recognition candidates to generate text based on the combination.

Abstract translation: 本文公开了用于在不需要模型定制或接收到的语音的领域的先前知识的情况下在不同的应用或环境上执行语音识别的系统，方法和非暂时的计算机可读介质。该公开内容包括：利用特定领域的语音识别器的集合来识别接收的语音，为每个语音识别输出确定语音识别置信度，基于每个语音识别输出的相应语音识别置信度选择语音识别候选，以及组合所选语音识别候选人基于组合生成文本。

6.

发明申请
TRANSPARENT VOICE REGISTRATION AND VERIFICATION METHOD AND SYSTEM 有权
Title translation: 透明语音注册和验证方法与系统

公开(公告)号：US20120051525A1

公开(公告)日：2012-03-01

申请号：US13292436

申请日：2011-11-09

Applicant: Mazin GILBERT

Inventor： Mazin GILBERT

IPC: G10L17/00 , H04M1/64

CPC classification number: H04M3/4936 , G10L17/04 , G10L17/14 , H04M3/385 , H04M3/42221 , H04M3/51 , H04M2201/36 , H04M2201/41 , H04M2203/6045

Abstract: A method includes registering a voice of a party in order to provide voice verification for communications with an entity. A call is received from a party at a voice response system. The party is prompted for information and verbal communication spoken by the party is captured. A voice model associated with the party is created by processing the captured verbal communication spoken by the party and is stored. The identity of the party is verified and a previously stored voice model of the party, registered during a previous call from the party, is updated. The creation of the voice model is imperceptible to the party.

Abstract translation: 一种方法包括登记一方的声音以便为与实体的通信提供语音验证。从语音响应系统的一方接收到呼叫。党被提示信息和党的口头沟通被捕获。与派对相关联的语音模型是通过处理由该方所说出的所捕获的口头通信来存储的。验证方的身份，并更新先前存储的在该方的先前呼叫中注册的该方的语音模型。声音模式的创建是党无法察觉的。

7.

发明申请
On-Demand Language Translation for Television Programs 有权
Title translation: 电视节目的按需语言翻译

公开(公告)号：US20100217580A1

公开(公告)日：2010-08-26

申请号：US12772580

申请日：2010-05-03

Applicant: Srinivas BANGALORE , David Crawford GIBBON , Mazin GILBERT , Patrick Guy HAFFNER , Zhu LIU , Behzad SHAHRARAY

Inventor： Srinivas BANGALORE , David Crawford GIBBON , Mazin GILBERT , Patrick Guy HAFFNER , Zhu LIU , Behzad SHAHRARAY

IPC: G06F17/28

CPC classification number: H04N21/4856 , G06F17/289 , G10L21/10 , H04N7/0885 , H04N7/17336 , H04N21/2355 , H04N21/2543 , H04N21/25435 , H04N21/4355 , H04N21/440236 , H04N21/4882 , H04N21/4884 , H04N21/8193

Abstract: A method, a system and a machine-readable medium are provided for an on demand translation service. A translation module including at least one language pair module for translating a source language to a target language may be made available for use by a subscriber. The subscriber may be charged a fee for use of the requested on demand translation service or may be provided use of the on demand translation service for free in exchange for displaying commercial messages to the subscriber. A video signal may be received including information in the source language, which may be obtained as text from the video signal and may be translated from the source language to the target language by use of the translation module. Translated information, based on the translated text, may be added into the received video signal. The video signal including the translated information in the target language may be sent to a display device.

Abstract translation: 为按需翻译服务提供方法，系统和机器可读介质。包括用于将源语言翻译成目标语言的至少一个语言对模块的翻译模块可以被用户使用。用户可能会收取使用所请求的按需翻译服务的费用，或者可以免费使用按需翻译服务，以便向用户显示商业消息。可以接收包括源语言的信息的视频信号，其可以从视频信号获取为文本，并且可以通过使用翻译模块从源语言翻译成目标语言。基于翻译文本的翻译信息可以被添加到接收的视频信号中。可以将包括目标语言的翻译信息的视频信号发送到显示装置。

8.

发明申请
METHOD AND APPARATUS FOR IDENTIFYING ACOUSTIC BACKGROUND ENVIRONMENTS TO ENHANCE AUTOMATIC SPEECH RECOGNITION 有权
Title translation: 识别声音背景环境的方法和装置，以增强自动语音识别

公开(公告)号：US20080300871A1

公开(公告)日：2008-12-04

申请号：US11754814

申请日：2007-05-29

Applicant: Mazin GILBERT

Inventor： Mazin GILBERT

IPC: G10L15/00

CPC classification number: G10L15/20 , G10L15/065 , G10L15/07 , G10L15/08 , G10L15/26 , G10L15/30 , G10L21/0216 , G11B27/034

Abstract: Disclosed are systems, methods, and computer readable media for identifying an acoustic environment of a caller. The method embodiment comprises analyzing acoustic features of a received audio signal from a caller, receiving meta-data information, classifying a background environment of the caller based on the analyzed acoustic features and the meta-data, selecting an acoustic model matched to the classified background environment from a plurality of acoustic models, and performing speech recognition as the received audio signal using the selected acoustic model.

Abstract translation: 公开了用于识别呼叫者的声学环境的系统，方法和计算机可读介质。方法实施例包括分析来自呼叫者的接收音频信号的声学特征，接收元数据信息，基于分析的声学特征和元数据对呼叫者的背景环境进行分类，选择与分类背景相匹配的声学模型环境，并且使用所选择的声学模型来执行语音识别作为所接收的音频信号。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification