Method of speech recognition using empirically determined word candidates
    1.
    发明授权
    Method of speech recognition using empirically determined word candidates 有权
    使用经验确定的词候选语音识别方法

    公开(公告)号:US06963834B2

    公开(公告)日:2005-11-08

    申请号:US09867197

    申请日:2001-05-29

    IPC分类号: G10L15/08 G10L15/22 G10L15/26

    CPC分类号: G10L15/08 G10L2015/221

    摘要: A method for performing speech recognition can include determining a recognition result for received user speech. The recognition result can include recognized text and a corresponding confidence score. The confidence score of the recognition result can correspond to a predetermined minimum threshold. If the confidence score does not exceed the predetermined minimum threshold, the user can be presented with at least one empirically determined alternate word candidate corresponding to the recognition result.

    摘要翻译: 用于执行语音识别的方法可以包括确定用于接收到的用户语音的识别结果。 识别结果可以包括识别的文本和相应的置信度得分。 识别结果的置信度分数可以对应于预定的最小阈值。 如果置信度分数不超过预定的最小阈值,则可以向用户呈现与识别结果相对应的至少一个经验确定的替代词候选。

    Method and system for speech recognition using phonetically similar word alternatives
    2.
    发明授权
    Method and system for speech recognition using phonetically similar word alternatives 有权
    使用语音相似词替代语言进行语音识别的方法和系统

    公开(公告)号:US06910012B2

    公开(公告)日:2005-06-21

    申请号:US09858741

    申请日:2001-05-16

    IPC分类号: G10L15/06 G10L15/18 G10L15/02

    CPC分类号: G10L15/19 G10L2015/0636

    摘要: A method for performing speech recognition can include the steps of providing a grammar including entries comprising a parent word and a pseudo word being substantially phonetically equivalent to the parent word. The grammar can provide a translation from the pseudo word to the parent word. The parent word can be received as speech and the speech can be compared to the grammar entries. Additionally, the speech can be matched to the pseudo word and the pseudo word can be translated to the parent word.

    摘要翻译: 一种用于执行语音识别的方法可以包括以下步骤:提供包括条目的语法,所述条目包括父词和基本上在语音上等同于父词的伪词。 语法可以提供从伪词到父词的翻译。 母语可以作为语音接收,语音可以与语法条目进行比较。 另外,语音可以与伪字匹配,并且伪字可以被翻译成母字。

    METHOD AND SYSTEM FOR TRIMMING AUDIO FILES
    3.
    发明申请
    METHOD AND SYSTEM FOR TRIMMING AUDIO FILES 失效
    用于调整音频文件的方法和系统

    公开(公告)号:US20080215316A1

    公开(公告)日:2008-09-04

    申请号:US12029343

    申请日:2008-02-11

    IPC分类号: G10L19/14

    摘要: A system for automatically trimming an audio files based upon textual content associated with the audio file is provided. The source of the textual content may be an electronic document or written language text. The textual content may include predefined hints, a text mark, or end-of-phrase punctuation mark. The system generates a trimming instruction based upon textual content corresponding to the audio file, and the audio file is trimmed based upon the trimming instruction.

    摘要翻译: 提供了一种基于与音频文件相关联的文本内容来自动修整音频文件的系统。 文本内容的来源可以是电子文档或书面语言文本。 文本内容可以包括预定义的提示,文本标记或短语结尾的标点符号。 该系统基于与音频文件对应的文本内容生成修剪指令,并且基于修剪指令修剪音频文件。

    Speech recognition system for database access through the use of data domain overloading of grammars
    5.
    发明授权
    Speech recognition system for database access through the use of data domain overloading of grammars 有权
    语音识别系统通过使用数据库访问语法重写语法

    公开(公告)号:US06662157B1

    公开(公告)日:2003-12-09

    申请号:US09596770

    申请日:2000-06-19

    IPC分类号: G10L1526

    CPC分类号: G10L15/19

    摘要: A method for voice data entry availability in a voice response system can include receiving speech input specifying data in an audio user interface to a data information system for processing data in a data store. The speech input can be received through an audio user interface to the data information system. Subsequently, speech-to-text conversion of the speech input can be performed using a speech recognition engine with reference to a corresponding speech grammar. In particular, the speech grammar can contain a data set of words relating to the data information system. Notably, the data store can contain a subset of the data set, the subset having words which can be processed by the data information system, the subset not having words which cannot be processed by the data information system. If the specified data is included in the speech grammar and if the specified data is in the data store, the speech data in the speech query can be processed. However, if the specified data is not in the data store, it can be reported that the specified data cannot be processed. Finally, if the specified data is not included in the speech grammar, an Out-Of-Grammar (OOG) condition can be reported. Additionally, the speech data in the speech query is not processed.

    摘要翻译: 用于话音响应系统中的话音数据输入可用性的方法可以包括在数据信息系统的音频用户界面中接收指定数据的语音输入,以处理数据存储中的数据。 语音输入可以通过音频用户界面接收到数据信息系统。 随后,可以使用语音识别引擎参考对应的语音语法来执行语音输入的语音到文本转换。 特别地,语音语法可以包含与数据信息系统相关的单词的数据集。 值得注意的是,数据存储可以包含数据集的子集,该子集具有可由数据信息系统处理的字,该子集不具有数据信息系统不能处理的字。 如果指定的数据被包括在语音语法中,并且如果指定的数据在数据存储器中,则可以处理语音查询中的语音数据。 但是,如果指定的数据不在数据存储中,则可以报告无法处理指定的数据。 最后,如果指定的数据不包括在语音语法中,则可以报告语法外(OOG)。 此外,语音查询中的语音数据未被处理。

    METHODS AND APPARATUS FOR VOICE-ENABLING A WEB APPLICATION
    6.
    发明申请
    METHODS AND APPARATUS FOR VOICE-ENABLING A WEB APPLICATION 有权
    用于语音启动WEB应用程序的方法和设备

    公开(公告)号:US20140039885A1

    公开(公告)日:2014-02-06

    申请号:US13565216

    申请日:2012-08-02

    IPC分类号: G10L15/00

    摘要: Methods and apparatus for voice-enabling a web application, wherein the web application includes one or more web pages rendered by a web browser on a computer. At least one information source external to the web application is queried to determine whether information describing a set of one or more supported voice interactions for the web application is available, and in response to determining that the information is available, the information is retrieved from the at least one information source. Voice input for the web application is then enabled based on the retrieved information.

    摘要翻译: 用于语音使能web应用的方法和装置,其中所述web应用包括由计算机上的web浏览器呈现的一个或多个网页。 询问Web应用程序外部的至少一个信息源,以确定描述web应用程序的一个或多个支持的语音交互的集合的信息是否可用,并且响应于确定该信息可用,响应于从 至少有一个信息源。 然后基于检索到的信息启用Web应用程序的语音输入。

    Method and apparatus for coupling a visual browser to a voice browser

    公开(公告)号:US07080315B1

    公开(公告)日:2006-07-18

    申请号:US09605612

    申请日:2000-06-28

    IPC分类号: G06F15/00

    摘要: A method and apparatus for concurrently accessing network-based electronic content in a Voice Browser and a Visual Browser can include the steps of retrieving a network-based document formatted for display in the Visual Browser; identifying in the retrieved document a reference to the Voice Browser, the reference specifying electronic content formatted for audible presentation in the Voice Browser; and, transmitting the reference to the Voice Browser. The Voice Browser can retrieve the specified electronic content and audibly present the electronic content. Concurrently, the Visual Browser can visually present the network-based document formatted for visual presentation in the Visual Browser. Likewise, the method of the invention can include the steps of retrieving a network-based document formatted for audible presentation in the Voice Browser; identifying in the retrieved document a reference to the Visual Browser, the reference specifying electronic content formatted for visual presentation in the Visual Browser; and, transmitting the reference to the Visual Browser. The Visual Browser can retrieve the specified electronic content and visually present the specified electronic content. Concurrently, the Voice Browser can audibly present the network-based document formatted for audible presentation in the Voice Browser.

    Voice over IP protocol based speech system
    9.
    发明授权
    Voice over IP protocol based speech system 有权
    基于语音IP协议的语音系统

    公开(公告)号:US06654722B1

    公开(公告)日:2003-11-25

    申请号:US09596769

    申请日:2000-06-19

    IPC分类号: G10L1100

    摘要: A VoIP-enabled speech server can include a speech application which can be configured to communicate with a VoIP telephony gateway server over a VoIP communications path. The VoIP-enabled speech server can also include a VoIP-compliant call control interface to the VoIP telephony gate server, the VoIP-compliant call control interface establishing the VoIP communications path. In operation, the speech application can receive VoIP-compliant packets from the VoIP telephony gateway server over the VoIP communications path. Subsequently, digitized audio data can be reconstructed from the VoIP-compliant packets, and the digitized audio data can be speech-to-text converted. Additionally, text can be synthesized into digitized audio data and the digitized audio data can be encapsulated in VoIP-compliant packets which can be transmitted over the VoIP communications path to the telephony gateway server.

    摘要翻译: 支持VoIP的语音服务器可以包括可配置成通过VoIP通信路径与VoIP电话网关服务器进行通信的语音应用。 支持VoIP的语音服务器还可以包括VoIP电话门控服务器的VoIP兼容呼叫控制接口,VoIP电话控制接口建立VoIP通信路径。 在操作中,语音应用可以通过VoIP通信路径从VoIP电话网关服务器接收VoIP兼容分组。 随后,可以从VoIP兼容分组重建数字化音频数据,并且数字化的音频数据可以进行语音到文本转换。 此外,文本可以被合成为数字音频数据,并且数字化的音频数据可以被封装在可通过VoIP通信路径传送到电话网关服务器的VoIP兼容分组中。

    METHODS AND APPARATUS FOR VOICED-ENABLING A WEB APPLICATION
    10.
    发明申请
    METHODS AND APPARATUS FOR VOICED-ENABLING A WEB APPLICATION 有权
    用于声明启用WEB应用程序的方法和设备

    公开(公告)号:US20140040745A1

    公开(公告)日:2014-02-06

    申请号:US13565000

    申请日:2012-08-02

    IPC分类号: G06F3/16 G06F15/16

    CPC分类号: G06F3/167 G06F17/30873

    摘要: Methods and apparatus for voice-enabling a web application, wherein the web application includes one or more web pages rendered by a web browser on a computer. At least one information source external to the web application is queried to determine whether information describing a set of one or more supported voice interactions for the web application is available, and in response to determining that the information is available, the information is retrieved from the at least one information source. Voice input for the web application is then enabled based on the retrieved information.

    摘要翻译: 用于语音使能web应用的方法和装置,其中所述web应用包括由计算机上的web浏览器呈现的一个或多个网页。 询问Web应用程序外部的至少一个信息源,以确定描述web应用程序的一个或多个支持的语音交互的集合的信息是否可用,并且响应于确定该信息可用,响应于从 至少有一个信息源。 然后基于检索到的信息启用Web应用程序的语音输入。