SYSTEM AND METHODS FOR PROMPTING USER SPEECH IN MULTIMODAL DEVICES
    1.
    Type: Patent application
    Legal status: In force

    Publication number: US20080162143A1

    Publication date: 2008-07-03

    Application number: US11616682

    Filing date: 2006-12-27

    CPC classification number: G06F3/167 G10L15/22 G10L2015/228

    Abstract: A method for prompting user input for a multimodal interface including the steps of providing a multimodal interface to a user, where the interface includes a visual interface having a plurality of input regions, each having at least one input field; selecting an input region and processing a multi-token speech input provided by the user, where the processed speech input includes at least one value for at least one input field of the selected input region; and storing at least one value in at least one input field.

    SYSTEMS AND METHODS FOR PROMPTING USER SPEECH IN MULTIMODAL DEVICES
    2.
    Type: Patent application
    Legal status: Under examination (published)

    Publication number: US20130227417A1

    Publication date: 2013-08-29

    Application number: US13847974

    Filing date: 2013-03-20

    Abstract: A method for prompting user input for a multimodal interface including the steps of providing a multimodal interface to a user, where the interface includes a visual interface having a plurality of input regions, each having at least one input field; selecting an input region and processing a multi-token speech input provided by the user, where the processed speech input includes at least one value for at least one input field of the selected input region; and storing at least one value in at least one input field.

    System and methods for prompting user speech in multimodal devices
    3.
    Type: Granted patent
    Legal status: In force

    Publication number: US08417529B2

    Publication date: 2013-04-09

    Application number: US11616682

    Filing date: 2006-12-27

    CPC classification number: G06F3/167 G10L15/22 G10L2015/228

    Abstract: A method for prompting user input for a multimodal interface including the steps of providing a multimodal interface to a user, where the interface includes a visual interface having a plurality of input regions, each having at least one input field; selecting an input region and processing a multi-token speech input provided by the user, where the processed speech input includes at least one value for at least one input field of the selected input region; and storing at least one value in at least one input field.

    Automatic speech recognition with a selection list
    4.
    Type: Granted patent
    Legal status: In force

    Publication number: US08612230B2

    Publication date: 2013-12-17

    Application number: US11619209

    Filing date: 2007-01-03

    Abstract: Methods, apparatus, and computer program products are described for automatic speech recognition (‘ASR’) that include accepting by the multimodal application speech input and visual input for selecting or deselecting items in a selection list, the speech input enabled by a speech recognition grammar; providing, from the multimodal application to the grammar interpreter, the speech input and the speech recognition grammar; receiving, by the multimodal application from the grammar interpreter, interpretation results including matched words from the grammar that correspond to items in the selection list and a semantic interpretation token that specifies whether to select or deselect items in the selection list; and determining, by the multimodal application in dependence upon the value of the semantic interpretation token, whether to select or deselect items in the selection list that correspond to the matched words.
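
The final step of this abstract, deciding from a semantic interpretation token whether matched words select or deselect list items, might look like the sketch below. The function name `apply_interpretation` and the token values `"select"` and `"deselect"` are assumptions made for illustration; the abstract does not specify how the token is encoded, and the grammar interpreter is not modeled.

```python
def apply_interpretation(selected, items, matched_words, token):
    """Return a new selection set after applying a select/deselect token."""
    # Only matched words that correspond to items in the list are considered.
    matched = {w for w in matched_words if w in items}
    if token == "select":
        return selected | matched
    if token == "deselect":
        return selected - matched
    raise ValueError(f"unknown semantic interpretation token: {token!r}")


# Assumed example: a pizza-topping selection list.
items = ["pepperoni", "mushrooms", "olives"]
selected = set()
selected = apply_interpretation(selected, items, ["pepperoni", "olives"], "select")
selected = apply_interpretation(selected, items, ["olives"], "deselect")
print(sorted(selected))  # ['pepperoni']
```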

    AUTOMATIC SPEECH RECOGNITION WITH A SELECTION LIST
    5.
    Type: Patent application
    Legal status: In force

    Publication number: US20080162136A1

    Publication date: 2008-07-03

    Application number: US11619209

    Filing date: 2007-01-03

    Abstract: Methods, apparatus, and computer program products are described for automatic speech recognition (‘ASR’) that include accepting by the multimodal application speech input and visual input for selecting or deselecting items in a selection list, the speech input enabled by a speech recognition grammar; providing, from the multimodal application to the grammar interpreter, the speech input and the speech recognition grammar; receiving, by the multimodal application from the grammar interpreter, interpretation results including matched words from the grammar that correspond to items in the selection list and a semantic interpretation token that specifies whether to select or deselect items in the selection list; and determining, by the multimodal application in dependence upon the value of the semantic interpretation token, whether to select or deselect items in the selection list that correspond to the matched words.

    Improving speech capabilities of a multimodal application
    6.
    Type: Granted patent
    Legal status: In force

    Publication number: US08380513B2

    Publication date: 2013-02-19

    Application number: US12468166

    Filing date: 2009-05-19

    CPC classification number: G10L15/22 G10L15/187 G10L15/19 G10L2015/228

    Abstract: Improving speech capabilities of a multimodal application including receiving, by the multimodal browser, a media file having a metadata container; retrieving, by the multimodal browser, from the metadata container a speech artifact related to content stored in the media file for inclusion in the speech engine available to the multimodal browser; determining whether the speech artifact includes a grammar rule or a pronunciation rule; if the speech artifact includes a grammar rule, modifying, by the multimodal browser, the grammar of the speech engine to include the grammar rule; and if the speech artifact includes a pronunciation rule, modifying, by the multimodal browser, the lexicon of the speech engine to include the pronunciation rule.
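
The branching described in this abstract (a grammar rule extends the engine's grammar, a pronunciation rule extends its lexicon) can be sketched as follows. The `SpeechEngine` class, the dictionary layout of an artifact, and the `kind` values are all hypothetical stand-ins; the abstract does not describe the metadata container's format or the engine's API.

```python
from dataclasses import dataclass, field


@dataclass
class SpeechEngine:
    """Toy stand-in for a speech engine with a grammar and a lexicon."""
    grammar: list = field(default_factory=list)   # grammar rules
    lexicon: dict = field(default_factory=dict)   # word -> pronunciation


def apply_artifact(engine, artifact):
    """Route a speech artifact to the grammar or the lexicon by its kind."""
    kind = artifact.get("kind")
    if kind == "grammar_rule":
        engine.grammar.append(artifact["rule"])
    elif kind == "pronunciation_rule":
        engine.lexicon[artifact["word"]] = artifact["phonemes"]
    else:
        raise ValueError(f"unsupported speech artifact kind: {kind!r}")
    return engine


# Assumed example: artifacts already read from a media file's metadata container.
engine = SpeechEngine()
apply_artifact(engine, {"kind": "grammar_rule", "rule": "play <song_title>"})
apply_artifact(engine, {"kind": "pronunciation_rule",
                        "word": "Sade", "phonemes": "SH AA D EY"})
print(engine.grammar, engine.lexicon)
```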

    METHOD AND ARRANGEMENT FOR MANAGING GRAMMAR OPTIONS IN A GRAPHICAL CALLFLOW BUILDER
    7.
    Type: Patent application
    Legal status: In force

    Publication number: US20120209613A1

    Publication date: 2012-08-16

    Application number: US13344193

    Filing date: 2012-01-05

    CPC classification number: G10L2015/228

    Abstract: A method (10) in a speech recognition application callflow can include the steps of assigning (11) an individual option and a pre-built grammar to a same prompt, treating (15) the individual option as a valid output of the pre-built grammar if the individual option is a potential valid match to a recognition phrase (12) or an annotation (13) in the pre-built grammar, and treating (14) the individual option as an independent grammar from the pre-built grammar if the individual option fails to be a potential valid match to the recognition phrase or the annotation in the pre-built grammar.
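
The decision in this abstract can be read as a lookup: an individual option counts as output of the pre-built grammar when it matches one of that grammar's recognition phrases or annotations, and is otherwise handled as its own grammar. The sketch below assumes a pre-built grammar represented as a phrase-to-annotation dictionary and a case-insensitive exact match; both are simplifications chosen for illustration, and the patent may define "potential valid match" differently.

```python
def classify_option(option, prebuilt):
    """Classify an individual option against a pre-built grammar.

    prebuilt maps each recognition phrase to its annotation (assumed layout).
    """
    key = option.strip().lower()
    phrases = {phrase.lower() for phrase in prebuilt}
    annotations = {annotation.lower() for annotation in prebuilt.values()}
    if key in phrases or key in annotations:
        return "valid output of pre-built grammar"
    return "independent grammar"


# Assumed example: a yes/no grammar attached to the same prompt as two options.
yes_no = {"yes": "YES", "yeah": "YES", "no": "NO"}
print(classify_option("Yeah", yes_no))      # valid output of pre-built grammar
print(classify_option("operator", yes_no))  # independent grammar
```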

    Reducing recording time when constructing a concatenative TTS voice using a reduced script and pre-recorded speech assets
    9.
    Type: Granted patent
    Legal status: In force

    Publication number: US08019605B2

    Publication date: 2011-09-13

    Application number: US11748256

    Filing date: 2007-05-14

    CPC classification number: G10L13/04

    Abstract: The present invention discloses a system and a method for creating a reduced script, which is read by a voice talent to create a concatenative text-to-speech (TTS) voice. The method can automatically process pre-recorded audio to derive speech assets for a concatenative TTS voice. The pre-recorded audio can include sets of recorded phrases used by a speech user interface (SUI). A set of unfulfilled speech assets needed for full phonetic coverage of the concatenative TTS voice can be determined. A reduced script can be constructed that includes a set of phrases, which when read by a voice talent result in a reduced corpus. When the reduced corpus is automatically processed, a reduced set of speech assets results. The reduced set includes each of the unfulfilled speech assets. When this reduced corpus is combined with existing speech assets, the result will be a voice with a complete set of speech assets.
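
Choosing phrases that cover the unfulfilled speech assets resembles a set-cover problem. The sketch below uses a greedy heuristic, which is a technique swapped in for illustration: the abstract does not say how the reduced script is chosen. The function name, the diphone-style asset labels, and the candidate phrases are all hypothetical.

```python
def build_reduced_script(required, existing, candidates):
    """Greedily pick phrases until every unfulfilled speech asset is covered.

    required   -- assets needed for full phonetic coverage
    existing   -- assets already derived from pre-recorded audio
    candidates -- phrase -> set of assets that recording the phrase would yield
    """
    unfulfilled = set(required) - set(existing)
    script = []
    while unfulfilled:
        # Pick the phrase that covers the most still-missing assets.
        best = max(candidates, key=lambda p: len(candidates[p] & unfulfilled))
        gained = candidates[best] & unfulfilled
        if not gained:
            raise ValueError(f"no candidate phrase covers: {sorted(unfulfilled)}")
        script.append(best)
        unfulfilled -= gained
    return script


# Assumed example with made-up asset labels.
required = {"a-b", "b-c", "c-d", "d-e"}
existing = {"a-b"}
candidates = {
    "phrase one": {"b-c", "c-d"},
    "phrase two": {"c-d"},
    "phrase three": {"d-e", "a-b"},
}
print(build_reduced_script(required, existing, candidates))
# ['phrase one', 'phrase three']
```

Greedy selection does not guarantee the shortest possible script, but it keeps the example short and always terminates, either with full coverage or with an error naming the assets no phrase can supply.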

    Disambiguation systems and methods for use in generating grammars
    10.
    Type: Granted patent
    Legal status: In force

    Publication number: US08010343B2

    Publication date: 2011-08-30

    Application number: US11304964

    Filing date: 2005-12-15

    CPC classification number: G06F17/2795 G06F17/30731

    Abstract: A method and system for addressing disambiguation issues in interactive applications by creating a disambiguation system for generating complex grammars that includes homonym detection and grouping, and provides optimization feedback that eliminates time-consuming and repetitive iterative steps during the grammar generation portion of the interactive application configuration.
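
Homonym detection and grouping, as mentioned in this abstract, can be approximated by bucketing grammar entries that share a pronunciation. The sketch assumes a hand-written pronunciation table and the helper name `group_homonyms`; the patent's actual detection method is not described in the abstract.

```python
from collections import defaultdict


def group_homonyms(entries, pronunciations):
    """Group entries with identical pronunciations; return groups of two or more."""
    groups = defaultdict(list)
    for entry in entries:
        groups[pronunciations[entry]].append(entry)
    return [group for group in groups.values() if len(group) > 1]


# Assumed example: surnames in a directory grammar with made-up phoneme strings.
entries = ["Reed", "Read", "Smith", "Reid"]
pronunciations = {
    "Reed": "R IY D",
    "Read": "R IY D",
    "Reid": "R IY D",
    "Smith": "S M IH TH",
}
print(group_homonyms(entries, pronunciations))  # [['Reed', 'Read', 'Reid']]
```

Each returned group marks a spot where the application would need a follow-up disambiguation prompt, since the recognizer alone cannot tell the entries apart.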
