METHOD AND SYSTEM FOR PROVIDING MENU AND OTHER SERVICES FOR AN INFORMATION PROCESSING SYSTEM USING A TELEPHONE OR OTHER AUDIO INTERFACE
    2.
    Invention application (status: under examination, published)

    Publication No.: US20080154601A1

    Publication date: 2008-06-26

    Application No.: US11943549

    Filing date: 2007-11-20

    IPC classification: G10L15/00 G10L13/00 G10L21/00

    CPC classification: G10L15/22

    Abstract: A method and system for providing efficient menu services for an information processing system that uses a telephone or other form of audio user interface. In one embodiment, the menu services provide effective support for novice users by providing a full listing of available keywords and rotating house advertisements which inform novice users of potential features and information. For experienced users, cues are rendered so that at any time the user can say a desired keyword to invoke the corresponding application. The menu is flat to facilitate its usage. Full keyword listings are rendered after the user is given a brief cue to say a keyword. Service messages rotate words and word prosody. When listening to receive information from the user, after the user has been cued, soft background music or other audible signals are rendered to inform the user that a response may now be spoken to the service. Other embodiments determine default cities, on which to report information, based on characteristics of the caller or based on cities that were previously selected by the caller. Other embodiments provide speech concatenation processes that have co-articulation and real-time subject-matter-based word selection which generate human-sounding speech. Other embodiments reduce the occurrences of falsely triggered barge-ins during content delivery by only allowing interruption for certain special words. Other embodiments offer special services and modes for calls having voice recognition trouble. The special services are entered after predetermined criteria have been met by the call. Other embodiments provide special mechanisms for automatically recovering the address of a caller.
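
    The restriction of barge-in to certain special words can be illustrated with a minimal sketch. It is not the patented implementation: the word list, the function name should_interrupt, and the idea of receiving recognizer output as plain text are all assumptions made for illustration.

```python
# Hypothetical set of words allowed to interrupt playback; the abstract
# does not say which words are used.
INTERRUPT_WORDS = frozenset({"stop", "menu", "help"})


def should_interrupt(recognized_text, interrupt_words=INTERRUPT_WORDS):
    """Return True only if the recognized speech contains a special word.

    Anything else heard during content delivery (background noise,
    side conversation) is ignored, so playback continues.
    """
    tokens = recognized_text.lower().split()
    return any(token.strip(".,!?") in interrupt_words for token in tokens)
```

    Under these assumptions, should_interrupt("Stop!") allows the interruption, while should_interrupt("hang on a second") leaves playback running.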


    Handling of speech recognition in a declarative markup language
    3.
    Granted invention patent (status: in force)

    Publication No.: US06941268B2

    Publication date: 2005-09-06

    Application No.: US09887750

    Filing date: 2001-06-21

    IPC classification: G10L15/22 H04M3/493 G10L21/00

    CPC classification: H04M3/4938

    Abstract: Declarative markup languages for speech applications such as VoiceXML are becoming more prevalent programming modalities for describing speech applications. Present declarative markup languages for speech applications model the running speech application as a state machine with the program specifying the transitions amongst the states. These languages can be extended to support a marker-semantic to more easily solve several problems that are otherwise not easily solved. In one embodiment, a partially overlapping target window is implemented using a mark semantic. Other uses include measurement of user listening time, detection and avoidance of errors, and better resumption of playback after a false barge-in.
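
    One way to picture resumption of playback after a false barge-in is to place a mark before each prompt segment and restart from the last mark reached. The sketch below only illustrates that idea: the Segment type, the mark names, and resume_point are hypothetical, and the abstract does not specify how marks are represented.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    mark: str  # hypothetical marker name placed just before this segment
    text: str  # prompt text rendered after the mark


def resume_point(segments, last_mark):
    """Return the segments to replay, starting at the last mark reached.

    If the mark is unknown, fall back to replaying the whole prompt.
    """
    for index, segment in enumerate(segments):
        if segment.mark == last_mark:
            return segments[index:]
    return list(segments)
```

    For example, if playback was interrupted after the second of three marks, resume_point returns the second and third segments rather than restarting the prompt from the beginning.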


    Zero-footprint telephone application development
    4.
    Granted invention patent (status: in force)

    Publication No.: US08612925B1

    Publication date: 2013-12-17

    Application No.: US11548200

    Filing date: 2006-10-10

    IPC classification: G06F9/44

    Abstract: A zero-footprint remotely hosted phone application development environment is described. The environment allows a developer to use a standard computer without any specialized software (in some embodiments all that is necessary is a web browser and network access) together with a telephone to develop sophisticated phone applications that use speech recognition and/or touch-tone inputs to perform tasks, access web-based information, and/or perform commercial transactions. For example, in preparation for a sales pitch for selling hosting services, a non-programmer can develop a short application appropriate to the target customer. After the pitch, access to the demonstration could be given to the target customer to allow them to more fully develop the application. When the target customer is satisfied with the application, they can have their application live for their actual users (as opposed to test users) at a suitable phone number simply by having the hosting provider configure the appropriate access. Once the source code of the phone application is identified to the development environment, the developer can use a telephone to immediately call the application on the hosted development environment. Some embodiments support concurrent call flow tracking that allows a developer to observe, using a web browser, the execution of her/his application. A variety of reusable libraries are provided to enable the developer to leverage well-developed libraries for common playback, input, and computational tasks. This focuses the development on application-specific logic. Embodiments of the invention simplify the process of defining speech recognition grammars within their applications. Embodiments of the invention support rapid application deployment from the development environment to hosted application deployment to the intended audience.


    Providing services for an information processing system using an audio interface
    5.
    Granted invention patent (status: in force)

    Publication No.: US07308408B1

    Publication date: 2007-12-11

    Application No.: US10955216

    Filing date: 2004-09-29

    IPC classification: G10L13/00

    Abstract: A method and system for providing efficient menu services for an information processing system that uses a telephone or other form of audio user interface. In one embodiment, the menu services provide effective support for novice users by providing a full listing of available keywords and rotating house advertisements which inform novice users of potential features and information. For experienced users, cues are rendered so that at any time the user can say a desired keyword to invoke the corresponding application. The menu is flat to facilitate its usage. Full keyword listings are rendered after the user is given a brief cue to say a keyword. Service messages rotate words and word prosody. When listening to receive information from the user, after the user has been cued, soft background music or other audible signals are rendered to inform the user that a response may now be spoken to the service. Other embodiments determine default cities, on which to report information, based on characteristics of the caller or based on cities that were previously selected by the caller. Other embodiments provide speech concatenation processes that have co-articulation and real-time subject-matter-based word selection which generate human-sounding speech. Other embodiments reduce the occurrences of falsely triggered barge-ins during content delivery by only allowing interruption for certain special words. Other embodiments offer special services and modes for calls having voice recognition trouble. The special services are entered after predetermined criteria have been met by the call. Other embodiments provide special mechanisms for automatically recovering the address of a caller.
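
    The default-city selection described in this abstract could be sketched as follows. This is an assumption-laden illustration, not the patented method: it treats the caller's area code as the "characteristic of the caller", prefers the most recently selected city when one exists, and uses a tiny hypothetical lookup table (AREA_CODE_TO_CITY) and function name (default_city).

```python
# Hypothetical lookup table; a real service would use a far larger mapping.
AREA_CODE_TO_CITY = {"415": "San Francisco", "212": "New York"}


def default_city(caller_number, previous_cities):
    """Pick a default city for a caller.

    Preference order (an assumption, not stated in the abstract):
    1. the city the caller selected most recently, if any;
    2. a city inferred from the area code of the caller's number;
    3. None when neither source yields a city.
    """
    if previous_cities:
        return previous_cities[-1]
    digits = "".join(ch for ch in caller_number if ch.isdigit())
    # Drop a leading North American country code such as "+1".
    if len(digits) == 11 and digits.startswith("1"):
        digits = digits[1:]
    return AREA_CODE_TO_CITY.get(digits[:3])
```

    With this sketch, a first-time caller from a 415 number would hear San Francisco information by default, while a caller who previously asked about Boston would get Boston regardless of area code.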


    Providing menu and other services for an information processing system using a telephone or other audio interface
    6.
    Granted invention patent (status: in force)

    Publication No.: US07552054B1

    Publication date: 2009-06-23

    Application No.: US11563112

    Filing date: 2006-11-24

    IPC classification: G10L11/00 G06F15/16

    Abstract: A method and system for providing efficient menu services for an information processing system that uses a telephone or other form of audio user interface. In one embodiment, the menu services provide effective support for novice users by providing a full listing of available keywords and rotating house advertisements which inform novice users of potential features and information. For experienced users, cues are rendered so that at any time the user can say a desired keyword to invoke the corresponding application. The menu is flat to facilitate its usage. Full keyword listings are rendered after the user is given a brief cue to say a keyword. Service messages rotate words and word prosody. When listening to receive information from the user, after the user has been cued, soft background music or other audible signals are rendered to inform the user that a response may now be spoken to the service. Other embodiments determine default cities, on which to report information, based on characteristics of the caller or based on cities that were previously selected by the caller. Other embodiments provide speech concatenation processes that have co-articulation and real-time subject-matter-based word selection which generate human-sounding speech. Other embodiments reduce the occurrences of falsely triggered barge-ins during content delivery by only allowing interruption for certain special words. Other embodiments offer special services and modes for calls having voice recognition trouble. The special services are entered after predetermined criteria have been met by the call. Other embodiments provide special mechanisms for automatically recovering the address of a caller.
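
    The statement that service messages rotate words can be illustrated with a small round-robin sketch. The class name RotatingPrompt and the sample wordings are hypothetical, and prosody rotation is not modeled; the abstract does not say whether rotation is sequential or random, so sequential order is assumed here.

```python
from itertools import cycle


class RotatingPrompt:
    """Serve alternative wordings of the same message in round-robin order."""

    def __init__(self, variants):
        if not variants:
            raise ValueError("at least one wording is required")
        self._variants = cycle(list(variants))

    def next(self):
        """Return the next wording, wrapping around after the last one."""
        return next(self._variants)
```

    A caller who hears the same cue several times in one call would then get, for instance, "Say a keyword." followed by "What would you like?" instead of an identical repetition.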


    Handling of speech recognition in a declarative markup language
    7.
    Granted invention patent (status: in force)

    Publication No.: US07321856B1

    Publication date: 2008-01-22

    Application No.: US11197483

    Filing date: 2005-08-03

    IPC classification: G10L15/22

    CPC classification: G10L15/22

    Abstract: Declarative markup languages for speech applications such as VoiceXML are becoming more prevalent programming modalities for describing speech applications. Present declarative markup languages for speech applications model the running speech application as a state machine with the program specifying the transitions amongst the states. These languages can be extended to support a marker-semantic to more easily solve several problems that are otherwise not easily solved. In one embodiment, a partially overlapping target window is implemented using a mark semantic. Other uses include measurement of user listening time, detection and avoidance of errors, and better resumption of playback after a false barge-in.
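
    Measurement of user listening time with marks might look like the sketch below. It assumes, purely for illustration, that the platform reports the name of the last mark passed and the time elapsed since that mark, and that the application knows each mark's offset from the start of the prompt. The function name and data layout are hypothetical.

```python
def listening_time_ms(mark_offsets_ms, last_mark, ms_since_mark):
    """Estimate how long the user listened before interrupting.

    mark_offsets_ms: offset of each mark from the start of the prompt,
        in milliseconds (hypothetical data supplied by the application).
    last_mark: name of the last mark passed, or None if none was reached.
    ms_since_mark: time elapsed since that mark (or since the start of
        the prompt when last_mark is None).
    """
    if last_mark is None:
        return ms_since_mark
    return mark_offsets_ms[last_mark] + ms_since_mark
```

    For a prompt whose "weather" mark sits 4000 ms in, an interruption 1500 ms after that mark gives an estimated listening time of 5500 ms.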
