Structuring verbal commands to allow concatenation in a voice interface in a mobile device
    1.
    发明授权
    Structuring verbal commands to allow concatenation in a voice interface in a mobile device 有权
    构造语言命令以允许在移动设备中的语音接口中连接

    公开(公告)号:US08452602B1

    公开(公告)日:2013-05-28

    申请号:US13621018

    申请日:2012-09-15

    IPC分类号: G10L21/00 G10L15/04

    摘要: A spoken utterance includes at least a first level of a multi-level command format, in which the first level identifies an application. The spoken utterance may also include a second level of the multi-level command format, in which the second level identifies an action. In response to receiving the spoken utterance at a computing device, a representation of the application identified by the first level is displayed on a display of the computing device. If the spoken utterance includes the second level of the multi-level command format, the action identified by the second level is initiated. If the spoken utterance does not include the second level of the multi-level command format, the computing device waits for a predetermined period of time and provides at least one of an audible or visual action prompt if the second level is not received within the predetermined period of time.

    摘要翻译: 讲话话语包括至少第一级的多级命令格式,其中第一级识别应用。 讲话话语还可以包括多级命令格式的第二级,其中第二级标识动作。 响应于在计算设备处接收到说出的话语,在计算设备的显示器上显示由第一级标识的应用的表示。 如果说出的话语包括多级命令格式的第二级,则启动由第二级标识的动作。 如果说话话语不包括多级命令格式的第二级,则计算设备等待预定的时间段,并且如果在预定的时间段内没有接收到第二级别,则提供听觉或视觉动作提示中的至少一个 一段的时间。

    Layered mobile application user interfaces
    3.
    发明授权
    Layered mobile application user interfaces 有权
    分层移动应用用户界面

    公开(公告)号:US09135914B1

    公开(公告)日:2015-09-15

    申请号:US13621094

    申请日:2012-09-15

    摘要: Disclosed are systems, methods, and devices for providing a layered user interface for one or more applications. A user-interface layer for a voice user interface is generated. The user-interface layer can be based on a markup-language-structured user-interface description for an application configured to execute on a computing device. The user-interface layer can include a command display of one or more voice-accessible commands for the application. The computing device can display at least the user-interface layer of the voice user interface. The computing device can receive an input utterance, obtain input text based upon speech recognition performed upon the input utterance, and determine that the input text corresponds to a voice-accessible command displayed as part of the command display. The computing device can execute the application to perform the command.

    摘要翻译: 公开了用于为一个或多个应用提供分层用户界面的系统,方法和设备。 生成用于语音用户界面的用户界面层。 用户界面层可以基于配置为在计算设备上执行的应用的标记语言结构的用户界面描述。 用户界面层可以包括用于应用的一个或多个语音可访问命令的命令显示。 计算设备至少可以显示语音用户界面的用户界面层。 计算设备可以接收输入话语,基于在输入话语上执行的语音识别获得输入文本,并且确定输入文本对应于作为命令显示的一部分显示的语音可访问命令。 计算设备可以执行应用程序来执行命令。

    Auto Focus
    4.
    发明申请
    Auto Focus 有权
    自动对焦

    公开(公告)号:US20120182381A1

    公开(公告)日:2012-07-19

    申请号:US13274053

    申请日:2011-10-14

    IPC分类号: H04N7/15

    摘要: A method of controlling a user interface to display participants of a call in dependence upon the participants' speech activity in the call, the method including monitoring the speech activity of the participants in the call and determining whether a participant is an active participant or an inactive participant in dependence on the participants' speech activity over a minimum time period of the call. In response to determining whether a participant is an active or inactive participant, an active participant is displayed in a first area of the user interface and an inactive participant is displayed in a second area of the user interface. The first area of the user interface is larger than the second area of the user interface.

    摘要翻译: 一种控制用户界面以根据参与者在呼叫中的语音活动来显示呼叫参与者的方法,所述方法包括监视呼叫中的参与者的语音活动,以及确定参与者是主动参与者还是非活动的 参与者在呼叫的最短时间段内依赖参与者的言语活动。 响应于确定参与者是否是活动或非活动参与者,活动参与者被显示在用户界面的第一区域中,并且非活动参与者被显示在用户界面的第二区域中。 用户界面的第一个区域大于用户界面的第二个区域。

    Auto focus
    5.
    发明授权
    Auto focus 有权
    自动对焦

    公开(公告)号:US08848020B2

    公开(公告)日:2014-09-30

    申请号:US13274053

    申请日:2011-10-14

    IPC分类号: H04N7/14 H04L12/18 H04M3/56

    摘要: A method of controlling a user interface to display participants of a call in dependence upon the participants' speech activity in the call, the method including monitoring the speech activity of the participants in the call and determining whether a participant is an active participant or an inactive participant in dependence on the participants' speech activity over a minimum time period of the call. In response to determining whether a participant is an active or inactive participant, an active participant is displayed in a first area of the user interface and an inactive participant is displayed in a second area of the user interface. The first area of the user interface is larger than the second area of the user interface.

    摘要翻译: 一种控制用户界面以根据参与者在呼叫中的语音活动来显示呼叫参与者的方法,所述方法包括监视呼叫中的参与者的语音活动,以及确定参与者是主动参与者还是非活动的 参与者在呼叫的最短时间段内依赖参与者的言语活动。 响应于确定参与者是否是活动或非活动参与者,活动参与者被显示在用户界面的第一区域中,并且非活动参与者被显示在用户界面的第二区域中。 用户界面的第一个区域大于用户界面的第二个区域。

    Systems And Methods For Continual Speech Recognition And Detection In Mobile Computing Devices
    8.
    发明申请
    Systems And Methods For Continual Speech Recognition And Detection In Mobile Computing Devices 有权
    用于移动计算设备中连续语音识别和检测的系统和方法

    公开(公告)号:US20130085755A1

    公开(公告)日:2013-04-04

    申请号:US13621068

    申请日:2012-09-15

    IPC分类号: G10L15/26

    摘要: The present application describes systems, articles of manufacture, and methods for continuous speech recognition for mobile computing devices. One embodiment includes determining whether a mobile computing device is receiving operating power from an external power source or a battery power source, and activating a trigger word detection subroutine in response to determining that the mobile computing device is receiving power from the external power source. In some embodiments, the trigger word detection subroutine operates continually while the mobile computing device is receiving power from the external power source. The trigger word detection subroutine includes determining whether a plurality of spoken words received via a microphone includes one or more trigger words, and in response to determining that the plurality of spoken words includes at least one trigger word, launching an application corresponding to the at least one trigger word included in the plurality of spoken words.

    摘要翻译: 本申请描述了用于移动计算设备的连续语音识别的系统,制品和方法。 一个实施例包括确定移动计算设备是否从外部电源或电池电源接收工作电力,以及响应于确定移动计算设备正在从外部电源接收电力而激活触发字检测子程序。 在一些实施例中,触发字检测子程序在移动计算设备正在从外部电源接收电力的同时工作。 触发词检测子程序包括确定通过麦克风接收的多个口语单词是否包括一个或多个触发词,并且响应于确定所述多个口语单词包括至少一个触发词,启动与至少一个对应的应用程序 一个触发词包括在多个口语中。

    Hybrid Client/Server Speech Recognition In A Mobile Device
    9.
    发明申请
    Hybrid Client/Server Speech Recognition In A Mobile Device 审中-公开
    移动设备中的混合客户端/服务器语音识别

    公开(公告)号:US20130085753A1

    公开(公告)日:2013-04-04

    申请号:US13586696

    申请日:2012-08-15

    IPC分类号: G10L15/20

    摘要: A computing device is able to use an embedded speech recognizer and a network speech recognizer for speech recognition. In response to detecting speech in the captured audio, the computing device may forward the captured audio to its embedded speech recognizer and to a speech client for the network speech recognizer. The embedded speech recognizer provides an embedded-recognizer result for the captured audio. If a network-recognition criterion is met, the speech client forwards the captured audio to the network speech recognizer and receives a network-recognizer result for the captured audio from the network speech recognizer. A speech recognition result for the captured audio is forwarded to at least one application, wherein the speech recognition result is based on at least one of the embedded-recognizer result and the network-recognizer result.

    摘要翻译: 计算设备能够使用嵌入式语音识别器和用于语音识别的网络语音识别器。 响应于在捕获的音频中检测到语音,计算设备可以将捕获的音频转发到其嵌入式语音识别器和用于网络语音识别器的语音客户端。 嵌入式语音识别器为捕获的音频提供嵌入式识别器结果。 如果满足网络识别标准,则话音客户端将所捕获的音频转发到网络语音识别器,并从网络语音识别器接收所捕获的音频的网络识别器结果。 将捕获的音频的语音识别结果转发到至少一个应用,其中语音识别结果基于嵌入式识别器结果和网络识别器结果中的至少一个。