SYSTEM AND METHOD FOR DISTRIBUTED VOICE MODELS ACROSS CLOUD AND DEVICE FOR EMBEDDED TEXT-TO-SPEECH
    2.
    Invention application (status: granted)

    Publication number: US20160086598A1

    Publication date: 2016-03-24

    Application number: US14953771

    Application date: 2015-11-30

    CPC classification number: G10L13/04 G10L13/047 G10L13/07

    Abstract: Systems, methods, and computer-readable storage media for intelligent caching of concatenative speech units for use in speech synthesis. A system configured to practice the method can identify a speech synthesis context, and determine, based on a local cache of text-to-speech units for a text-to-speech voice and based on the speech synthesis context, additional text-to-speech units which are not in the local cache. The system can request from a server the additional text-to-speech units, and store the additional text-to-speech units in the local cache. The system can then synthesize speech using the text-to-speech units and the additional text-to-speech units in the local cache. The system can prune the cache as the context changes, based on availability of local storage, or after synthesizing the speech. The local cache can store a core set of text-to-speech units associated with the text-to-speech voice that cannot be pruned from the local cache.
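
The caching flow in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class `UnitCache`, the `fetch` callback standing in for the server request, the string unit identifiers, and the byte-concatenation "synthesis" are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, Set


@dataclass
class UnitCache:
    """Hypothetical local cache of text-to-speech units for one voice."""
    core: Dict[str, bytes]                                   # core units, never pruned
    extra: Dict[str, bytes] = field(default_factory=dict)    # context-specific units

    def missing(self, needed: Set[str]) -> Set[str]:
        """Units the current context calls for that are not cached locally."""
        return {u for u in needed if u not in self.core and u not in self.extra}

    def fill(self, needed: Set[str],
             fetch: Callable[[Set[str]], Dict[str, bytes]]) -> Set[str]:
        """Request the missing units (fetch stands in for the server) and store them."""
        to_get = self.missing(needed)
        if to_get:
            self.extra.update(fetch(to_get))
        return to_get

    def prune(self, still_needed: Set[str]) -> None:
        """Drop additional units the context no longer needs; core units stay."""
        for unit_id in list(self.extra):
            if unit_id not in still_needed:
                del self.extra[unit_id]

    def get(self, unit_id: str) -> bytes:
        return self.core[unit_id] if unit_id in self.core else self.extra[unit_id]


def synthesize(cache: UnitCache, unit_ids: Iterable[str]) -> bytes:
    """Toy stand-in for concatenative synthesis: join the cached units in order."""
    return b"".join(cache.get(u) for u in unit_ids)


def fake_server(unit_ids: Set[str]) -> Dict[str, bytes]:
    """Assumed server stub that returns placeholder audio for each requested unit."""
    return {u: u.encode() for u in unit_ids}
```

Calling `fill` with the units a context needs fetches only those absent from both the core set and the additional units; `prune` then removes additional units once the context changes, leaving the core set untouched.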

    SYSTEM AND METHOD FOR SELECTING NETWORK-BASED VERSUS EMBEDDED SPEECH PROCESSING
    3.
    Invention application (status: under examination, published)

    Publication number: US20150120296A1

    Publication date: 2015-04-30

    Application number: US14066105

    Application date: 2013-10-29

    CPC classification number: G10L15/30

    Abstract: Disclosed herein are systems, methods, and computer-readable storage media for making a multi-factor decision whether to process speech or language requests via a network-based speech processor or a local speech processor. An example local device configured to practice the method, having a local speech processor, and having access to a remote speech processor, receives a request to process speech. The local device can analyze multi-vector context data associated with the request to identify one of the local speech processor and the remote speech processor as an optimal speech processor. Then the local device can process the speech, in response to the request, using the optimal speech processor. If the optimal speech processor is local, then the local device processes the speech. If the optimal speech processor is remote, the local device passes the request and any supporting data to the remote speech processor and waits for a result.
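
As a rough sketch of such a multi-factor decision, the Python below picks a processor from a few context factors. The abstract does not list the factors or how they are weighed, so the fields of `RequestContext`, the thresholds, and the rule order are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class RequestContext:
    # Every field is a hypothetical example of a context factor.
    network_available: bool
    round_trip_ms: float
    battery_fraction: float          # 0.0 (empty) to 1.0 (full)
    local_model_covers_request: bool


def choose_processor(ctx: RequestContext,
                     max_round_trip_ms: float = 300.0,
                     min_battery: float = 0.2) -> str:
    """Return "local" or "remote" using assumed, illustrative rules."""
    if not ctx.network_available:
        return "local"               # the remote processor is unreachable
    if not ctx.local_model_covers_request:
        return "remote"              # the local processor cannot handle the request
    if ctx.battery_fraction < min_battery and ctx.round_trip_ms <= max_round_trip_ms:
        return "remote"              # save battery when the network is fast enough
    return "local"


def process(request: str, ctx: RequestContext,
            local: Callable[[str], str],
            remote: Callable[[str], str]) -> str:
    """Dispatch the request to the chosen processor and return its result."""
    handler = local if choose_processor(ctx) == "local" else remote
    return handler(request)
```

A weighted score over the factors would serve equally well here; fixed rules are used only because they are easy to read.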

    SYSTEM AND METHOD FOR MANAGING MODELS FOR EMBEDDED SPEECH AND LANGUAGE PROCESSING
    5.
    Invention application (status: granted)

    Publication number: US20150120287A1

    Publication date: 2015-04-30

    Application number: US14064579

    Application date: 2013-10-28

    CPC classification number: G10L15/183 G10L15/22 G10L15/30 G10L2015/228

    Abstract: Disclosed herein are systems, methods, and computer-readable storage devices for fetching speech processing models based on context changes in advance of speech requests using the speech processing models. An example local device configured to practice the method, having a local speech processor, and having access to remote speech models, detects a change in context. The change in context can be based on geographical location, language translation, speech in a different language, user language settings, installing or removing an app, and so forth. The local device can determine a speech processing model that is likely to be needed based on the change in context, and that is not stored on the local device. Independently of an explicit request to process speech, the local device can retrieve, from a remote server, the speech processing model for use on the mobile device.
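
The prefetching idea, retrieving a model when the context changes rather than when speech arrives, might look like the Python sketch below. The names `ModelPrefetcher` and `models_for_context`, the context keys, and the model naming scheme are invented for illustration and do not come from the patent.

```python
from typing import Callable, Dict, List


def models_for_context(context: Dict[str, str]) -> List[str]:
    """Hypothetical mapping from the current context to likely-needed model names."""
    names = []
    if "language" in context:
        names.append(f"asr-{context['language']}")
    if "region" in context:
        names.append(f"places-{context['region']}")
    return names


class ModelPrefetcher:
    """Fetches speech models when the context changes, before any speech request."""

    def __init__(self, fetch: Callable[[str], bytes]):
        self._fetch = fetch                   # stand-in for the remote server call
        self.models: Dict[str, bytes] = {}    # models stored on the local device
        self._context: Dict[str, str] = {}

    def on_context_change(self, key: str, value: str) -> List[str]:
        """Record the change and fetch any likely-needed model not stored locally."""
        if self._context.get(key) == value:
            return []                         # nothing actually changed
        self._context[key] = value
        fetched = []
        for name in models_for_context(self._context):
            if name not in self.models:
                self.models[name] = self._fetch(name)
                fetched.append(name)
        return fetched
```

For example, a change of user language setting would call `on_context_change("language", "fr")`, which fetches the assumed `asr-fr` model once and returns an empty list on later identical calls.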

    SYSTEM AND METHOD OF PROVIDING SPEECH PROCESSING IN USER INTERFACE

    Publication number: US20160049151A1

    Publication date: 2016-02-18

    Application number: US14928193

    Application date: 2015-10-30

    CPC classification number: G10L15/26 G06F3/0416 G06F3/162 G10L15/22 G10L15/30

    Abstract: Disclosed are systems, methods and computer-readable media for enabling speech processing in a user interface of a device. The method includes: receiving an indication of a field and a user interface of a device, the indication also signaling that speech will follow; receiving the speech from the user at the device, the speech being associated with the field; transmitting the speech as a request to a public, common network node that receives and processes speech; processing the transmitted speech and returning text associated with the speech to the device; and inserting the text into the field. Upon a second indication from the user, the system processes the text in the field as programmed by the user interface. The present disclosure provides a speech mashup application for a user interface of a mobile or desktop device that does not require expensive speech processing technologies.
