Method for supporting dynamic grammars in WFST-based ASR
    11.
    Granted invention patent (in force)

    Publication No.: US09502031B2

    Publication date: 2016-11-22

    Application No.: US14494305

    Filing date: 2014-09-23

    Applicant: Apple Inc.

    Abstract: Systems and processes are disclosed for recognizing speech using a weighted finite state transducer (WFST) approach. Dynamic grammars can be supported by constructing the final recognition cascade during runtime using difference grammars. In a first grammar, non-terminals can be replaced with a weighted phone loop that produces sequences of mono-phone words. In a second grammar, at runtime, non-terminals can be replaced with sub-grammars derived from user-specific usage data including contact, media, and application lists. Interaction frequencies associated with these entities can be used to weight certain words over others. With all non-terminals replaced, a static recognition cascade with the first grammar can be composed with the personalized second grammar to produce a user-specific WFST. User speech can then be processed to generate candidate words having associated probabilities, and the likeliest result can be output.
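
The frequency-weighting step in the abstract above can be illustrated with a minimal sketch. It is not the patented implementation: it uses plain Python dictionaries rather than a real WFST library, and the names `subgrammar_weights`, `expand_nonterminal`, the `$ContactList` non-terminal, the add-one smoothing, and the sample counts are all hypothetical. It only shows how interaction counts could become negative-log weights on the entries that replace a non-terminal, so that frequently used entities get cheaper paths.

```python
import math


def subgrammar_weights(interaction_counts, smoothing=1.0):
    """Turn per-entity interaction counts into negative-log-probability weights.

    Assumption: additive smoothing, so entities with zero interactions
    still receive a finite weight. Lower weight means more likely.
    """
    total = sum(interaction_counts.values()) + smoothing * len(interaction_counts)
    return {
        name: -math.log((count + smoothing) / total)
        for name, count in interaction_counts.items()
    }


def expand_nonterminal(template, nonterminal, weights):
    """Replace one non-terminal in a word template with every weighted entry.

    Returns (word sequence, weight) pairs sorted from cheapest to costliest.
    """
    return sorted(
        ((template.replace(nonterminal, name), w) for name, w in weights.items()),
        key=lambda pair: pair[1],
    )


# Hypothetical usage data: how often the user interacted with each contact.
contacts = {"Alice": 8, "Bob": 1, "Carol": 0}
weights = subgrammar_weights(contacts)
paths = expand_nonterminal("call $ContactList", "$ContactList", weights)

for words, weight in paths:
    print(f"{words}\t{weight:.3f}")
```

In a real system these weights would sit on arcs of the sub-grammar transducer that is composed with the static cascade at runtime; the dictionary here stands in for that structure.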


    User-specific acoustic models
    12.
    Granted invention patent

    Publication No.: US11580990B2

    Publication date: 2023-02-14

    Application No.: US17349758

    Filing date: 2021-06-16

    Applicant: Apple Inc.

    Abstract: Systems and processes for providing user-specific acoustic models are provided. In accordance with one example, a method includes, at an electronic device having one or more processors, receiving a plurality of speech inputs, each of the speech inputs associated with a same user of the electronic device; providing each of the plurality of speech inputs to a user-independent acoustic model, the user-independent acoustic model providing a plurality of speech results based on the plurality of speech inputs; initiating a user-specific acoustic model on the electronic device; and adjusting the user-specific acoustic model based on the plurality of speech inputs and the plurality of speech results.

    Correction and completion of search queries

    Publication No.: US11294944B2

    Publication date: 2022-04-05

    Application No.: US16147565

    Filing date: 2018-09-28

    Applicant: Apple Inc.

    Abstract: Aspects of subject technology provide systems and methods for simultaneously spell-correcting and completing partial search queries being entered by a user on the user's electronic device. An apparatus such as a computing device may receive partial search queries from the user's electronic device as each character of the partial search query is entered by the user. The apparatus may utilize a machine-learning model to generate suggested queries that include spelling-corrected versions of the received partial query, query completion suggestions for the partial query, and/or spelling-corrected completion suggestions for the partial query.

    Implicit identification of translation payload with neural machine translation

    Publication No.: US10909331B2

    Publication date: 2021-02-02

    Application No.: US16024475

    Filing date: 2018-06-29

    Applicant: Apple Inc.

    Abstract: Systems and processes for operating an electronic device to train a machine-learning translation system are described. In one process, a first set of training data is obtained. The first set of training data includes at least one payload in a first language and a translation of the at least one payload in a second language. The process further includes obtaining one or more templates for adapting the at least one payload; adapting the at least one payload using the one or more templates to generate at least one adapted payload formulated as a translation request; generating a second set of training data based on the at least one adapted payload; and training the machine-learning translation system using the second set of training data.
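
The template-adaptation step described in the abstract above can be sketched as follows. This is an illustration under stated assumptions, not the patented process: the function name `adapt_payloads`, the `{payload}` placeholder syntax, the example templates, and the choice to pair each adapted request with the original translation are all hypothetical.

```python
def adapt_payloads(pairs, templates):
    """Build a second training set by wrapping each source payload in a request template.

    pairs: list of (payload in first language, translation in second language).
    templates: format strings containing a "{payload}" placeholder (assumed syntax).
    Assumption: each adapted request keeps the original translation as its target,
    so a model trained on the result learns to translate only the payload.
    """
    examples = []
    for source_payload, target_payload in pairs:
        for template in templates:
            request = template.format(payload=source_payload)
            examples.append((request, target_payload))
    return examples


# Hypothetical first training set and templates.
first_set = [("good morning", "buenos días")]
templates = [
    "How do you say {payload} in Spanish?",
    "Translate {payload} to Spanish",
]
second_set = adapt_payloads(first_set, templates)

for request, target in second_set:
    print(request, "->", target)
```

Each payload yields one adapted example per template, so the second set grows as the product of the two list lengths.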

    Automatic speech recognition based on user feedback

    Publication No.: US10446141B2

    Publication date: 2019-10-15

    Application No.: US14591754

    Filing date: 2015-01-07

    Applicant: Apple Inc.

    Abstract: Systems and processes for processing speech in a digital assistant are provided. In one example process, a first speech input can be received from a user. The first speech input can be processed using a first automatic speech recognition system to produce a first recognition result. An input indicative of a potential error in the first recognition result can be received. The input can be used to improve the first recognition result. For example, the input can include a second speech input that is a repetition of the first speech input. The second speech input can be processed using a second automatic speech recognition system to produce a second recognition result.
