Dual model speaker identification
    61.
    发明授权

    公开(公告)号:US09711148B1

    公开(公告)日:2017-07-18

    申请号:US13944975

    申请日:2013-07-18

    Applicant: Google Inc.

    CPC classification number: G10L17/02 G10L17/10 G10L17/22

    Abstract: A processing system receives an audio signal encoding an utterance and determines that a first portion of the audio signal corresponds to a predefined phrase. The processing system accesses one or more text-dependent models associated with the predefined phrase and determines a first confidence based on the one or more text-dependent models associated with the predefined phrase, the first confidence corresponding to a first likelihood that a particular speaker spoke the utterance. The processing system determines a second confidence for a second portion of the audio signal using one or more text-independent models, the second confidence corresponding to a second likelihood that the particular speaker spoke the utterance. The processing system then determines that the particular speaker spoke the utterance based at least in part on the first confidence and the second confidence.

    Wireless signal forwarding
    63.
    发明授权

    公开(公告)号:US09699597B2

    公开(公告)日:2017-07-04

    申请号:US14961803

    申请日:2015-12-07

    Applicant: GOOGLE INC.

    CPC classification number: H04W4/80 G06Q20/3278 H04B5/0031 H04W40/244

    Abstract: Forwarding wireless signals comprises a user and a counterpart opening secure applications on a user computing device and a counterpart computing device, respectively. The user places the user computing device within range of a wireless signal, such as a wireless signal provided by a point of sale (“POS”) terminal. The user computing device forwards the wireless signal from the POS terminal to the counterpart computing device. The user computing device forwards the wireless signal from the counterpart computing device to the POS terminal. Thus, the counterpart computing device may conduct a transaction with the POS terminal as if the counterpart computing device were at the location of the POS terminal. The counterpart computing device may also receive a forwarded beacon signal comprising data, such as an offer, provided by the POS terminal or another suitable beacon transmission device at the merchant location.

    WIRELESS SIGNAL FORWARDING
    64.
    发明申请

    公开(公告)号:US20170164139A1

    公开(公告)日:2017-06-08

    申请号:US14961803

    申请日:2015-12-07

    Applicant: GOOGLE INC.

    CPC classification number: H04W4/80 G06Q20/3278 H04B5/0031 H04W40/244

    Abstract: Forwarding wireless signals comprises a user and a counterpart opening secure applications on a user computing device and a counterpart computing device, respectively. The user places the user computing device within range of a wireless signal, such as a wireless signal provided by a point of sale (“POS”) terminal. The user computing device forwards the wireless signal from the POS terminal to the counterpart computing device. The user computing device forwards the wireless signal from the counterpart computing device to the POS terminal. Thus, the counterpart computing device may conduct a transaction with the POS terminal as if the counterpart computing device were at the location of the POS terminal. The counterpart computing device may also receive a forwarded beacon signal comprising data, such as an offer, provided by the POS terminal or another suitable beacon transmission device at the merchant location.

    PROMOTING VOICE ACTIONS TO HOTWORDS
    66.
    发明申请

    公开(公告)号:US20170116988A1

    公开(公告)日:2017-04-27

    申请号:US15365334

    申请日:2016-11-30

    Applicant: Google Inc.

    Inventor: Matthew Sharifi

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for designating certain voice commands as hotwords. The methods, systems, and apparatus include actions of receiving a hotword followed by a voice command. Additional actions include determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, where a voice command that is designated as a hotword is treated as a voice input regardless of whether the voice command is preceded by another hotword. Further actions include, in response to determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, designating the voice command as a hotword.

    Drag-and-drop on a mobile device
    68.
    发明授权

    公开(公告)号:US09606716B2

    公开(公告)日:2017-03-28

    申请号:US14522927

    申请日:2014-10-24

    Applicant: GOOGLE INC.

    Abstract: Implementations provide an improved drag-and-drop operation on a mobile device. For example, a method includes identifying a drag area in a user interface of a first mobile application in response to a drag command, identifying an entity from a data store based on recognition performed on content in the drag area, receiving a drop location associated with a second mobile application, determining an action to perform in the second mobile application based on the drop location, and performing the action in the second mobile action using the entity. Another method may include receiving a selection of a smart copy control for a text input control in a first mobile application, receiving a selected area of a display generated by a second mobile application, identifying an entity in the selected area, automatically navigating back to the text input control, and pasting a description of the entity in the text input control.

    Text-dependent speaker identification
    70.
    发明授权
    Text-dependent speaker identification 有权
    文字相关的扬声器识别

    公开(公告)号:US09542948B2

    公开(公告)日:2017-01-10

    申请号:US14612830

    申请日:2015-02-03

    Applicant: Google Inc.

    CPC classification number: G10L17/18 G10L17/005

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speaker verification. The methods, systems, and apparatus include actions of inputting speech data that corresponds to a particular utterance to a first neural network and determining an evaluation vector based on output at a hidden layer of the first neural network. Additional actions include obtaining a reference vector that corresponds to a past utterance of a particular speaker. Further actions include inputting the evaluation vector and the reference vector to a second neural network that is trained on a set of labeled pairs of feature vectors to identify whether speakers associated with the labeled pairs of feature vectors are the same speaker. More actions include determining, based on an output of the second neural network, whether the particular utterance was likely spoken by the particular speaker.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的用于说话者验证的计算机程序。 方法,系统和装置包括将对应于特定话语的语音数据输入到第一神经网络并基于第一神经网络的隐藏层处的输出来确定评估向量的动作。 附加动作包括获得对应于特定说话者的过去话语的参考矢量。 进一步的动作包括将评估向量和参考矢量输入到第二神经网络,该第二神经网络被训练在一组标记的特征矢量对上,以识别与标记的特征矢量对相关联的扬声器是否是相同的扬声器。 更多的动作包括基于第二神经网络的输出确定特定话语是否可能由特定说话者说出。

Patent Agency Ranking