Providing pre-computed hotword models
    81.
    发明授权
    Providing pre-computed hotword models 有权
    提供预先计算的词典模型

    公开(公告)号:US09263042B1

    公开(公告)日:2016-02-16

    申请号:US14340833

    申请日:2014-07-25

    Applicant: Google Inc.

    Inventor: Matthew Sharifi

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for obtaining, for each of multiple words or sub-words, audio data corresponding to multiple users speaking the word or sub-word; training, for each of the multiple words or sub-words, a pre-computed hotword model for the word or sub-word based on the audio data for the word or sub-word; receiving a candidate hotword from a computing device; identifying one or more pre-computed hotword models that correspond to the candidate hotword; and providing the identified, pre-computed hotword models to the computing device.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于为多个单词或子单词中的每一个获得对应于说话单词或子单词的多个用户的音频数据; 针对所述多个单词或子单词中的每一个的训练,基于所述单词或子单词的音频数据的用于所述单词或子单词的预先计算的词典模型; 从计算设备接收候选词; 识别与所述候选词语对应的一个或多个预先计算的词典模型; 以及将所识别的预先计算的热词模型提供给所述计算设备。

    Reference signal suppression in speech recognition
    82.
    发明授权
    Reference signal suppression in speech recognition 有权
    语音识别中的参考信号抑制

    公开(公告)号:US09240183B2

    公开(公告)日:2016-01-19

    申请号:US14181374

    申请日:2014-02-14

    Applicant: Google Inc.

    Abstract: The technology described herein can be embodied in a method that includes receiving a first signal representing an output of a speaker device, and a second signal comprising the output of the speaker device, and an audio signal corresponding to an utterance of a speaker. The method includes aligning one or more segments of the first signal with one or more segments of the second signal. Acoustic features of the one or more segments of the first and second signals are classified to obtain a first set of vectors and a second set of vectors, respectively, the vectors being associated with speech units. The second set is modified using the first set, such that the modified second set represents a suppression of the output of the speaker device in the second signal. A transcription of the utterance of the speaker can be generated from the modified second set of vectors.

    Abstract translation: 本文描述的技术可以以包括接收表示扬声器装置的输出的第一信号和包括扬声器装置的输出的第二信号以及对应于说话者发声的音频信号的方法来实现。 该方法包括将第一信号的一个或多个段对准第二信号的一个或多个段。 第一和第二信号的一个或多个段的声学特征被分类以分别获得与语音单元相关联的向量的第一组向量和第二组向量。 使用第一组修改第二组,使得修改的第二组表示抑制第二信号中的扬声器设备的输出。 可以从修改的第二组向量生成说话者的话语的转录。

    Pitch shift and time stretch resistant audio matching
    83.
    发明授权
    Pitch shift and time stretch resistant audio matching 有权
    音调偏移和时间舒适的音频匹配

    公开(公告)号:US09213703B1

    公开(公告)日:2015-12-15

    申请号:US13670453

    申请日:2012-11-06

    Applicant: Google Inc.

    Abstract: Systems and methods are provided herein relating to audio matching. Descriptors can be generated based on anchor points and interest points that characterize the local neighborhood surrounding the anchor point. Characterizing the local spectrogram neighborhood surrounding anchor points can be more robust to pitch shift distortions and time stretch distortions. Those anchor points surrounded by a lack of spectral activity or even spectral activity can be filtered from further examination. Using these pitch shift and time stretch resistant audio features within descriptors can provide for more accurate and efficient audio matching.

    Abstract translation: 本文提供了与音频匹配有关的系统和方法。 描述符可以基于表征锚点周围的局部邻域的锚点和兴趣点来生成。 表征锚点附近的局部光谱图邻域对于俯仰偏移失真和时间拉伸失真可以更加鲁棒。 由光谱活动不足或光谱活动所包围的那些锚点可以从进一步的检查中滤除。 在描述符中使用这些音调移位和时间延伸的音频特征可以提供更精确和高效的音频匹配。

    REFERENCE SIGNAL SUPPRESSION IN SPEECH RECOGNITION
    84.
    发明申请
    REFERENCE SIGNAL SUPPRESSION IN SPEECH RECOGNITION 有权
    语音识别中的参考信号抑制

    公开(公告)号:US20150235651A1

    公开(公告)日:2015-08-20

    申请号:US14181374

    申请日:2014-02-14

    Applicant: Google Inc.

    Abstract: The technology described herein can be embodied in a method that includes receiving a first signal representing an output of a speaker device, and a second signal comprising the output of the speaker device, and an audio signal corresponding to an utterance of a speaker. The method includes aligning one or more segments of the first signal with one or more segments of the second signal. Acoustic features of the one or more segments of the first and second signals are classified to obtain a first set of vectors and a second set of vectors, respectively, the vectors being associated with speech units. The second set is modified using the first set, such that the modified second set represents a suppression of the output of the speaker device in the second signal. A transcription of the utterance of the speaker can be generated from the modified second set of vectors.

    Abstract translation: 本文描述的技术可以以包括接收表示扬声器装置的输出的第一信号和包括扬声器装置的输出的第二信号以及对应于扬声器发声的音频信号的方法来实现。 该方法包括将第一信号的一个或多个段对准第二信号的一个或多个段。 第一和第二信号的一个或多个段的声学特征被分类以分别获得与语音单元相关联的向量的第一组向量和第二组向量。 使用第一组修改第二组,使得修改的第二组表示抑制第二信号中的扬声器设备的输出。 可以从修改的第二组向量生成说话者的话语的转录。

    PROMOTING VOICE ACTIONS TO HOTWORDS
    85.
    发明申请
    PROMOTING VOICE ACTIONS TO HOTWORDS 有权
    促进对热点的声音行动

    公开(公告)号:US20150161990A1

    公开(公告)日:2015-06-11

    申请号:US14221520

    申请日:2014-03-21

    Applicant: Google Inc.

    Inventor: Matthew Sharifi

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for designating certain voice commands as hotwords. The methods, systems, and apparatus include actions of receiving a hotword followed by a voice command. Additional actions include determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, where a voice command that is designated as a hotword is treated as a voice input regardless of whether the voice command is preceded by another hotword. Further actions include, in response to determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, designating the voice command as a hotword.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于将某些语音命令指定为热词。 方法,系统和装置包括接收随后的语音命令的热门词汇的动作。 附加动作包括确定语音命令满足与指定语音命令相关联的一个或多个预定准则作为热门词,其中指定为热门词汇的语音命令被视为语音输入,而不管语音命令是否在另一个之前 热门词 响应于确定语音命令满足与指定语音命令相关联的一个或多个预定标准作为热门词,响应于指定语音命令作为热门词语。

    Interactive game based on user generated music content
    86.
    发明授权
    Interactive game based on user generated music content 有权
    基于用户生成的音乐内容的互动游戏

    公开(公告)号:US08895830B1

    公开(公告)日:2014-11-25

    申请号:US13647345

    申请日:2012-10-08

    Applicant: Google Inc.

    CPC classification number: A63F13/814 A63F13/65 A63F2300/5533

    Abstract: Systems and methods are provided herein relating to interactive gaming within a media sharing service. Game data, such as sets of notes extracted from the audio track of user generated videos or from audio samples, can be generated based on videos containing musical content or from audio content. A device can use the game data to facilitate an interactive game during playback of the user generated videos or audio samples. Players can press buttons, for example, corresponding to notes as the video with musical content is played within the game interface. Players can be scored for accuracy, and can play with other players in a multiplayer environment. In this sense, user generated video content or audio content can be transformed and used within a gaming interface to increase interaction and engagement between users in a media sharing service.

    Abstract translation: 本文提供了与媒体共享服务内的交互式游戏相关的系统和方法。 可以基于包含音乐内容的视频或从音频内容生成诸如从用户生成的视频的音轨或从音频样本中提取的音符集的游戏数据。 设备可以使用游戏数据来促进在播放用户生成的视频或音频样本期间的交互式游戏。 播放器可以按钮,例如对应于音符,因为具有音乐内容的视频在游戏界面内播放。 玩家可以获得准确的得分,并且可以在多人游戏环境中与其他玩家玩耍。 在这个意义上,用户生成的视频内容或音频内容可以在游戏界面中进行变换和使用,以增加媒体共享服务中用户之间的互动和互动。

    Speech endpointing based on voice profile
    87.
    发明授权
    Speech endpointing based on voice profile 有权
    基于语音配置文件的语音终点

    公开(公告)号:US08843369B1

    公开(公告)日:2014-09-23

    申请号:US14142399

    申请日:2013-12-27

    Applicant: Google Inc.

    Inventor: Matthew Sharifi

    CPC classification number: G10L25/87 G10L25/03

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing based on a voice profile. In one aspect, a method includes the actions of receiving audio data corresponding to an utterance spoken by a particular user. The actions further include generating a voice profile for the particular user using at least a portion of the audio data. The actions further include determining in the audio data a beginning point or an ending point of the utterance based at least in part on the voice profile for the particular user. The actions further include based on the beginning point, the ending point, or both the beginning point and the ending point, outputting data indicating the utterance.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于基于语音简档的语音终点。 一方面,一种方法包括接收对应于特定用户所说话语的音频数据的动作。 所述动作还包括使用所述音频数据的至少一部分来生成所述特定用户的语音简档。 动作还包括至少部分地基于特定用户的语音简档来确定音频数据中的话语的起点或终点。 动作进一步包括基于起始点,终点或起始点和终点两者,输出表示说话的数据。

    SYSTEM TO INTEGRATE REAL-WORLD OBJECTS INTO SOCIAL NETWORKS
    88.
    发明申请
    SYSTEM TO INTEGRATE REAL-WORLD OBJECTS INTO SOCIAL NETWORKS 审中-公开
    将真实世界对象整合到社会网络中的系统

    公开(公告)号:US20140188742A1

    公开(公告)日:2014-07-03

    申请号:US13843144

    申请日:2013-03-15

    Applicant: Google Inc.

    CPC classification number: G06Q30/0282 G06Q50/01

    Abstract: A method for displaying an aggregate count of endorsements is provided, including the following method operations: processing a request for an online resource from a mobile device, the online resource being associated with an object, the online resource including an endorsement mechanism; sending the online resource to the mobile device; processing an input from a user triggering the endorsement mechanism, to define an endorsement of the object by the user; updating an aggregate count of endorsements of the object to include the endorsement of the object by the user; sending the updated aggregate count of endorsements to the social display device for display on the social display device.

    Abstract translation: 提供一种用于显示认可的总计数的方法,包括以下方法操作:处理来自移动设备的在线资源的请求,所述在线资源与对象相关联,所述在线资源包括认可机制; 将在线资源发送到移动设备; 处理来自用户触发认可机制的输入,以定义用户对该对象的认可; 更新对象的认可的总计数以包括用户对对象的认可; 将更新的认可总计数发送到社交显示设备以在社交显示设备上显示。

    IDENTIFYING MEDIA CONTENT
    89.
    发明申请

    公开(公告)号:US20140114659A1

    公开(公告)日:2014-04-24

    申请号:US14142042

    申请日:2013-12-27

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving (i) audio data that encodes a spoken natural language query, and (ii) environmental audio data, obtaining a transcription of the spoken natural language query, determining a particular content type associated with one or more keywords in the transcription, providing at least a portion of the environmental audio data to a content recognition engine, and identifying a content item that has been output by the content recognition engine, and that matches the particular content type.

    Action suggestions for user-selected content

    公开(公告)号:US10970646B2

    公开(公告)日:2021-04-06

    申请号:US14872582

    申请日:2015-10-01

    Applicant: GOOGLE INC.

    Abstract: Systems and methods are provided for suggesting actions for selected text based on content displayed on a mobile device. An example method can include converting a selection made via a display device into a query, providing the query to an action suggestion model that is trained to predict an action given a query, each action being associated with a mobile application, receiving one or more predicted actions, and initiating display of the one or more predicted actions on the display device. Another example method can include identifying, from search records, queries where a website is highly ranked, the website being one of a plurality of websites in a mapping of websites to mobile applications. The method can also include generating positive training examples for an action suggestion model from the identified queries, and training the action suggestion model using the positive training examples.

Patent Agency Ranking