ENHANCED SPEECH ENDPOINTING
    2.
    发明申请

    公开(公告)号:US20180012591A1

    公开(公告)日:2018-01-11

    申请号:US15711260

    申请日:2017-09-21

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data including an utterance, obtaining context data that indicates one or more expected speech recognition results, determining an expected speech recognition result based on the context data, receiving an intermediate speech recognition result generated by a speech recognition engine, comparing the intermediate speech recognition result to the expected speech recognition result for the audio data based on the context data, determining whether the intermediate speech recognition result corresponds to the expected speech recognition result for the audio data based on the context data, and setting an end of speech condition and providing a final speech recognition result in response to determining the intermediate speech recognition result matches the expected speech recognition result, the final speech recognition result including the one or more expected speech recognition results indicated by the context data.

    SPEECH ENDPOINTING
    3.
    发明申请
    SPEECH ENDPOINTING 审中-公开

    公开(公告)号:US20170110118A1

    公开(公告)日:2017-04-20

    申请号:US14923637

    申请日:2015-10-27

    Applicant: Google Inc.

    CPC classification number: G06F16/285 G06F16/685 G10L15/04

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing are described. In one aspect, a method includes the action of accessing voice query log data that includes voice queries spoken by a particular user. The actions further include based on the voice query log data that includes voice queries spoken by a particular user, determining a pause threshold from the voice query log data that includes voice queries spoken by the particular user. The actions further include receiving, from the particular user, an utterance. The actions further include determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold. The actions further include based on determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold, processing the utterance as a voice query.

    ENHANCED SPEECH ENDPOINTING
    4.
    发明申请
    ENHANCED SPEECH ENDPOINTING 审中-公开
    增强语音终点

    公开(公告)号:US20170069309A1

    公开(公告)日:2017-03-09

    申请号:US15192431

    申请日:2016-06-24

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data including an utterance, obtaining context data that indicates one or more expected speech recognition results, determining an expected speech recognition result based on the context data, receiving an intermediate speech recognition result generated by a speech recognition engine, comparing the intermediate speech recognition result to the expected speech recognition result for the audio data based on the context data, determining whether the intermediate speech recognition result corresponds to the expected speech recognition result for the audio data based on the context data, and setting an end of speech condition and providing a final speech recognition result in response to determining the intermediate speech recognition result matches the expected speech recognition result, the final speech recognition result including the one or more expected speech recognition results indicated by the context data.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的用于接收包括话语的音频数据的计算机程序,获得指示一个或多个预期语音识别结果的上下文数据,基于上下文数据确定预期语音识别结果, 接收由语音识别引擎产生的中间语音识别结果,根据上下文数据将中间语音识别结果与音频数据的预期语音识别结果进行比较,确定中间语音识别结果是否对应于预期语音识别结果 基于所述上下文数据的所述音频数据,以及响应于确定所述中间语音识别结果匹配所述预期语音识别结果而设置语音结束结束并提供最终语音识别结果,所述最终语音识别结果包括所述一个或多个预期的 语音识别 由上下文数据指示的结果。

    SPEECH ENDPOINTING BASED ON WORD COMPARISONS

    公开(公告)号:US20160260427A1

    公开(公告)日:2016-09-08

    申请号:US15156478

    申请日:2016-05-17

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing based on word comparisons are described. In one aspect, a method includes the actions of obtaining a transcription of an utterance. The actions further include determining, as a first value, a quantity of text samples in a collection of text samples that (i) include terms that match the transcription, and (ii) do not include any additional terms. The actions further include determining, as a second value, a quantity of text samples in the collection of text samples that (i) include terms that match the transcription, and (ii) include one or more additional terms. The actions further include classifying the utterance as a likely incomplete utterance or not a likely incomplete utterance based at least on comparing the first value and the second value.

    SPEECH ENDPOINTING
    6.
    发明申请
    SPEECH ENDPOINTING 审中-公开

    公开(公告)号:US20170110116A1

    公开(公告)日:2017-04-20

    申请号:US15196663

    申请日:2016-06-29

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing are described. In one aspect, a method includes the action of accessing voice query log data that includes voice queries spoken by a particular user. The actions further include based on the voice query log data that includes voice queries spoken by a particular user, determining a pause threshold from the voice query log data that includes voice queries spoken by the particular user. The actions further include receiving, from the particular user, an utterance. The actions further include determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold. The actions further include based on determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold, processing the utterance as a voice query.

    Using web ranking to resolve anaphora
    7.
    发明授权
    Using web ranking to resolve anaphora 有权
    使用网页排名来解析暗喻

    公开(公告)号:US09542441B1

    公开(公告)日:2017-01-10

    申请号:US14935268

    申请日:2015-11-06

    Applicant: Google Inc.

    Abstract: The specification relates to a method of receiving a first query and a second query. The method analyzes the second query for a presence of anaphora. If anaphora is present, the method analyzes the first query for a presence of an entity that can be associated with the anaphora. If the analysis analyzing the first query returns two or more associated entities, the method forms a third query wherein the anaphora of the second query is replaced with one of the associated entities and forms a fourth query wherein the anaphora is replaced with the other of the associated entities. The third query and the fourth query are sent to a query-ranking engine. The third query and the fourth query receive a ranking and the higher-ranked query is sent to a search engine.

    Abstract translation: 本说明书涉及一种接收第一查询和第二查询的方法。 该方法分析了第二次查询的存在。 如果存在隐喻,则该方法分析第一个查询,以查看可与隐喻相关联的实体的存在。 如果分析第一查询的分析返回两个或更多个相关联的实体,则该方法形成第三查询,其中第二查询的描述被替换为相关联的实体中的一个,并形成第四查询,其中该照明被另一个 关联实体。 第三个查询和第四个查询被发送到查询排名引擎。 第三查询和第四查询接收到排名,并且将较高排名的查询发送到搜索引擎。

    SPEECH ENDPOINTING BASED ON WORD COMPARISONS
    8.
    发明申请
    SPEECH ENDPOINTING BASED ON WORD COMPARISONS 有权
    基于词语比较的语音终点

    公开(公告)号:US20150310879A1

    公开(公告)日:2015-10-29

    申请号:US14681203

    申请日:2015-04-08

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing based on word comparisons are described. In one aspect, a method includes the actions of obtaining a transcription of an utterance. The actions further include determining, as a first value, a quantity of text samples in a collection of text samples that (i) include terms that match the transcription, and (ii) do not include any additional terms. The actions further include determining, as a second value, a quantity of text samples in the collection of text samples that (i) include terms that match the transcription, and (ii) include one or more additional terms. The actions further include classifying the utterance as a likely incomplete utterance or not a likely incomplete utterance based at least on comparing the first value and the second value.

    Abstract translation: 描述了包括在计算机存储介质上编码的计算机程序,用于基于词比较的语音终点的方法,系统和装置。 一方面,一种方法包括获得话语转录的动作。 所述动作还包括确定文本样本集合中的文本样本的数量(i)包括与转录匹配的条款,以及(ii)不包括任何附加条款。 所述动作进一步包括将文本样本集合中的文本样本的数量确定为(i)包括与转录匹配的术语,以及(ii)包括一个或多个附加术语。 所述动作进一步包括至少基于比较第一值和第二值,将话语分类为可能不完整的话语,或者不是可能的不完整的话语。

    SEARCH RESULT PREFETCHING OF VOICE QUERIES
    9.
    发明申请

    公开(公告)号:US20170193111A1

    公开(公告)日:2017-07-06

    申请号:US14988990

    申请日:2016-01-06

    Applicant: Google Inc.

    CPC classification number: G06F17/30864 G06F17/30401 G06F17/30554 G10L15/26

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data encoding an utterance; obtaining an intermediate transcription of the utterance; before a final transcription of the utterance is obtained: i) determining that the intermediate transcription of the utterance is associated with a previously received search query, ii) obtaining one or more results that are identified as responsive to the previously received search query, and iii) storing one or more of the results; obtaining the final transcription of the utterance; determining that the final transcription of the utterance is also associated with the previously received search query; and in response to determining that the final transcription of the utterance is also associated with the previously received search query, providing the stored one or more results for output.

    ENHANCED SPEECH ENDPOINTING
    10.
    发明申请

    公开(公告)号:US20170069308A1

    公开(公告)日:2017-03-09

    申请号:US14844563

    申请日:2015-09-03

    Applicant: Google Inc.

    CPC classification number: G10L15/04 G06F17/2765 G10L15/18 G10L2015/228

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data including an utterance, obtaining context data that indicates one or more expected speech recognition results, determining an expected speech recognition result based on the context data, receiving an intermediate speech recognition result generated by a speech recognition engine, comparing the intermediate speech recognition result to the expected speech recognition result for the audio data based on the context data, determining whether the intermediate speech recognition result corresponds to the expected speech recognition result for the audio data based on the context data, and setting an end of speech condition and providing a final speech recognition result in response to determining the intermediate speech recognition result matches the expected speech recognition result, the final speech recognition result including the one or more expected speech recognition results indicated by the context data.

Patent Agency Ranking