-
公开(公告)号:US20170270159A1
公开(公告)日:2017-09-21
申请号:US14024262
申请日:2013-09-11
Applicant: Google Inc.
Inventor: Bo Wang , Pravir Kumar Gupta , Omer Bar-or , Vishaal Kapoor , David Peter Whipp , Nitin Mangesh Shetti , Michael Buchanan , Bruce Christensen , Cheng Li
IPC: G06F17/30
CPC classification number: G06F16/243 , G06F16/2425
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining query results in response to queries. One of the methods includes obtaining first query results that are responsive to a first query; determining that the first query results do not satisfy a requirement; obtaining one or more modified queries for the first query; selecting a modified query from the one or more modified queries; obtaining second query results that are responsive to the selected modified query; analyzing the second query results and the first query results; determining to provide one or more second query results as a result of the analyzing; and providing the one or more second query results.
-
公开(公告)号:US20180012591A1
公开(公告)日:2018-01-11
申请号:US15711260
申请日:2017-09-21
Applicant: Google Inc.
Inventor: Petar Aleksic , Glen Shires , Michael Buchanan
CPC classification number: G10L15/05 , G06F3/167 , G10L15/04 , G10L15/22 , G10L15/26 , G10L25/78 , G10L2015/088 , G10L2025/783
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data including an utterance, obtaining context data that indicates one or more expected speech recognition results, determining an expected speech recognition result based on the context data, receiving an intermediate speech recognition result generated by a speech recognition engine, comparing the intermediate speech recognition result to the expected speech recognition result for the audio data based on the context data, determining whether the intermediate speech recognition result corresponds to the expected speech recognition result for the audio data based on the context data, and setting an end of speech condition and providing a final speech recognition result in response to determining the intermediate speech recognition result matches the expected speech recognition result, the final speech recognition result including the one or more expected speech recognition results indicated by the context data.
-
公开(公告)号:US20170110118A1
公开(公告)日:2017-04-20
申请号:US14923637
申请日:2015-10-27
Applicant: Google Inc.
Inventor: Siddhi Tadpatrikar , Michael Buchanan , Pravir Kumar Gupta
CPC classification number: G06F16/285 , G06F16/685 , G10L15/04
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing are described. In one aspect, a method includes the action of accessing voice query log data that includes voice queries spoken by a particular user. The actions further include based on the voice query log data that includes voice queries spoken by a particular user, determining a pause threshold from the voice query log data that includes voice queries spoken by the particular user. The actions further include receiving, from the particular user, an utterance. The actions further include determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold. The actions further include based on determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold, processing the utterance as a voice query.
-
公开(公告)号:US20170069309A1
公开(公告)日:2017-03-09
申请号:US15192431
申请日:2016-06-24
Applicant: Google Inc.
Inventor: Petar Aleksic , Glen Shires , Michael Buchanan
CPC classification number: G10L15/05 , G06F3/167 , G10L15/04 , G10L15/22 , G10L15/26 , G10L25/78 , G10L2015/088 , G10L2025/783
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data including an utterance, obtaining context data that indicates one or more expected speech recognition results, determining an expected speech recognition result based on the context data, receiving an intermediate speech recognition result generated by a speech recognition engine, comparing the intermediate speech recognition result to the expected speech recognition result for the audio data based on the context data, determining whether the intermediate speech recognition result corresponds to the expected speech recognition result for the audio data based on the context data, and setting an end of speech condition and providing a final speech recognition result in response to determining the intermediate speech recognition result matches the expected speech recognition result, the final speech recognition result including the one or more expected speech recognition results indicated by the context data.
Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的用于接收包括话语的音频数据的计算机程序,获得指示一个或多个预期语音识别结果的上下文数据,基于上下文数据确定预期语音识别结果, 接收由语音识别引擎产生的中间语音识别结果,根据上下文数据将中间语音识别结果与音频数据的预期语音识别结果进行比较,确定中间语音识别结果是否对应于预期语音识别结果 基于所述上下文数据的所述音频数据,以及响应于确定所述中间语音识别结果匹配所述预期语音识别结果而设置语音结束结束并提供最终语音识别结果,所述最终语音识别结果包括所述一个或多个预期的 语音识别 由上下文数据指示的结果。
-
公开(公告)号:US20160260427A1
公开(公告)日:2016-09-08
申请号:US15156478
申请日:2016-05-17
Applicant: Google Inc.
Inventor: Michael Buchanan , Pravir Kumar Gupta , Christopher Bo Tandiono
CPC classification number: G10L15/05 , G10L15/04 , G10L15/22 , G10L15/26 , G10L17/06 , G10L25/51 , G10L25/78 , G10L25/87 , G10L25/90
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing based on word comparisons are described. In one aspect, a method includes the actions of obtaining a transcription of an utterance. The actions further include determining, as a first value, a quantity of text samples in a collection of text samples that (i) include terms that match the transcription, and (ii) do not include any additional terms. The actions further include determining, as a second value, a quantity of text samples in the collection of text samples that (i) include terms that match the transcription, and (ii) include one or more additional terms. The actions further include classifying the utterance as a likely incomplete utterance or not a likely incomplete utterance based at least on comparing the first value and the second value.
-
公开(公告)号:US20170110116A1
公开(公告)日:2017-04-20
申请号:US15196663
申请日:2016-06-29
Applicant: Google Inc.
Inventor: Siddhi Tadpatrikar , Michael Buchanan , Pravir Kumar Gupta
IPC: G10L15/05 , G06F17/30 , G10L15/065 , G10L15/26 , G10L25/78
CPC classification number: G10L15/05 , G06F17/30746 , G10L15/04 , G10L15/065 , G10L15/07 , G10L15/22 , G10L15/26 , G10L25/78 , G10L2025/783
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing are described. In one aspect, a method includes the action of accessing voice query log data that includes voice queries spoken by a particular user. The actions further include based on the voice query log data that includes voice queries spoken by a particular user, determining a pause threshold from the voice query log data that includes voice queries spoken by the particular user. The actions further include receiving, from the particular user, an utterance. The actions further include determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold. The actions further include based on determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold, processing the utterance as a voice query.
-
公开(公告)号:US09542441B1
公开(公告)日:2017-01-10
申请号:US14935268
申请日:2015-11-06
Applicant: Google Inc.
Inventor: Michael Buchanan , Mark Andrew Paskin , Pravir Kumar Gupta
IPC: G06F17/30
CPC classification number: G06F17/3043 , G06F17/30477 , G06F17/3053 , G06F17/30554 , G06F17/30646 , G06F17/30663 , G06F17/30864
Abstract: The specification relates to a method of receiving a first query and a second query. The method analyzes the second query for a presence of anaphora. If anaphora is present, the method analyzes the first query for a presence of an entity that can be associated with the anaphora. If the analysis analyzing the first query returns two or more associated entities, the method forms a third query wherein the anaphora of the second query is replaced with one of the associated entities and forms a fourth query wherein the anaphora is replaced with the other of the associated entities. The third query and the fourth query are sent to a query-ranking engine. The third query and the fourth query receive a ranking and the higher-ranked query is sent to a search engine.
Abstract translation: 本说明书涉及一种接收第一查询和第二查询的方法。 该方法分析了第二次查询的存在。 如果存在隐喻,则该方法分析第一个查询,以查看可与隐喻相关联的实体的存在。 如果分析第一查询的分析返回两个或更多个相关联的实体,则该方法形成第三查询,其中第二查询的描述被替换为相关联的实体中的一个,并形成第四查询,其中该照明被另一个 关联实体。 第三个查询和第四个查询被发送到查询排名引擎。 第三查询和第四查询接收到排名,并且将较高排名的查询发送到搜索引擎。
-
公开(公告)号:US20150310879A1
公开(公告)日:2015-10-29
申请号:US14681203
申请日:2015-04-08
Applicant: Google Inc.
Inventor: Michael Buchanan , Pravir Kumar Gupta , Christopher Bo Tandiono
CPC classification number: G10L15/05 , G10L15/04 , G10L15/22 , G10L15/26 , G10L17/06 , G10L25/51 , G10L25/78 , G10L25/87 , G10L25/90
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing based on word comparisons are described. In one aspect, a method includes the actions of obtaining a transcription of an utterance. The actions further include determining, as a first value, a quantity of text samples in a collection of text samples that (i) include terms that match the transcription, and (ii) do not include any additional terms. The actions further include determining, as a second value, a quantity of text samples in the collection of text samples that (i) include terms that match the transcription, and (ii) include one or more additional terms. The actions further include classifying the utterance as a likely incomplete utterance or not a likely incomplete utterance based at least on comparing the first value and the second value.
Abstract translation: 描述了包括在计算机存储介质上编码的计算机程序,用于基于词比较的语音终点的方法,系统和装置。 一方面,一种方法包括获得话语转录的动作。 所述动作还包括确定文本样本集合中的文本样本的数量(i)包括与转录匹配的条款,以及(ii)不包括任何附加条款。 所述动作进一步包括将文本样本集合中的文本样本的数量确定为(i)包括与转录匹配的术语,以及(ii)包括一个或多个附加术语。 所述动作进一步包括至少基于比较第一值和第二值,将话语分类为可能不完整的话语,或者不是可能的不完整的话语。
-
公开(公告)号:US20170193111A1
公开(公告)日:2017-07-06
申请号:US14988990
申请日:2016-01-06
Applicant: Google Inc.
Inventor: Christopher Bo Tandiono , Michael Buchanan , Nathan David Howard , Ishai Rabinovitz
CPC classification number: G06F17/30864 , G06F17/30401 , G06F17/30554 , G10L15/26
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data encoding an utterance; obtaining an intermediate transcription of the utterance; before a final transcription of the utterance is obtained: i) determining that the intermediate transcription of the utterance is associated with a previously received search query, ii) obtaining one or more results that are identified as responsive to the previously received search query, and iii) storing one or more of the results; obtaining the final transcription of the utterance; determining that the final transcription of the utterance is also associated with the previously received search query; and in response to determining that the final transcription of the utterance is also associated with the previously received search query, providing the stored one or more results for output.
-
公开(公告)号:US20170069308A1
公开(公告)日:2017-03-09
申请号:US14844563
申请日:2015-09-03
Applicant: Google Inc.
Inventor: Petar Aleksic , Glen Shires , Michael Buchanan
CPC classification number: G10L15/04 , G06F17/2765 , G10L15/18 , G10L2015/228
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data including an utterance, obtaining context data that indicates one or more expected speech recognition results, determining an expected speech recognition result based on the context data, receiving an intermediate speech recognition result generated by a speech recognition engine, comparing the intermediate speech recognition result to the expected speech recognition result for the audio data based on the context data, determining whether the intermediate speech recognition result corresponds to the expected speech recognition result for the audio data based on the context data, and setting an end of speech condition and providing a final speech recognition result in response to determining the intermediate speech recognition result matches the expected speech recognition result, the final speech recognition result including the one or more expected speech recognition results indicated by the context data.
-
-
-
-
-
-
-
-
-