-
Publication No.: US20140229469A1
Publication date: 2014-08-14
Application No.: US14259241
Filing date: 2014-04-23
Applicant: Google Inc.
Inventor: Michael J. LeBeau , John Nicholas Jitkoff , William J. Byrne
IPC: G06F17/30
CPC classification number: G06F16/9537 , G06F16/24578 , G06F16/951 , G06F16/953 , G06F16/957 , G10L15/25 , H04M1/72561 , H04M3/4931 , H04M3/4935 , H04M2201/40 , H04M2242/15
Abstract: In general, the subject matter described in this specification can be embodied in methods, systems, and program products for providing search results automatically to a user of a computing device. A spoken input provided by a user to a computing device is received. The spoken input is transmitted to a computer server system that is remote from the computing device. Search result information that is responsive to the spoken input is received by the computing device in response to the transmitted spoken input. An alert is provided to the user that the device will connect the user to a target of the search result information if the user does not intervene to stop the connection. The user is connected to the target of the search result information based on a determination that the user has not intervened to stop the connection.
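The alert-then-connect step in this abstract can be illustrated with a small sketch. It is not taken from the patent: the function name `connect_unless_cancelled`, the `alert` and `connect` callbacks, and the `grace_seconds` window are all hypothetical, and the "intervention" is modelled as a `threading.Event` that some UI handler would set.

```python
import threading
from typing import Callable


def connect_unless_cancelled(
    target: str,
    cancelled: threading.Event,
    alert: Callable[[str], None],
    connect: Callable[[str], None],
    grace_seconds: float = 3.0,
) -> bool:
    """Alert the user, wait, and connect only if the user did not intervene.

    All names here are illustrative assumptions, not the patented design.
    Returns True if the connection was made, False if it was cancelled.
    """
    alert(f"Connecting to {target} in {grace_seconds:g} s unless you cancel")
    # Event.wait returns True if the event was set before the timeout expired,
    # which this sketch treats as the user intervening.
    if cancelled.wait(timeout=grace_seconds):
        return False
    connect(target)
    return True
```

A caller would pass the event to whatever handles the cancel button or voice command, so that setting it during the grace period suppresses the connection.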
-
Publication No.: US20140229185A1
Publication date: 2014-08-14
Application No.: US14252913
Filing date: 2014-04-15
Applicant: Google Inc.
Inventor: William J. Byrne , Alexander H. Gruenstein , Douglas H. Beeferman
IPC: G10L15/06
CPC classification number: G10L15/22 , G06F3/04842 , G06F3/167 , G06F17/2795 , G10L15/02 , G10L15/063 , G10L15/1822 , G10L15/26 , G10L2015/0631 , G10L2015/0635 , G10L2015/0638 , G10L2015/221
Abstract: Predicting and learning users' intended actions on an electronic device based on free-form speech input. Users' actions can be monitored to develop a list of carrier phrases having one or more actions that correspond to the carrier phrases. A user can speak a command into a device to initiate an action. The spoken command can be parsed and compared to a list of carrier phrases. If the spoken command matches one of the known carrier phrases, the corresponding action(s) can be presented to the user for selection. If the spoken command does not match one of the known carrier phrases, search results (e.g., Internet search results) corresponding to the spoken command can be presented to the user. The actions of the user in response to the presented action(s) and/or the search results can be monitored to update the list of carrier phrases.
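As a rough illustration of the match, present, and update loop described in this abstract, the sketch below keeps a table of carrier phrases with per-action counts. It makes assumptions the abstract does not state: a carrier phrase is treated as a leading run of words in the command, actions are plain strings, and the class and method names (`CarrierPhraseTable`, `match`, `suggest`, `record`) are hypothetical.

```python
from collections import defaultdict


class CarrierPhraseTable:
    """Toy carrier-phrase list; an assumption-laden sketch, not the patented method."""

    def __init__(self):
        # carrier phrase -> action -> number of times the user chose it
        self._counts = defaultdict(lambda: defaultdict(int))

    def match(self, command):
        """Return (phrase, remainder) for the longest known leading phrase, else None."""
        words = command.lower().split()
        for n in range(len(words), 0, -1):
            phrase = " ".join(words[:n])
            if phrase in self._counts:
                return phrase, " ".join(words[n:])
        return None

    def suggest(self, command):
        """Actions to present for a command, most frequently chosen first.

        An empty list means no carrier phrase matched, so the caller would
        fall back to presenting search results.
        """
        hit = self.match(command)
        if hit is None:
            return []
        actions = self._counts[hit[0]]
        return sorted(actions, key=lambda a: (-actions[a], a))

    def record(self, phrase, action):
        """Update the table after observing which action the user took."""
        self._counts[phrase.lower()][action] += 1
```

Calling `record` after each observed user choice is what lets later calls to `suggest` rank that action higher for the same phrase.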
-
Publication No.: US09894460B1
Publication date: 2018-02-13
Application No.: US15196429
Filing date: 2016-06-29
Applicant: Google Inc.
Inventor: Michael J. LeBeau , John Nicholas Jitkoff , William J. Byrne
CPC classification number: H04W4/50 , G10L15/08 , G10L15/1822 , G10L15/26 , G10L2015/223 , H04L67/02 , H04L67/20 , H04M1/271
Abstract: In general, the subject matter described in this specification can be embodied in methods, systems, and program products for receiving a voice query at a mobile computing device and generating data that represents content of the voice query. The data is provided to a server system. A textual query that has been determined by a speech recognizer at the server system to be a textual form of at least part of the data is received at the mobile computing device. The textual query is determined to include a carrier phrase of one or more words that is reserved by a first third-party application program installed on the computing device. The first third-party application is selected, from a group of one or more third-party applications, to receive all or a part of the textual query. All or a part of the textual query is provided to the selected first application program.
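The selection step in this abstract, choosing the application that reserved a carrier phrase and handing it the query, might look like the following sketch. The registry format (a mapping from phrase to application identifier), the longest-phrase tie-break, the choice to forward only the remainder of the query, and the `com.example.*` identifiers are all assumptions made for illustration.

```python
def route_query(textual_query, reserved):
    """Pick the application whose reserved carrier phrase starts the query.

    `reserved` maps a carrier phrase to a hypothetical application identifier.
    Returns (application, remainder of the query) or None if nothing matches.
    When several phrases match, the one with the most words wins; that rule
    is an assumption of this sketch.
    """
    words = textual_query.split()
    best = None  # (phrase length in words, application)
    for phrase, app in reserved.items():
        p = phrase.lower().split()
        if p and [w.lower() for w in words[: len(p)]] == p:
            if best is None or len(p) > best[0]:
                best = (len(p), app)
    if best is None:
        return None
    n, app = best
    return app, " ".join(words[n:])
```

For example, with `{"play": "com.example.music", "play movie": "com.example.video"}`, the query "Play movie Casablanca" is routed to the video application with "Casablanca" as the forwarded part.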
-
Publication No.: US09881608B2
Publication date: 2018-01-30
Application No.: US15608110
Filing date: 2017-05-30
Applicant: Google Inc.
Inventor: Michael J. LeBeau , William J. Byrne , John Nicholas Jitkoff , Brandon M. Ballinger , Trausti T. Kristjansson
IPC: G10L15/00 , G10L21/00 , G10L25/00 , G06F17/27 , G06F17/21 , G10L15/22 , G10L15/30 , G10L15/26 , G06F17/24 , G06F3/0484 , G06F17/22 , G10L15/01 , G06F3/0482 , G06F3/0488
CPC classification number: G10L15/22 , G06F3/0482 , G06F3/04842 , G06F3/04886 , G06F17/2241 , G06F17/24 , G06F17/273 , G06F17/277 , G10L15/01 , G10L15/26 , G10L15/265 , G10L15/30
Abstract: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.
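The present, select, and replace steps in this abstract can be sketched with a deliberately simplified lattice. A real word lattice is a graph of scored hypotheses; the sketch below assumes it has already been flattened into one slot per word position, each with a best word and a list of alternates. The `Slot` type and the function names are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Slot:
    """One word position: the top hypothesis plus its alternates (simplified)."""
    best: str
    alternates: List[str] = field(default_factory=list)


def present(slots):
    """The transcribed words currently shown to the user."""
    return [s.best for s in slots]


def alternates_for(slots, index):
    """Alternate words offered when the user selects the word at `index`."""
    return list(slots[index].alternates)


def replace_word(slots, index, choice):
    """Swap the selected word with the chosen alternate and return the new text."""
    slot = slots[index]
    if choice not in slot.alternates:
        raise ValueError(f"{choice!r} is not an alternate for {slot.best!r}")
    # Keep the displaced word available as an alternate so the user can undo.
    slot.alternates = [slot.best if a == choice else a for a in slot.alternates]
    slot.best = choice
    return present(slots)
```

Keeping the displaced word among the alternates is a design choice of this sketch, not something the abstract specifies.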
-
Publication No.: US20170270926A1
Publication date: 2017-09-21
Application No.: US15608110
Filing date: 2017-05-30
Applicant: Google Inc.
Inventor: Michael J. LeBeau , William J. Byrne , John Nicholas Jitkoff , Brandon M. Ballinger , Trausti T. Kristjansson
IPC: G10L15/22 , G10L15/30 , G10L15/26 , G06F3/0482 , G06F3/0488 , G06F3/0484 , G06F17/22 , G10L15/01 , G06F17/27 , G06F17/24
CPC classification number: G10L15/22 , G06F3/0482 , G06F3/04842 , G06F3/04886 , G06F17/2241 , G06F17/24 , G06F17/273 , G06F17/277 , G10L15/01 , G10L15/26 , G10L15/265 , G10L15/30
Abstract: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.
-
Publication No.: US09570077B1
Publication date: 2017-02-14
Application No.: US14259540
Filing date: 2014-04-23
Applicant: Google Inc.
Inventor: Michael J. LeBeau , John Nicholas Jitkoff , William J. Byrne
CPC classification number: H04W4/50 , G10L15/08 , G10L15/1822 , G10L15/26 , G10L2015/223 , H04L67/02 , H04L67/20 , H04M1/271
Abstract: In general, the subject matter described in this specification can be embodied in methods, systems, and program products for receiving a voice query at a mobile computing device and generating data that represents content of the voice query. The data is provided to a server system. A textual query that has been determined by a speech recognizer at the server system to be a textual form of at least part of the data is received at the mobile computing device. The textual query is determined to include a carrier phrase of one or more words that is reserved by a first third-party application program installed on the computing device. The first third-party application is selected, from a group of one or more third-party applications, to receive all or a part of the textual query. All or a part of the textual query is provided to the selected first application program.
-
Publication No.: US20170025123A1
Publication date: 2017-01-26
Application No.: US15284323
Filing date: 2016-10-03
Applicant: GOOGLE INC.
Inventor: Brian Strope , William J. Byrne , Francoise Beaufays
IPC: G10L15/22 , G06Q30/02 , G06F17/30 , G10L15/197 , G10L15/30
CPC classification number: G10L15/22 , G06F17/30241 , G06F17/30864 , G06F17/30867 , G06F17/3087 , G06Q30/02 , G10L15/18 , G10L15/197 , G10L15/26 , G10L15/30 , G10L2015/223 , G10L2015/228
Abstract: A method of operating a voice-enabled business directory search system includes receiving category-business pairs, each category-business pair including a business category and a specific business, and establishing a data structure having nodes based on the category-business pairs. Each node of the data structure is associated with one or more business categories and a speech recognition language model for recognizing specific businesses associated with the one or more business categories.
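To make the node structure in this abstract concrete, the sketch below builds one node per business category from category-business pairs and attaches a toy stand-in for a language model. The assumptions are significant: the abstract does not say how nodes are arranged, so a flat mapping is used, and the "language model" here is only a word-frequency table over business names. `CategoryNode`, `build_nodes`, and `score` are hypothetical names.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class CategoryNode:
    """A node tied to one or more categories plus a toy unigram 'language model'."""
    categories: frozenset
    word_counts: Counter = field(default_factory=Counter)

    def score(self, utterance):
        """Average relative frequency of the utterance's words in this node."""
        total = sum(self.word_counts.values())
        words = utterance.lower().split()
        if not total or not words:
            return 0.0
        return sum(self.word_counts[w] for w in words) / (total * len(words))


def build_nodes(pairs):
    """Group (category, business) pairs into one node per category."""
    nodes = {}
    for category, business in pairs:
        node = nodes.setdefault(category, CategoryNode(frozenset([category])))
        # Each business name contributes its words to the node's word counts.
        node.word_counts.update(business.lower().split())
    return nodes
```

Scoring an utterance against every node and taking the highest score is one simple way such a structure could narrow recognition to the businesses of a single category.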
-
Publication No.: US09466287B2
Publication date: 2016-10-11
Application No.: US14988201
Filing date: 2016-01-05
Applicant: Google Inc.
Inventor: Michael J. LeBeau , William J. Byrne , John Nicholas Jitkoff , Brandon M. Ballinger , Trausti T. Kristjansson
IPC: G10L21/00 , G10L15/01 , G06F17/27 , G10L15/22 , G10L15/30 , G10L15/26 , G06F17/24 , G06F3/0484 , G06F17/22
CPC classification number: G10L15/22 , G06F3/0482 , G06F3/04842 , G06F3/04886 , G06F17/2241 , G06F17/24 , G06F17/273 , G06F17/277 , G10L15/01 , G10L15/26 , G10L15/265 , G10L15/30
Abstract: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.
-
Publication No.: US20160163308A1
Publication date: 2016-06-09
Application No.: US15045571
Filing date: 2016-02-17
Applicant: Google Inc.
Inventor: Michael J. LeBeau , William J. Byrne , John Nicholas Jitkoff , Brandon M. Ballinger , Trausti T. Kristjansson
IPC: G10L15/01 , G10L15/30 , G06F17/22 , G10L15/26 , G06F3/0484
CPC classification number: G10L15/22 , G06F3/0482 , G06F3/04842 , G06F3/04886 , G06F17/2241 , G06F17/24 , G06F17/273 , G06F17/277 , G10L15/01 , G10L15/26 , G10L15/265 , G10L15/30
Abstract: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.
-
Publication No.: US20150294668A1
Publication date: 2015-10-15
Application No.: US14747306
Filing date: 2015-06-23
Applicant: Google Inc.
Inventor: Michael J. LeBeau , William J. Byrne , John Nicholas Jitkoff , Brandon M. Ballinger , Trausti T. Kristjansson
CPC classification number: G10L15/22 , G06F3/0482 , G06F3/04842 , G06F3/04886 , G06F17/2241 , G06F17/24 , G06F17/273 , G06F17/277 , G10L15/01 , G10L15/26 , G10L15/265 , G10L15/30
Abstract: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.
-