专利检索 ap:("Google LLC") AND inv:"Diego Melendo Casado" 第 1 页

1.

发明公开
AUTOMATICALLY DETERMINING LANGUAGE FOR SPEECH RECOGNITION OF SPOKEN UTTERANCE RECEIVED VIA AN AUTOMATED ASSISTANT INTERFACE 审中-公开

公开(公告)号：US20240194191A1

公开(公告)日：2024-06-13

申请号：US18389033

申请日：2023-11-13

申请人： GOOGLE LLC

发明人： Pu-sen Chao , Diego Melendo Casado , Ignacio Lopez Moreno

IPC分类号： G10L15/14 , G06F3/16 , G10L15/00 , G10L15/02 , G10L15/08 , G10L15/18 , G10L15/183 , G10L15/22 , G10L15/30

CPC分类号： G10L15/14 , G06F3/167 , G10L15/005 , G10L15/02 , G10L15/1822 , G10L15/183 , G10L15/22 , G10L15/30 , G10L2015/088 , G10L2015/223 , G10L2015/228

摘要： Implementations relate to determining a language for speech recognition of a spoken utterance, received via an automated assistant interface, for interacting with an automated assistant. Implementations can enable multilingual interaction with the automated assistant, without necessitating a user explicitly designate a language to be utilized for each interaction. Selection of a speech recognition model for a particular language can based on one or more interaction characteristics exhibited during a dialog session between a user and an automated assistant. Such interaction characteristics can include anticipated user input types, anticipated user input durations, a duration for monitoring for a user response, and/or an actual duration of a provided user response.

2.

发明授权
Adaptive interface in a voice-based networked system 有权

公开(公告)号：US11817084B2

公开(公告)日：2023-11-14

申请号：US16880647

申请日：2020-05-21

申请人： GOOGLE LLC

发明人： Pu-sen Chao , Diego Melendo Casado , Ignacio Lopez Moreno

IPC分类号： G10L15/14 , G10L15/02 , G10L15/18 , G06F3/16 , G10L15/00 , G10L15/183 , G10L15/22 , G10L15/30 , G10L15/08

CPC分类号： G10L15/14 , G06F3/167 , G10L15/005 , G10L15/02 , G10L15/183 , G10L15/1822 , G10L15/22 , G10L15/30 , G10L2015/088 , G10L2015/223 , G10L2015/228

摘要： The present disclosure relates generally to determining a language for speech recognition of a spoken utterance, received via an automated assistant interface, for interacting with an automated assistant. The system can enable multilingual interaction with the automated assistant, without necessitating a user explicitly designate a language to be utilized for each interaction. Selection of a speech recognition model for a particular language can based on one or more interaction characteristics exhibited during a dialog session between a user and an automated assistant. Such interaction characteristics can include anticipated user input types, anticipated user input durations, a duration for monitoring for a user response, and/or an actual duration of a provided user response.

3.

发明授权
Dynamic and/or context-specific hot words to invoke automated assistant 有权

公开(公告)号：US11810557B2

公开(公告)日：2023-11-07

申请号：US17676130

申请日：2022-02-19

申请人： Google LLC

发明人： Diego Melendo Casado

IPC分类号： G10L15/183 , G10L15/22

CPC分类号： G10L15/183 , G10L15/22 , G10L2015/223

摘要： Techniques are described herein for enabling the use of “dynamic” or “context-specific” hot words to invoke an automated assistant. In various implementations, an automated assistant may be executed in a default listening state at least in part on a user's computing device(s). While in the default listening state, audio data captured by microphone(s) may be monitored for default hot words. Detection of the default hot word(s) transitions of the automated assistant into a speech recognition state. Sensor signal(s) generated by hardware sensor(s) integral with the computing device(s) may be detected and analyzed to determine an attribute of the user. Based on the analysis, the automated assistant may transition into an enhanced listening state in which the audio data may be monitored for enhanced hot word(s). Detection of enhanced hot word(s) triggers the automated assistant to perform a responsive action without requiring detection of default hot word(s).

4.

发明授权
Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface 有权

公开(公告)号：US11798541B2

公开(公告)日：2023-10-24

申请号：US17099367

申请日：2020-11-16

申请人： Google LLC

发明人： Pu-sen Chao , Diego Melendo Casado , Ignacio Lopez Moreno

IPC分类号： G10L15/00 , G10L15/197 , G10L15/22 , G10L15/30 , G10L15/08 , G10L15/14 , G10L15/18 , G10L13/00

CPC分类号： G10L15/197 , G10L13/00 , G10L15/005 , G10L15/08 , G10L15/14 , G10L15/1822 , G10L15/22 , G10L15/30 , G10L2015/088 , G10L2015/223 , G10L2015/228

摘要： Determining a language for speech recognition of a spoken utterance received via an automated assistant interface for interacting with an automated assistant. Implementations can enable multilingual interaction with the automated assistant, without necessitating a user explicitly designate a language to be utilized for each interaction. Implementations determine a user profile that corresponds to audio data that captures a spoken utterance, and utilize language(s), and optionally corresponding probabilities, assigned to the user profile in determining a language for speech recognition of the spoken utterance. Some implementations select only a subset of languages, assigned to the user profile, to utilize in speech recognition of a given spoken utterance of the user. Some implementations perform speech recognition in each of multiple languages assigned to the user profile, and utilize criteria to select only one of the speech recognitions as appropriate for generating and providing content that is responsive to the spoken utterance.

5.

发明公开
MULTI-USER AUTHENTICATION ON A DEVICE 审中-公开

公开(公告)号：US20230335116A1

公开(公告)日：2023-10-19

申请号：US18210963

申请日：2023-06-16

申请人： GOOGLE LLC

发明人： Meltem Oktem , Taral Pradeep Joglekar , Fnu Heryandi , Pu-sen Chao , Ignacio Lopez Moreno , Salil Rajadhyaksha , Alexander H. Gruenstein , Diego Melendo Casado

IPC分类号： G10L15/08 , G06F21/32 , G10L15/22 , G10L17/00 , G06V40/10 , G10L17/06 , G10L15/07 , G06F16/635

CPC分类号： G10L15/08 , G06F16/636 , G06F21/32 , G06V40/10 , G10L15/07 , G10L15/22 , G10L17/00 , G10L17/06 , G10L2015/088 , G10L15/26

摘要： In some implementations, processor(s) can receive an utterance from a speaker, and determine whether the speaker is a known user of a user device or not a known user of the user device. The user device can be shared by a plurality of known users. Further, the processor(s) can determine whether the utterance corresponds to a personal request or non-personal request. Moreover, and in response to determining that the speaker not a known user of the user device and in response to determining that the utterance corresponds to a non-personal request, the processor(s) can cause a response to the utterance to be provided for presentation to the speaker at the user device response to the utterance, or can cause an action to be performed by the user device responsive to the utterance.

6.

发明申请
DYNAMIC AND/OR CONTEXT-SPECIFIC HOT WORDS TO INVOKE AUTOMATED ASSISTANT 有权

公开(公告)号：US20220246140A1

公开(公告)日：2022-08-04

申请号：US17676130

申请日：2022-02-19

申请人： Google LLC

发明人： Diego Melendo Casado

IPC分类号： G10L15/183

摘要： Techniques are described herein for enabling the use of “dynamic” or “context-specific” hot words to invoke an automated assistant. In various implementations, an automated assistant may be executed in a default listening state at least in part on a user's computing device(s). While in the default listening state, audio data captured by microphone(s) may be monitored for default hot words. Detection of the default hot word(s) transitions of the automated assistant into a speech recognition state. Sensor signal(s) generated by hardware sensor(s) integral with the computing device(s) may be detected and analyzed to determine an attribute of the user. Based on the analysis, the automated assistant may transition into an enhanced listening state in which the audio data may be monitored for enhanced hot word(s). Detection of enhanced hot word(s) triggers the automated assistant to perform a responsive action without requiring detection of default hot word(s).

7.

发明授权
Selection biasing 有权

公开(公告)号：US11334182B2

公开(公告)日：2022-05-17

申请号：US16708539

申请日：2019-12-10

申请人： Google LLC

发明人： Jakob Nicolaus Foerster , Diego Melendo Casado , Glen Shires

IPC分类号： G06F3/041 , G06F3/04842 , G06F3/04883 , G06F3/16 , G06F3/0488 , G06F3/023 , G06F40/58 , G06F40/274

摘要： In some implementations, data indicating a touch received on a proximity-sensitive display is received while the proximity-sensitive display is presenting one or more items. In one aspect, the techniques describe may involve a process for disambiguating touch selections of hypothesized items, such as text or graphical objects that have been generated based on input data, on a proximity-sensitive display. This process may allow a user to more easily select hypothesized items that the user may wish to correct, by determining whether a touch received through the proximity-sensitive display represents a selection of each hypothesized item based at least on a level of confidence that the hypothesized item accurately represents the input data.

8.

发明申请
Device Leadership Negotiation Among Voice Interface Devices 有权

公开(公告)号：US20210249015A1

公开(公告)日：2021-08-12

申请号：US17242273

申请日：2021-04-27

申请人： GOOGLE LLC

发明人： Kenneth Mixter , Diego Melendo Casado , Alexander Houston Gruenstein , Terry Tai , Christopher Thaddeus Hughes , Matthew Nirvan Sharifi

IPC分类号： G10L15/22 , G10L15/32

摘要： The various implementations described herein include methods and systems for determining device leadership among voice interface devices. In one aspect, a method is performed at a first electronic device of a plurality of electronic devices, each having microphones, a speaker, processors, and memory storing programs for execution by the processors. The first device detects a voice input. It determines a device state and a relevance of the voice input. It identifies a subset of electronic devices from the plurality to which the voice input is relevant. In accordance with a determination that the subset includes the first device, the first device determines a first score of a criterion associated with the voice input and receives second scores of the criterion from other devices in the subset. In accordance with a determination that the first score is higher than the second scores, the first device responds to the detected input.

9.

发明授权
Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface 有权

公开(公告)号：US10896672B2

公开(公告)日：2021-01-19

申请号：US15769023

申请日：2018-04-16

申请人： Google LLC

发明人： Pu-sen Chao , Diego Melendo Casado , Ignacio Lopez Moreno

IPC分类号： G10L15/14 , G10L15/02 , G10L15/18 , G10L15/00 , G10L15/22 , G10L15/30 , G06F3/16 , G10L15/183 , G10L15/08

摘要： Implementations relate to determining a language for speech recognition of a spoken utterance, received via an automated assistant interface, for interacting with an automated assistant. Implementations can enable multilingual interaction with the automated assistant, without necessitating a user explicitly designate a language to be utilized for each interaction. Selection of a speech recognition model for a particular language can based on one or more interaction characteristics exhibited during a dialog session between a user and an automated assistant. Such interaction characteristics can include anticipated user input types, anticipated user input durations, a duration for monitoring for a user response, and/or an actual duration of a provided user response.

10.

发明申请
DYNAMIC AND/OR CONTEXT-SPECIFIC HOT WORDS TO INVOKE AUTOMATED ASSISTANT 审中-公开

公开(公告)号：US20200342855A1

公开(公告)日：2020-10-29

申请号：US16618681

申请日：2018-08-21

申请人： Google LLC

发明人： Diego Melendo Casado

IPC分类号： G10L15/183

摘要： Techniques are described herein for enabling the use of “dynamic” or “context-specific” hot words to invoke an automated assistant. In various implementations, an automated assistant may be executed in a default listening state at least in part on a user's computing device(s). While in the default listening state, audio data captured by microphone(s) may be monitored for default hot words. Detection of the default hot word(s) transitions of the automated assistant into a speech recognition state. Sensor signal(s) generated by hardware sensor(s) integral with the computing device(s) may be detected and analyzed to determine an attribute of the user. Based on the analysis, the automated assistant may transition into an enhanced listening state in which the audio data may be monitored for enhanced hot word(s). Detection of enhanced hot word(s) triggers the automated assistant to perform a responsive action without requiring detection of default hot word(s).

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类