Patent search ap:("GOOGLE LLC") AND inv:"Daniel Valcarce" Page 1

1.

发明授权
Contextual suppression of assistant command(s) 有权

公开(公告)号：US11557293B2

公开(公告)日：2023-01-17

申请号：US17321994

申请日：2021-05-17

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Matthew Sharifi , Ondrej Skopek , Justin Lu , Daniel Valcarce , Kevin Kilgour , Mohamad Hassan Rom , Nicolo D'Ercole , Michael Golikov

IPC: G10L15/22 , G10L15/18 , G10L25/78 , G10L15/05 , G10L15/08

Abstract: Some implementations process, using warm word model(s), a stream of audio data to determine a portion of the audio data that corresponds to particular word(s) and/or phrase(s) (e.g., a warm word) associated with an assistant command, process, using an automatic speech recognition (ASR) model, a preamble portion of the audio data (e.g., that precedes the warm word) and/or a postamble portion of the audio data (e.g., that follows the warm word) to generate ASR output, and determine, based on processing the ASR output, whether a user intended the assistant command to be performed. Additional or alternative implementations can process the stream of audio data using a speaker identification (SID) model to determine whether the audio data is sufficient to identify the user that provided a spoken utterance captured in the stream of audio data, and determine if that user is authorized to cause performance of the assistant command.

2.

发明公开
CONTEXTUAL SUPPRESSION OF ASSISTANT COMMAND(S) 审中-公开

公开(公告)号：US20240347060A1

公开(公告)日：2024-10-17

申请号：US18750663

申请日：2024-06-21

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Matthew Sharifi , Ondrej Skopek , Justin Lu , Daniel Valcarce , Kevin Kilgour , Mohamad Hassan Rom , Nicolo D'Ercole , Michael Golikov

IPC: G10L15/22 , G10L15/05 , G10L15/08 , G10L15/18 , G10L25/78

CPC classification number: G10L15/22 , G10L15/05 , G10L15/1815 , G10L25/78 , G10L2015/088 , G10L2015/223

Abstract: Some implementations process, using warm word model(s), a stream of audio data to determine a portion of the audio data that corresponds to particular word(s) and/or phrase(s) (e.g., a warm word) associated with an assistant command, process, using an automatic speech recognition (ASR) model, a preamble portion of the audio data (e.g., that precedes the warm word) and/or a postamble portion of the audio data (e.g., that follows the warm word) to generate ASR output, and determine, based on processing the ASR output, whether a user intended the assistant command to be performed. Additional or alternative implementations can process the stream of audio data using a speaker identification (SID) model to determine whether the audio data is sufficient to identify the user that provided a spoken utterance captured in the stream of audio data, and determine if that user is authorized to cause performance of the assistant command.

3.

发明公开
ALTERING A CANDIDATE TEXT REPRESENTATION, OF SPOKEN INPUT, BASED ON FURTHER SPOKEN INPUT 审中-公开

公开(公告)号：US20230252995A1

公开(公告)日：2023-08-10

申请号：US17667314

申请日：2022-02-08

Applicant: GOOGLE LLC

Inventor： Matthew Sharifi , Victor Carbune , Bogdan Prisacari , Alexander Froemmgen , Milosz Kmieciak , Felix Weissenberger , Daniel Valcarce

IPC: G10L15/26 , G10L15/22 , G10L15/08 , G10L15/06

CPC classification number: G10L15/26 , G10L15/22 , G10L15/08 , G10L15/063 , G10L2015/088

Abstract: Various implementations include determining whether further spoken input is intended to correct at least one word in a candidate text representation of spoken input. Various implementations include receiving audio data capturing spoken input of a user. Various implementations include rendering output based on the candidate text representation to the user. Various implementations include receiving, while the output is being rendered, further audio data capturing the further spoken input. In response to determining the further spoken input is intended to correct the at least one word in the candidate text representation, various implementations include generating a revised text representation of the spoken input by altering at least one word in the candidate text representation based on one or more terms in the further candidate text representation.

4.

发明申请
DYNAMICALLY CONFIGURING A WARM WORD BUTTON WITH ASSISTANT COMMANDS 有权

公开(公告)号：US20230061929A1

公开(公告)日：2023-03-02

申请号：US17532315

申请日：2021-11-22

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Antonio Gaetani , Bastiaan Van Eeckhoudt , Daniel Valcarce , Michael Golikov , Justin Lu , Ondrej Skopek , Nicolo D'Ercole , Zaheed Sabur , Behshad Behzadi , Luv Kothari

IPC: G10L17/22

Abstract: Implementations described herein relate to configuring a dynamic warm word button, that is associated with a client device, with particular assistant commands based on detected occurrences of warm word activation events at the client device. In response to detecting an occurrence of a given warm word activation event at the client device, implementations can determine whether user verification is required for a user that actuated the warm word button. Further, in response to determining that the user verification is required for the user that actuated the warm word button, the user verification can be performed. Moreover, in response to determining that the user that actuated the warm word button has been verified, implementations can cause an automated assistant to perform the particular assistant command associated with the warm word activation event. Audio-based and/or non-audio-based techniques can be utilized to perform the user verification.

5.

发明授权
Contextual suppression of assistant command(s) 有权

公开(公告)号：US12057119B2

公开(公告)日：2024-08-06

申请号：US18092883

申请日：2023-01-03

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Matthew Sharifi , Ondrej Skopek , Justin Lu , Daniel Valcarce , Kevin Kilgour , Mohamad Hassan Rom , Nicolo D'Ercole , Michael Golikov

IPC: G10L15/22 , G10L15/05 , G10L15/18 , G10L25/78 , G10L15/08

CPC classification number: G10L15/22 , G10L15/05 , G10L15/1815 , G10L25/78 , G10L2015/088 , G10L2015/223

Abstract: Some implementations process, using warm word model(s), a stream of audio data to determine a portion of the audio data that corresponds to particular word(s) and/or phrase(s) (e.g., a warm word) associated with an assistant command, process, using an automatic speech recognition (ASR) model, a preamble portion of the audio data (e.g., that precedes the warm word) and/or a postamble portion of the audio data (e.g., that follows the warm word) to generate ASR output, and determine, based on processing the ASR output, whether a user intended the assistant command to be performed. Additional or alternative implementations can process the stream of audio data using a speaker identification (SID) model to determine whether the audio data is sufficient to identify the user that provided a spoken utterance captured in the stream of audio data, and determine if that user is authorized to cause performance of the assistant command.

6.

发明公开
CONTEXTUAL SUPPRESSION OF ASSISTANT COMMAND(S) 审中-公开

公开(公告)号：US20230143177A1

公开(公告)日：2023-05-11

申请号：US18092883

申请日：2023-01-03

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Matthew Sharifi , Ondrej Skopek , Justin Lu , Daniel Valcarce , Kevin Kilgour , Mohamad Hassan Rom , Nicolo D'Ercole , Michael Golikov

IPC: G10L15/22 , G10L15/05 , G10L15/18 , G10L25/78

CPC classification number: G10L15/22 , G10L15/05 , G10L15/1815 , G10L25/78 , G10L2015/088

Abstract: Some implementations process, using warm word model(s), a stream of audio data to determine a portion of the audio data that corresponds to particular word(s) and/or phrase(s) (e.g., a warm word) associated with an assistant command, process, using an automatic speech recognition (ASR) model, a preamble portion of the audio data (e.g., that precedes the warm word) and/or a postamble portion of the audio data (e.g., that follows the warm word) to generate ASR output, and determine, based on processing the ASR output, whether a user intended the assistant command to be performed. Additional or alternative implementations can process the stream of audio data using a speaker identification (SID) model to determine whether the audio data is sufficient to identify the user that provided a spoken utterance captured in the stream of audio data, and determine if that user is authorized to cause performance of the assistant command.

7.

发明申请
CONTEXTUAL SUPPRESSION OF ASSISTANT COMMAND(S) 有权

公开(公告)号：US20220366903A1

公开(公告)日：2022-11-17

申请号：US17321994

申请日：2021-05-17

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Matthew Sharifi , Ondrej Skopek , Justin Lu , Daniel Valcarce , Kevin Kilgour , Mohamad Hassan Rom , Nicolo D'Ercole , Michael Golikov

IPC: G10L15/22 , G10L15/18 , G10L15/05 , G10L25/78

Abstract: Some implementations process, using warm word model(s), a stream of audio data to determine a portion of the audio data that corresponds to particular word(s) and/or phrase(s) (e.g., a warm word) associated with an assistant command, process, using an automatic speech recognition (ASR) model, a preamble portion of the audio data (e.g., that precedes the warm word) and/or a postamble portion of the audio data (e.g., that follows the warm word) to generate ASR output, and determine, based on processing the ASR output, whether a user intended the assistant command to be performed. Additional or alternative implementations can process the stream of audio data using a speaker identification (SID) model to determine whether the audio data is sufficient to identify the user that provided a spoken utterance captured in the stream of audio data, and determine if that user is authorized to cause performance of the assistant command.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification