Patent search ap:("GOOGLE LLC") AND inv:"Turaj Zakizadeh Shabestary" Page 1

1.

发明授权
Methods and systems for detecting and processing speech signals 有权

公开(公告)号：US12051423B2

公开(公告)日：2024-07-30

申请号：US18159076

申请日：2023-01-24

Applicant: Google LLC

Inventor： Jay Pierre Civelli , Mikhal Shemer , Turaj Zakizadeh Shabestary , David Tapuska

IPC: G10L15/22 , G10L15/30 , G10L15/32 , G10L15/02 , G10L15/08

CPC classification number: G10L15/30 , G10L15/22 , G10L15/32 , G10L15/02 , G10L2015/088 , G10L2015/223

Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

2.

发明申请
SELECTIVE ADAPTATION AND UTILIZATION OF NOISE REDUCTION TECHNIQUE IN INVOCATION PHRASE DETECTION 有权

公开(公告)号：US20220392441A1

公开(公告)日：2022-12-08

申请号：US17886726

申请日：2022-08-12

Applicant: Google LLC

Inventor： Christopher Hughes , Yiteng Huang , Turaj Zakizadeh Shabestary , Taylor Applebaum

IPC: G10L15/20 , G10L15/02 , G10L15/08 , G10L15/22 , G10L21/0232 , G10L25/84

Abstract: Techniques are described for selectively adapting and/or selectively utilizing a noise reduction technique in detection of one or more features of a stream of audio data frames. For example, various techniques are directed to selectively adapting and/or utilizing a noise reduction technique in detection of an invocation phrase in a stream of audio data frames, detection of voice characteristics in a stream of audio data frames (e.g., for speaker identification), etc. Utilization of described techniques can result in more robust and/or more accurate detections of features of a stream of audio data frames in various situations, such as in environments with strong background noise. In various implementations, described techniques are implemented in combination with an automated assistant, and feature(s) detected utilizing techniques described herein are utilized to adapt the functionality of the automated assistant.

3.

发明申请
SELECTIVE ADAPTATION AND UTILIZATION OF NOISE REDUCTION TECHNIQUE IN INVOCATION PHRASE DETECTION 审中-公开

公开(公告)号：US20200294496A1

公开(公告)日：2020-09-17

申请号：US16886139

申请日：2020-05-28

Applicant: Google LLC

Inventor： Christopher Hughes , Yiteng Huang , Turaj Zakizadeh Shabestary , Taylor Applebaum

IPC: G10L15/20 , G10L15/02 , G10L15/08 , G10L15/22 , G10L21/0232 , G10L25/84

Abstract: Techniques are described for selectively adapting and/or selectively utilizing a noise reduction technique in detection of one or more features of a stream of audio data frames. For example, various techniques are directed to selectively adapting and/or utilizing a noise reduction technique in detection of an invocation phrase in a stream of audio data frames, detection of voice characteristics in a stream of audio data frames (e.g., for speaker identification), etc. Utilization of described techniques can result in more robust and/or more accurate detections of features of a stream of audio data frames in various situations, such as in environments with strong background noise. In various implementations, described techniques are implemented in combination with an automated assistant, and feature(s) detected utilizing techniques described herein are utilized to adapt the functionality of the automated assistant.

4.

发明申请
ECHO CANCELLATION FOR KEYWORD SPOTTING 审中-公开

公开(公告)号：US20200152220A1

公开(公告)日：2020-05-14

申请号：US16598462

申请日：2019-10-10

Applicant: GOOGLE LLC

Inventor： Turaj Zakizadeh Shabestary , Willem Bastiaan Kleijn , Jan Skoglund

IPC: G10L21/0232 , H04R3/04 , G10L15/08 , G10L21/0208 , H04M9/08

Abstract: Techniques of performing linear acoustic echo cancellation performing a phase correction operation on the estimate of the echo signal based on a clock drift between a capture of an input microphone signal and a playout of a loudspeaker signal. Along these lines, the existence of the clock drift, i.e., a small difference in the sampling rates of the input microphone signal and the loudspeaker signal, can cause processing circuitry in a device configured to perform LAEC operations to generate a filter based on the magnitudes of the short-term Fourier transforms (STFTs) of the input microphone signal and the loudspeaker signal. Such a filter is real-valued and results in a positive estimate of the acoustic echo signal included in the input microphone signal. The phase of this estimate may then be aligned with the phase of the input microphone signal.

5.

发明授权
Methods and systems for detecting and processing speech signals 有权

公开(公告)号：US10163443B2

公开(公告)日：2018-12-25

申请号：US15624935

申请日：2017-06-16

Applicant: Google LLC

Inventor： Jay Pierre Civelli , Mikhal Shemer , Turaj Zakizadeh Shabestary , David Tapuska

IPC: G10L21/00 , G10L15/30 , G10L15/22 , G10L15/32 , G10L15/02 , G10L15/08

Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

6.

发明授权
STFT-based echo muter 有权

公开(公告)号：US12051434B2

公开(公告)日：2024-07-30

申请号：US17643825

申请日：2021-12-11

Applicant: Google LLC

Inventor： Turaj Zakizadeh Shabestary , Arun Narayanan

IPC: G10L21/0224 , G10L21/0208 , G10L21/0232

CPC classification number: G10L21/0224 , G10L21/0232 , G10L2021/02082

Abstract: A method for Short-Time Fourier Transform-based echo muting includes receiving a microphone signal including acoustic echo captured by a microphone and corresponding to audio content from an acoustic speaker, and receiving a reference signal including a sequence of frames representing the audio content. For each frame in a sequence of frames, the method includes processing, using an acoustic echo canceler configured to receive a respective frame as input to generate a respective output signal frame that cancels the acoustic echo from the respective frame, and determining, using a Double-talk Detector (DTD), based on the respective frame and the respective output signal frame, whether the respective frame includes a double-talk frame or an echo-only frame. For each respective frame that includes the echo-only frame, muting the respective output signal frame, and performing speech processing on the respective output signal frame for each respective frame that includes the double-talk frame.

7.

发明授权
Selective adaptation and utilization of noise reduction technique in invocation phrase detection 有权

公开(公告)号：US11984117B2

公开(公告)日：2024-05-14

申请号：US17886726

申请日：2022-08-12

Applicant: Google LLC

Inventor： Christopher Hughes , Yiteng Huang , Turaj Zakizadeh Shabestary , Taylor Applebaum

IPC: G10L15/20 , G10L15/02 , G10L15/08 , G10L15/22 , G10L21/0232 , G10L25/84 , G10L21/0216

CPC classification number: G10L15/20 , G10L15/02 , G10L15/08 , G10L15/22 , G10L21/0232 , G10L25/84 , G10L2015/025 , G10L2015/088 , G10L2015/223 , G10L2021/02166

Abstract: Techniques are described for selectively adapting and/or selectively utilizing a noise reduction technique in detection of one or more features of a stream of audio data frames. For example, various techniques are directed to selectively adapting and/or utilizing a noise reduction technique in detection of an invocation phrase in a stream of audio data frames, detection of voice characteristics in a stream of audio data frames (e.g., for speaker identification), etc. Utilization of described techniques can result in more robust and/or more accurate detections of features of a stream of audio data frames in various situations, such as in environments with strong background noise. In various implementations, described techniques are implemented in combination with an automated assistant, and feature(s) detected utilizing techniques described herein are utilized to adapt the functionality of the automated assistant.

8.

发明公开
Methods And Systems For Detecting And Processing Speech Signals 审中-公开

公开(公告)号：US20230169979A1

公开(公告)日：2023-06-01

申请号：US18159076

申请日：2023-01-24

Applicant: Google LLC

Inventor： Jay Pierre Civelli , Mikhal Shemer , Turaj Zakizadeh Shabestary , David Tapuska

IPC: G10L15/30 , G10L15/22 , G10L15/32

CPC classification number: G10L15/30 , G10L15/22 , G10L15/32 , G10L15/02

Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

9.

发明授权
Selective adaptation and utilization of noise reduction technique in invocation phrase detection 有权

公开(公告)号：US11417324B2

公开(公告)日：2022-08-16

申请号：US16886139

申请日：2020-05-28

Applicant: Google LLC

Inventor： Christopher Hughes , Yiteng Huang , Turaj Zakizadeh Shabestary , Taylor Applebaum

IPC: G10L15/20 , G10L15/02 , G10L15/08 , G10L15/22 , G10L21/0232 , G10L25/84 , G10L21/0216

Abstract: Techniques are described for selectively adapting and/or selectively utilizing a noise reduction technique in detection of one or more features of a stream of audio data frames. For example, various techniques are directed to selectively adapting and/or utilizing a noise reduction technique in detection of an invocation phrase in a stream of audio data frames, detection of voice characteristics in a stream of audio data frames (e.g., for speaker identification), etc. Utilization of described techniques can result in more robust and/or more accurate detections of features of a stream of audio data frames in various situations, such as in environments with strong background noise. In various implementations, described techniques are implemented in combination with an automated assistant, and feature(s) detected utilizing techniques described herein are utilized to adapt the functionality of the automated assistant.

10.

发明申请
Methods And Systems For Detecting And Processing Speech Signals 有权

公开(公告)号：US20210090574A1

公开(公告)日：2021-03-25

申请号：US17114621

申请日：2020-12-08

Applicant: Google LLC

Inventor： Jay Pierre Civelli , Mikhal Shemer , Turaj Zakizadeh Shabestary , David Tapuska

IPC: G10L15/30 , G10L15/22 , G10L15/32

Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification