-
公开(公告)号:US11924618B2
公开(公告)日:2024-03-05
申请号:US17959734
申请日:2022-10-04
Applicant: Google LLC
Inventor: Rajeev Conrad Nongpiur , Ananya Misra , Chanwoo Kim
CPC classification number: H04R3/005 , H04R5/027 , H04R29/005 , H04R29/006 , H04R2201/401 , H04R2430/20
Abstract: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.
-
公开(公告)号:US11308963B2
公开(公告)日:2022-04-19
申请号:US16936948
申请日:2020-07-23
Applicant: Google LLC
Inventor: Chanwoo Kim , Rajeev Nongpiur , Michiel Bacchiani
Abstract: Systems and methods are described for improving endpoint detection of a voice query submitted by a user. In some implementations, a synchronized video data and audio data is received. A sequence of frames of the video data that includes images corresponding to lip movement on a face is determined. The audio data is endpointed based on first audio data that corresponds to a first frame of the sequence of frames and second audio data that corresponds to a last frame of the sequence of frames. A transcription of the endpointed audio data is generated by an automated speech recognizer. The generated transcription is then provided for output.
-
公开(公告)号:US20190387315A1
公开(公告)日:2019-12-19
申请号:US16555118
申请日:2019-08-29
Applicant: Google LLC
Inventor: Rajeev Conrad Nongpiur , Ananya Misra , Chanwoo Kim
Abstract: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.
-
公开(公告)号:US20230027458A1
公开(公告)日:2023-01-26
申请号:US17959734
申请日:2022-10-04
Applicant: Google LLC
Inventor: Rajeev Conrad Nongpiur , Ananya Misra , Chanwoo Kim
Abstract: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.
-
公开(公告)号:US20190333507A1
公开(公告)日:2019-10-31
申请号:US16412677
申请日:2019-05-15
Applicant: Google LLC
Inventor: Chanwoo Kim , Rajeev Conrad Nongpiur , Michiel A.U. Bacchiani
Abstract: Systems and methods are described for improving endpoint detection of a voice query submitted by a user. In some implementations, a synchronized video data and audio data is received. A sequence of frames of the video data that includes images corresponding to lip movement on a face is determined. The audio data is endpointed based on first audio data that corresponds to a first frame of the sequence of frames and second audio data that corresponds to a last frame of the sequence of frames. A transcription of the endpointed audio data is generated by an automated speech recognizer. The generated transcription is then provided for output.
-
公开(公告)号:US11470419B2
公开(公告)日:2022-10-11
申请号:US16555118
申请日:2019-08-29
Applicant: Google LLC
Inventor: Rajeev Conrad Nongpiur , Ananya Misra , Chanwoo Kim
Abstract: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.
-
公开(公告)号:US20220238112A1
公开(公告)日:2022-07-28
申请号:US17722960
申请日:2022-04-18
Applicant: Google LLC
Inventor: Chanwoo Kim , Rajeev Conrad Nongpiur , Michiel A.U. Bacchiani
Abstract: Systems and methods are described for improving endpoint detection of a voice query submitted by a user. In some implementations, a synchronized video data and audio data is received. A sequence of frames of the video data that includes images corresponding to lip movement on a face is determined. The audio data is endpointed based on first audio data that corresponds to a first frame of the sequence of frames and second audio data that corresponds to a last frame of the sequence of frames. A transcription of the endpointed audio data is generated by an automated speech recognizer. The generated transcription is then provided for output.
-
公开(公告)号:US10755714B2
公开(公告)日:2020-08-25
申请号:US16412677
申请日:2019-05-15
Applicant: Google LLC
Inventor: Chanwoo Kim , Rajeev Conrad Nongpiur , Michiel A. U. Bacchiani
Abstract: Systems and methods are described for improving endpoint detection of a voice query submitted by a user. In some implementations, a synchronized video data and audio data is received. A sequence of frames of the video data that includes images corresponding to lip movement on a face is determined. The audio data is endpointed based on first audio data that corresponds to a first frame of the sequence of frames and second audio data that corresponds to a last frame of the sequence of frames. A transcription of the endpointed audio data is generated by an automated speech recognizer. The generated transcription is then provided for output.
-
公开(公告)号:US20180279043A1
公开(公告)日:2018-09-27
申请号:US15996070
申请日:2018-06-01
Applicant: Google LLC
Inventor: Chanwoo Kim , Rajeev Conrad Nongpiur , Ananya Misra
CPC classification number: H04R3/005 , H04R5/027 , H04R29/005 , H04R29/006 , H04R2201/401 , H04R2430/20
Abstract: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.
-
公开(公告)号:US10412489B2
公开(公告)日:2019-09-10
申请号:US15996070
申请日:2018-06-01
Applicant: Google LLC
Inventor: Rajeev Conrad Nongpiur , Ananya Misra , Chanwoo Kim
Abstract: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.
-
-
-
-
-
-
-
-
-