Query endpointing based on lip detection

    公开(公告)号:US11308963B2

    公开(公告)日:2022-04-19

    申请号:US16936948

    申请日:2020-07-23

    Applicant: Google LLC

    Abstract: Systems and methods are described for improving endpoint detection of a voice query submitted by a user. In some implementations, a synchronized video data and audio data is received. A sequence of frames of the video data that includes images corresponding to lip movement on a face is determined. The audio data is endpointed based on first audio data that corresponds to a first frame of the sequence of frames and second audio data that corresponds to a last frame of the sequence of frames. A transcription of the endpointed audio data is generated by an automated speech recognizer. The generated transcription is then provided for output.

    AURALIZATION FOR MULTI-MICROPHONE DEVICES
    3.
    发明申请

    公开(公告)号:US20190387315A1

    公开(公告)日:2019-12-19

    申请号:US16555118

    申请日:2019-08-29

    Applicant: Google LLC

    Abstract: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.

    AURALIZATION FOR MULTI-MICROPHONE DEVICES

    公开(公告)号:US20230027458A1

    公开(公告)日:2023-01-26

    申请号:US17959734

    申请日:2022-10-04

    Applicant: Google LLC

    Abstract: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.

    QUERY ENDPOINTING BASED ON LIP DETECTION
    5.
    发明申请

    公开(公告)号:US20190333507A1

    公开(公告)日:2019-10-31

    申请号:US16412677

    申请日:2019-05-15

    Applicant: Google LLC

    Abstract: Systems and methods are described for improving endpoint detection of a voice query submitted by a user. In some implementations, a synchronized video data and audio data is received. A sequence of frames of the video data that includes images corresponding to lip movement on a face is determined. The audio data is endpointed based on first audio data that corresponds to a first frame of the sequence of frames and second audio data that corresponds to a last frame of the sequence of frames. A transcription of the endpointed audio data is generated by an automated speech recognizer. The generated transcription is then provided for output.

    Auralization for multi-microphone devices

    公开(公告)号:US11470419B2

    公开(公告)日:2022-10-11

    申请号:US16555118

    申请日:2019-08-29

    Applicant: Google LLC

    Abstract: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.

    QUERY ENDPOINTING BASED ON LIP DETECTION

    公开(公告)号:US20220238112A1

    公开(公告)日:2022-07-28

    申请号:US17722960

    申请日:2022-04-18

    Applicant: Google LLC

    Abstract: Systems and methods are described for improving endpoint detection of a voice query submitted by a user. In some implementations, a synchronized video data and audio data is received. A sequence of frames of the video data that includes images corresponding to lip movement on a face is determined. The audio data is endpointed based on first audio data that corresponds to a first frame of the sequence of frames and second audio data that corresponds to a last frame of the sequence of frames. A transcription of the endpointed audio data is generated by an automated speech recognizer. The generated transcription is then provided for output.

    Query endpointing based on lip detection

    公开(公告)号:US10755714B2

    公开(公告)日:2020-08-25

    申请号:US16412677

    申请日:2019-05-15

    Applicant: Google LLC

    Abstract: Systems and methods are described for improving endpoint detection of a voice query submitted by a user. In some implementations, a synchronized video data and audio data is received. A sequence of frames of the video data that includes images corresponding to lip movement on a face is determined. The audio data is endpointed based on first audio data that corresponds to a first frame of the sequence of frames and second audio data that corresponds to a last frame of the sequence of frames. A transcription of the endpointed audio data is generated by an automated speech recognizer. The generated transcription is then provided for output.

    Auralization for multi-microphone devices

    公开(公告)号:US10412489B2

    公开(公告)日:2019-09-10

    申请号:US15996070

    申请日:2018-06-01

    Applicant: Google LLC

    Abstract: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.

Patent Agency Ranking