AUTOMATIC SPEECH RECOGNITION IMPOSTER REJECTION ON A HEADPHONE WITH AN ACCELEROMETER

    公开(公告)号:US20210125609A1

    公开(公告)日:2021-04-29

    申请号:US16666252

    申请日:2019-10-28

    Applicant: Apple Inc.

    Abstract: A signal processing method to determine whether or not a detected key-phrase is spoken by a wearer of a headphone. The method receives an accelerometer signal from an accelerometer in a headphone and receives a microphone signal from at least one microphone in the headphone. The method detects a key-phrase using the microphone signal and generates a voice activity detection (VAD) signal based on the accelerometer signal. The method determines whether the VAD signal indicates that the detected key-phrase is spoken by a wearer of the headphone. Responsive to determining that the VAD signal indicates that the detected key-phrase is spoken by the wearer of the headphone, triggering a virtual personal assistant (VPA).

    STATE CLASSIFICATION FOR AUDIO ACCESSORIES, AND RELATED SYSTEMS AND METHODS

    公开(公告)号:US20210099782A1

    公开(公告)日:2021-04-01

    申请号:US16998135

    申请日:2020-08-20

    Applicant: Apple Inc.

    Abstract: An earphone has a housing and a corresponding user-contact surface configured to urge against a user's anatomy. The housing defines an acoustic chamber and an acoustic port opening from the acoustic chamber. The user-contact surface is complementarily configured relative to the user's anatomy. When the earphone is donned, the user-contact surface forms an acoustic seal between the user-contact surface and the user's anatomy, acoustically coupling the acoustic chamber with the user's ear canal. An acoustic driver is positioned in the housing and acoustically coupled with the acoustic chamber. A microphone transducer acoustically couples with the acoustic port. A processing component is configured to detect a presence or an absence of anti-resonance in a spectral envelope observed by the microphone transducer.

    Noise-dependent audio signal selection system

    公开(公告)号:US11227617B2

    公开(公告)日:2022-01-18

    申请号:US16563624

    申请日:2019-09-06

    Applicant: Apple Inc.

    Abstract: A device implementing an automatic speech recognition triggering system includes at least one processor configured to receive first and second audio signals respectively corresponding to first and second microphones of a device. The at least one processor is further configured to generate, based on at least one of the first or second audio signals, a third audio signal corresponding to a voice beam directed to an expected position of a mouth of a user. The at least one processor is further configured to determine whether wind noise is present in at least one of the first, second, or third audio signals. The at least one processor is further configured to, based on determining whether wind noise is present, an audio signal from among the second or third audio signals, for a determination of whether at least one of the first or second audio signals corresponds to the user.

    Method and system for speech enhancement using a remote microphone

    公开(公告)号:US10332538B1

    公开(公告)日:2019-06-25

    申请号:US15999121

    申请日:2018-08-17

    Applicant: Apple Inc.

    Abstract: A speech enhancement system for a remote microphone has a wireless receiver that receives a signal from a first microphone of a remote device. A delay buffer receives a second microphone signal from a second microphone and delays by an adjustable delay. The adjustable delay is based on a difference between a wireless delay and an acoustic delay. A noise suppressor produces an output audio signal for an earpiece speaker, based on the first microphone signal and the adjustable delayed second microphone signal. Other aspects are also described and claimed.

    Memory and computation efficient cross-correlation and delay estimation

    公开(公告)号:US10431238B1

    公开(公告)日:2019-10-01

    申请号:US15999120

    申请日:2018-08-17

    Applicant: Apple Inc.

    Abstract: A digital processor-based memory-efficient and computation-efficient audio signal processing technique partitions each of two audio signals into shorter segments and combines the shorter segments into combined segments. A processor cross-correlates the first combined segment and the second combined segment into a cross-correlation result, which is written into a cross-correlation array. The result may be used for delay estimation (to estimate the relative delay between the two audio signals.) Other aspects are also described and claimed.

Patent Agency Ranking