-
公开(公告)号:US20210074316A1
公开(公告)日:2021-03-11
申请号:US16708296
申请日:2019-12-09
申请人: Apple Inc.
发明人: Mehrez SOUDEN , Ante JUKIC , Jason WUNG , Ashrith DESHPANDE , Joshua D. ATKINS
IPC分类号: G10L25/81 , G10L25/18 , G10L21/0232 , G06K9/00 , G10L15/25 , G10L15/22 , G06N7/00 , G06N20/00
摘要: A device implementing a system for processing speech in an audio signal includes at least one processor configured to receive an audio signal corresponding to at least one microphone of a device, and to determine, using a first model, a first probability that a speech source is present in the audio signal. The at least one processor is further configured to determine, using a second model, a second probability that an estimated location of a source of the audio signal corresponds to an expected position of a user of the device, and to determine a likelihood that the audio signal corresponds to the user of the device based on the first and second probabilities.
-
公开(公告)号:US20240267674A1
公开(公告)日:2024-08-08
申请号:US18522108
申请日:2023-11-28
申请人: Apple Inc.
摘要: Aspects of the subject technology relate to providing device-independent audio for electronic devices. In one or more implementations, microphone data captured by multiple microphones at an electronic device may be provided to a device-specific audio generalizer at the electronic device. The device-specific audio generalizer may utilize device specific information to generalize the microphone data to form device-independent audio data. The device-independent audio data may then be provided to a device-independent machine learning model at the electronic device or another electronic device for further processing.
-
公开(公告)号:US20180336892A1
公开(公告)日:2018-11-22
申请号:US15920091
申请日:2018-03-13
申请人: Apple Inc.
发明人: Yoon KIM , John BRIDLE , Joshua D. ATKINS , Feipeng LI , Mehrez SOUDEN
CPC分类号: G10L15/22 , G10L15/1822 , G10L15/30 , G10L25/51 , G10L2015/228 , G10L2021/02166 , H04R3/005
摘要: Systems and processes for operating an intelligent automated assistant are provided. In accordance with one example, a method includes, at an electronic device with one or more processors, memory, and a plurality of microphones, sampling, at each of the plurality of microphones of the electronic device, an audio signal to obtain a plurality of audio signals; processing the plurality of audio signals to obtain a plurality of audio streams; and determining, based on the plurality of audio streams, whether any of the plurality of audio signals corresponds to a spoken trigger. The method further includes, in accordance with a determination that the plurality of audio signals corresponds to the spoken trigger, initiating a session of the digital assistant; and in accordance with a determination that the plurality of audio signals does not correspond to the spoken trigger, foregoing initiating a session of the digital assistant.
-
公开(公告)号:US20220270629A1
公开(公告)日:2022-08-25
申请号:US17589889
申请日:2022-01-31
申请人: Apple Inc.
IPC分类号: G10L21/028 , H04R1/10 , G10L21/0232 , G06N20/00
摘要: Implementations of the subject technology provide systems and methods for providing audio source separation for audio input, such as for audio devices having limited power and/or computing resources. The subject technology may allow an audio device to leverage processing and/or power resources of a companion device that is communicatively coupled to the audio device. The companion device may identify a noise condition of the audio device, select a source separation model based on the noise condition, and provide the source separation model to the audio device. In this way, the audio device can provide audio source separation functionality using a relatively small footprint source separation model that is specific to the noise condition in which the audio device is operated.
-
公开(公告)号:US20190074009A1
公开(公告)日:2019-03-07
申请号:US16181138
申请日:2018-11-05
申请人: Apple Inc.
发明人: Yoon KIM , John BRIDLE , Joshua D. ATKINS , Feipeng LI , Mehrez SOUDEN
摘要: Systems and processes for operating an intelligent automated assistant are provided. In accordance with one example, a method includes, at an electronic device with one or more processors, memory, and a plurality of microphones, sampling, at each of the plurality of microphones of the electronic device, an audio signal to obtain a plurality of audio signals; processing the plurality of audio signals to obtain a plurality of audio streams; and determining, based on the plurality of audio streams, whether any of the plurality of audio signals corresponds to a spoken trigger. The method further includes, in accordance with a determination that the plurality of audio signals corresponds to the spoken trigger, initiating a session of the digital assistant; and in accordance with a determination that the plurality of audio signals does not correspond to the spoken trigger, foregoing initiating a session of the digital assistant.
-
公开(公告)号:US20240029754A1
公开(公告)日:2024-01-25
申请号:US18376438
申请日:2023-10-03
申请人: Apple Inc.
IPC分类号: G10L21/028 , H04R1/10 , G06N20/00 , G10L21/0232
CPC分类号: G10L21/028 , H04R1/1016 , H04R1/1041 , H04R1/1083 , G06N20/00 , G10L21/0232 , H04R2420/07
摘要: Implementations of the subject technology provide systems and methods for providing audio source separation for audio input, such as for audio devices having limited power and/or computing resources. The subject technology may allow an audio device to leverage processing and/or power resources of a companion device that is communicatively coupled to the audio device. The companion device may identify a noise condition of the audio device, select a source separation model based on the noise condition, and provide the source separation model to the audio device. In this way, the audio device can provide audio source separation functionality using a relatively small footprint source separation model that is specific to the noise condition in which the audio device is operated.
-
公开(公告)号:US20230111509A1
公开(公告)日:2023-04-13
申请号:US18080550
申请日:2022-12-13
申请人: Apple Inc.
发明人: Yoon KIM , John BRIDLE , Joshua D. ATKINS , Feipeng LI , Mehrez SOUDEN
IPC分类号: G10L15/22 , H04R1/40 , G10L15/08 , G10L15/04 , H04R3/00 , G10L15/30 , G10L15/18 , G10L15/28 , G10L21/0216
摘要: Systems and processes for operating an intelligent automated assistant are provided. In accordance with one example, a method includes, at an electronic device with one or more processors, memory, and a plurality of microphones, sampling, at each of the plurality of microphones of the electronic device, an audio signal to obtain a plurality of audio signals; processing the plurality of audio signals to obtain a plurality of audio streams; and determining, based on the plurality of audio streams, whether any of the plurality of audio signals corresponds to a spoken trigger. The method further includes, in accordance with a determination that the plurality of audio signals corresponds to the spoken trigger, initiating a session of the digital assistant; and in accordance with a determination that the plurality of audio signals does not correspond to the spoken trigger, foregoing initiating a session of the digital assistant.
-
公开(公告)号:US20210097998A1
公开(公告)日:2021-04-01
申请号:US17111132
申请日:2020-12-03
申请人: Apple Inc.
发明人: Yoon KIM , John BRIDLE , Joshua D. ATKINS , Feipeng LI , Mehrez SOUDEN
摘要: Systems and processes for operating an intelligent automated assistant are provided. In accordance with one example, a method includes, at an electronic device with one or more processors, memory, and a plurality of microphones, sampling, at each of the plurality of microphones of the electronic device, an audio signal to obtain a plurality of audio signals; processing the plurality of audio signals to obtain a plurality of audio streams; and determining, based on the plurality of audio streams, whether any of the plurality of audio signals corresponds to a spoken trigger. The method further includes, in accordance with a determination that the plurality of audio signals corresponds to the spoken trigger, initiating a session of the digital assistant; and in accordance with a determination that the plurality of audio signals does not correspond to the spoken trigger, foregoing initiating a session of the digital assistant.
-
-
-
-
-
-
-