-
公开(公告)号:US11721355B2
公开(公告)日:2023-08-08
申请号:US17677850
申请日:2022-02-22
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Lance Jabr , Matthew S. Connolly , Robert D. Silfvast , Sean A. Ramprashad , Carlos Avendano , Miquel Espi Marques
IPC: G10L21/0388 , G10L21/0208 , G10L19/008 , G10L21/0272
CPC classification number: G10L21/0388 , G10L19/008 , G10L21/0208 , G10L21/0272
Abstract: A first device obtains, from the array, several audio signals and processes the audio signals to produce a speech signal and one or more ambient signals. The first device processes the ambient signals to produce a sound-object sonic descriptor that has metadata describing a sound object within an acoustic environment. The first device transmits, over a communication data link, the speech signal and the descriptor to a second electronic device that is configured to spatially reproduce the sound object using the descriptor mixed with the speech signal, to produce several mixed signals to drive several speakers.
-
公开(公告)号:US11295754B2
公开(公告)日:2022-04-05
申请号:US16940792
申请日:2020-07-28
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Lance Jabr , Matthew S. Connolly , Robert D. Silfvast , Sean A. Ramprashad , Carlos Avendano , Miquel Espi Marques
IPC: G10L21/0388 , G10L21/0208 , G10L19/008 , G10L21/0272
Abstract: A first device obtains, from the array, several audio signals and processes the audio signals to produce a speech signal and one or more ambient signals. The first device processes the ambient signals to produce a sound-object sonic descriptor that has metadata describing a sound object within an acoustic environment. The first device transmits, over a communication data link, the speech signal and the descriptor to a second electronic device that is configured to spatially reproduce the sound object using the descriptor mixed with the speech signal, to produce several mixed signals to drive several speakers.
-
公开(公告)号:US11941968B2
公开(公告)日:2024-03-26
申请号:US18103486
申请日:2023-01-30
Applicant: Apple Inc.
Inventor: Hyung-Suk Kim , Daniel C. Klingler , Miquel Espi Marques , Carlos M. Avendano
CPC classification number: G08B21/182 , G01H3/005 , G06N20/00 , G08B7/06 , G10L25/51
Abstract: An electronic device includes a processor, and a memory containing instructions that, when executed by the processor, cause the electronic device to learn a sound emitted by a legacy device and to issue an output when the electronic device subsequently hears the sound. For example, the electronic device can receive a training input and extract a compact representation of a sound in the training input, which the device stores. The device can receive an audio signal corresponding to an observed acoustic scene and extract a representation of the observed acoustic scene from the audio signal. The electronic device can determine whether the sound is present in the observed acoustic scene at least in part from a comparison of the representation of the observed acoustic scene with the representation of the sound. The electronic device emits a selected output responsive to determining that the sound is present in the acoustic scene.
-
公开(公告)号:US20210020018A1
公开(公告)日:2021-01-21
申请号:US16872168
申请日:2020-05-11
Applicant: Apple Inc.
Inventor: Hyung-Suk Kim , Daniel C. Klingler , Miquel Espi Marques , Carlos M. Avendano
Abstract: An electronic device includes a processor, and a memory containing instructions that, when executed by the processor, cause the electronic device to learn a sound emitted by a legacy device and to issue an output when the electronic device subsequently hears the sound. For example, the electronic device can receive a training input and extract a compact representation of a sound in the training input, which the device stores. The device can receive an audio signal corresponding to an observed acoustic scene and extract a representation of the observed acoustic scene from the audio signal. The electronic device can determine whether the sound is present in the observed acoustic scene at least in part from a comparison of the representation of the observed acoustic scene with the representation of the sound. The electronic device emits a selected output responsive to determining that the sound is present in the acoustic scene.
-
公开(公告)号:US20200090644A1
公开(公告)日:2020-03-19
申请号:US16564775
申请日:2019-09-09
Applicant: Apple Inc.
Inventor: Daniel C. Klingler , Carlos M. Avendano , Hyung-Suk Kim , Miquel Espi Marques
Abstract: An electronic device has one or more microphones that pick up a sound. At least one feature extractor processes the audio signals from the microphones, that contain the picked up the sound, to determine several features for the sound. The electronic device also includes a classifier that has a machine learning model which is configured to determine a sound classification, such as artificial versus natural for the sound, based upon at least one of the determined features. Other aspects are also described and claimed.
-
公开(公告)号:US12182674B2
公开(公告)日:2024-12-31
申请号:US17737017
申请日:2022-05-04
Applicant: Apple Inc.
Inventor: Jonathan Huang , Miquel Espi Marques , Carlos M. Avendano , Kevin M. Durand , David Findlay , Vasudha Kowtha , Daniel C. Klingler , Yichi Zhang
Abstract: The subject disclosure provides systems and methods for providing locally trained models for detecting individual sounds using electronic devices. Local detection of individual sounds with a detection model at an electronic device can be provided by obtaining training samples for the detection model with the electronic device, and generating additional negative and positive training samples based on the obtained training samples. A two-stage detection process may be provided, in which a trigger model at a device compares an audio input to a reference sound to trigger a detection model at the device. The detection of individual sounds with a detection model at an electronic device can also leverage audio capture capabilities of multiple devices in an acoustic scene to capture multiple concurrent training samples.
-
公开(公告)号:US20240363094A1
公开(公告)日:2024-10-31
申请号:US18622449
申请日:2024-03-29
Applicant: Apple Inc.
Inventor: Ashok Masilamani , Prateek Murgai , John Woodruff , David M. Fischer , Jonathan D. Sheaffer , Jonathan Huang , Sorin V. Dusan , Andrew W. Malta , Erik D. Hornberger , Yichi Zhang , Miquel Espi Marques , Carlos M. Avendano
IPC: G10K11/178 , G10L21/0208 , G10L21/0216 , G10L21/0308 , H04R1/10 , H04R3/00
CPC classification number: G10K11/17854 , G10K11/17823 , G10K11/17827 , G10K11/17881 , G10K11/17885 , G10L21/0208 , G10L21/0308 , H04R1/1041 , H04R3/005 , G10K2210/1081 , G10L2021/02166 , H04R2460/01 , H04R2460/13
Abstract: A conversation detector processes microphone signals and other sensor signals of a headphone to declare a conversation and configures a filter block to activate a transparency audio signal. It then declares an end to the conversation based on processing one or more of the microphone signals and the other sensor signals, and in response deactivates the transparency audio signal. The conversation detector monitors an idle duration in which an OVAD and a TVAD are both or simultaneously indicating no activity and declares the end to the conversation in response to the idle duration being longer than an idle threshold. Other aspects are also described and claimed.
-
公开(公告)号:US20220180889A1
公开(公告)日:2022-06-09
申请号:US17677850
申请日:2022-02-22
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Lance Jabr , Matthew S. Connolly , Robert D. Silfvast , Sean A. Ramprashad , Carlos Avendano , Miquel Espi Marques
IPC: G10L21/0388 , G10L21/0208 , G10L19/008 , G10L21/0272
Abstract: A first device obtains, from the array, several audio signals and processes the audio signals to produce a speech signal and one or more ambient signals. The first device processes the ambient signals to produce a sound-object sonic descriptor that has metadata describing a sound object within an acoustic environment. The first device transmits, over a communication data link, the speech signal and the descriptor to a second electronic device that is configured to spatially reproduce the sound object using the descriptor mixed with the speech signal, to produce several mixed signals to drive several speakers.
-
公开(公告)号:US20210035597A1
公开(公告)日:2021-02-04
申请号:US16940792
申请日:2020-07-28
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Lance Jabr , Matthew S. Connolly , Robert D. Silfvast , Sean A. Ramprashad , Carlos Avendano , Miquel Espi Marques
IPC: G10L21/0388 , G10L21/0272 , G10L19/008 , G10L21/0208
Abstract: A first device obtains, from the array, several audio signals and processes the audio signals to produce a speech signal and one or more ambient signals. The first device processes the ambient signals to produce a sound-object sonic descriptor that has metadata describing a sound object within an acoustic environment. The first device transmits, over a communication data link, the speech signal and the descriptor to a second electronic device that is configured to spatially reproduce the sound object using the descriptor mixed with the speech signal, to produce several mixed signals to drive several speakers.
-
公开(公告)号:US12094457B2
公开(公告)日:2024-09-17
申请号:US17992785
申请日:2022-11-22
Applicant: Apple Inc.
Inventor: Daniel C. Klingler , Carlos M. Avendano , Hyung-Suk Kim , Miquel Espi Marques
Abstract: An electronic device has one or more microphones that pick up a sound. At least one feature extractor processes the audio signals from the microphones, that contain the picked up the sound, to determine several features for the sound. The electronic device also includes a classifier that has a machine learning model which is configured to determine a sound classification, such as artificial versus natural for the sound, based upon at least one of the determined features. Other aspects are also described and claimed.
-
-
-
-
-
-
-
-
-