-
Publication No.: US20220167107A1
Publication date: 2022-05-26
Application No.: US17666868
Filing date: 2022-02-08
Applicant: Apple Inc.
Inventor: Matthew S. Connolly , Christopher T. Eubank , Stephen E. Pinto , Tomlinson Holman
IPC: H04S7/00 , G06F3/16 , G10L19/008 , G10L19/16
Abstract: An audio asset library containing audio assets formatted in accordance with a file format for spatial audio includes asset metadata that enables simulated reality (SR) application developers to compose sounds for use in SR applications. The audio assets are formatted to include audio data encoding a sound capable of being composed into an SR application along with asset metadata describing not only how the sound was encoded, but also how a listener in an SR environment experiences the sound. An SR developer platform is configured so that developers can compose sound for SR objects using audio assets stored in the audio library, including editing the asset metadata to include transformation parameters that support dynamic transformation of the asset metadata in the SR environment to alter how the SR listener experiences the composed sound. Other embodiments are also described and claimed.
-
Publication No.: US11295754B2
Publication date: 2022-04-05
Application No.: US16940792
Filing date: 2020-07-28
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Lance Jabr , Matthew S. Connolly , Robert D. Silfvast , Sean A. Ramprashad , Carlos Avendano , Miquel Espi Marques
IPC: G10L21/0388 , G10L21/0208 , G10L19/008 , G10L21/0272
Abstract: A first device obtains, from the array, several audio signals and processes the audio signals to produce a speech signal and one or more ambient signals. The first device processes the ambient signals to produce a sound-object sonic descriptor that has metadata describing a sound object within an acoustic environment. The first device transmits, over a communication data link, the speech signal and the descriptor to a second electronic device that is configured to spatially reproduce the sound object using the descriptor mixed with the speech signal, to produce several mixed signals to drive several speakers.
-
Publication No.: US11272308B2
Publication date: 2022-03-08
Application No.: US16645422
Filing date: 2018-09-25
Applicant: Apple Inc.
Inventor: Matthew S. Connolly , Christopher T. Eubank , Stephen E. Pinto , Tomlinson Holman
IPC: H04R25/00 , H04S7/00 , G06F3/16 , G10L19/008 , G10L19/16
Abstract: An audio asset library containing audio assets formatted in accordance with a file format for spatial audio includes asset metadata that enables simulated reality (SR) application developers to compose sounds for use in SR applications. The audio assets are formatted to include audio data encoding a sound capable of being composed into an SR application along with asset metadata describing not only how the sound was encoded, but also how a listener in an SR environment experiences the sound. An SR developer platform is configured so that developers can compose sound for SR objects using audio assets stored in the audio library, including editing the asset metadata to include transformation parameters that support dynamic transformation of the asset metadata in the SR environment to alter how the SR listener experiences the composed sound. Other embodiments are also described and claimed.
-
Publication No.: US11227623B1
Publication date: 2022-01-18
Application No.: US16868360
Filing date: 2020-05-06
Applicant: Apple Inc.
Inventor: Nikolas T. Vitt , Christopher T. Eubank , Jonathan D. Sheaffer , Tomlinson Holman
IPC: G10L25/51 , G10L15/00 , H04R3/04 , H04R3/00 , H04R5/04 , H04R5/033 , H04R5/027 , G06F3/01 , H04R3/12
Abstract: Audio processing with audio transparency can include receiving a user content audio signal and receiving a microphone signal. The microphone signal can contain sensed sound of a user environment. Strength of the sensed sound can be increased based on strength of the user content audio signal, to reduce a masking of the sensed sound during playback. The sensed sound and the user content audio signal can be combined in a composite output audio signal used to drive a speaker. Other aspects are also described and claimed.
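Illustration (not part of the patent record): the abstract above says the strength of the sensed sound is increased based on the strength of the user content signal. A minimal Python sketch of that idea follows. The linear gain rule and the `base_gain`, `slope`, and `max_gain` parameters are hypothetical choices made for this example; the abstract does not specify how the increase is computed.

```python
import math
from typing import List, Sequence


def rms(samples: Sequence[float]) -> float:
    """Root-mean-square level of a block of samples (0.0 for an empty block)."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def transparency_gain(content_rms: float,
                      base_gain: float = 1.0,
                      slope: float = 2.0,
                      max_gain: float = 4.0) -> float:
    """Hypothetical rule: louder content yields a larger microphone gain, capped."""
    return min(base_gain + slope * content_rms, max_gain)


def mix_block(content: Sequence[float], mic: Sequence[float]) -> List[float]:
    """Combine content and boosted sensed sound into one composite output block."""
    gain = transparency_gain(rms(content))
    return [c + gain * m for c, m in zip(content, mic)]
```

With silent content the microphone signal passes at `base_gain`; as the content gets louder the sensed sound is raised so that it is masked less, up to `max_gain`.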
-
Publication No.: US20210329381A1
Publication date: 2021-10-21
Application No.: US17360825
Filing date: 2021-06-28
Applicant: Apple Inc.
Inventor: Tomlinson Holman , Christopher T. Eubank , Joshua D. Atkins , Soenke Pelzer , Dirk Schroeder
Abstract: An audio device can sense sound in a physical environment using a plurality of microphones to generate a plurality of microphone signals. Clean speech can be extracted from microphone signals. Ambience can be extracted from the microphone signals. The clean speech can be encoded at a first compression level. The ambience can be encoded at a second compression level that is higher than the first compression level. Other aspects are also described and claimed.
-
Publication No.: US10782779B1
Publication date: 2020-09-22
Application No.: US16580372
Filing date: 2019-09-24
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Daniel P. Patterson
Abstract: In some implementations, a method includes: obtaining user movement information, wherein the user movement information characterizes real-world body pose and trajectory information of the user; generating, from real-world user movement information and a predetermined placement of the virtual instrument in the computer generated reality (CGR) environment, a predicted virtual instrument interaction time for a virtual instrument interaction prior to the virtual instrument interaction occurring; determining whether or not the predicted virtual instrument interaction time falls within an acceptable temporal range around one of a plurality of temporal sound markers; and in response to determining that the predicted virtual instrument interaction time falls within the acceptable temporal range around a particular temporal sound marker of the plurality of temporal sound markers, quantizing the virtual instrument interaction by presenting play of the virtual instrument to match the particular temporal sound marker of the plurality of temporal sound markers.
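Illustration (not part of the patent record): the quantization step in the abstract above can be sketched as snapping a predicted interaction time to the nearest temporal sound marker when it falls within an acceptable range. The function name `quantize_interaction`, the symmetric `tolerance` window, and the use of seconds are assumptions for this sketch; the abstract does not say how the range is defined or how the interaction time is predicted.

```python
from bisect import bisect_left
from typing import Optional, Sequence


def quantize_interaction(predicted_time: float,
                         markers: Sequence[float],
                         tolerance: float) -> Optional[float]:
    """Return the marker to snap to, or None if no marker is close enough.

    `markers` must be sorted in ascending order (times in seconds).
    """
    if not markers:
        return None
    # Only the markers on either side of the prediction can be nearest.
    i = bisect_left(markers, predicted_time)
    candidates = markers[max(i - 1, 0): i + 1]
    nearest = min(candidates, key=lambda m: abs(m - predicted_time))
    if abs(nearest - predicted_time) <= tolerance:
        return nearest
    return None


beat_markers = [0.0, 0.5, 1.0, 1.5]
snapped = quantize_interaction(0.98, beat_markers, tolerance=0.05)
unsnapped = quantize_interaction(0.75, beat_markers, tolerance=0.05)
```

Here `snapped` is the marker at 1.0, so play of the virtual instrument would be presented on that marker, while `unsnapped` is `None` because 0.75 lies outside the window of every marker.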
-
Publication No.: US20240236610A1
Publication date: 2024-07-11
Application No.: US18611407
Filing date: 2024-03-20
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Joshua D. Atkins , Soenke Pelzer , Dirk Schroeder
IPC: H04S7/00
CPC classification number: H04S7/305 , H04S2420/01
Abstract: Processing sound in an enhanced reality environment can include generating, based on an image of a physical environment, an acoustic model of the physical environment. Audio signals captured by a microphone array can capture a sound in the physical environment. Based on these audio signals, one or more measured acoustic parameters of the physical environment can be generated. A target audio signal can be processed using the model of the physical environment and the measured acoustic parameters, resulting in a plurality of output audio channels having a virtual sound source with a virtual location. The output audio channels can be used to drive a plurality of speakers. Other aspects are also described and claimed.
-
Publication No.: US11956623B2
Publication date: 2024-04-09
Application No.: US17360823
Filing date: 2021-06-28
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Joshua D. Atkins , Soenke Pelzer , Dirk Schroeder
IPC: H04S7/00
CPC classification number: H04S7/305 , H04S2420/01
Abstract: Processing sound in an enhanced reality environment can include generating, based on an image of a physical environment, an acoustic model of the physical environment. Audio signals captured by a microphone array can capture a sound in the physical environment. Based on these audio signals, one or more measured acoustic parameters of the physical environment can be generated. A target audio signal can be processed using the model of the physical environment and the measured acoustic parameters, resulting in a plurality of output audio channels having a virtual sound source with a virtual location. The output audio channels can be used to drive a plurality of speakers. Other aspects are also described and claimed.
-
Publication No.: US11947005B2
Publication date: 2024-04-02
Application No.: US17978859
Filing date: 2022-11-01
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Ryan S. Carlin
CPC classification number: G01S15/89 , G06T7/50 , G06T2207/10028
Abstract: In one implementation, a method includes: obtaining a first depth estimation characterizing a distance between the device and a surface in a real-world environment, wherein the first depth estimation is derived from image data including a representation of the surface; receiving, using the audio transceiver, an acoustic reflection of an acoustic wave, wherein the acoustic wave is transmitted in a known direction relative to the device; and determining a second depth estimation based on the acoustic reflection, wherein the second depth estimation characterizes the distance between the device and the surface in the real-world environment; and determining a confirmed depth estimation characterizing the distance between the device and the surface based on resolving any mismatch between the first depth estimation and the second depth estimation.
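Illustration (not part of the patent record): the abstract above compares an image-derived depth with a depth derived from an acoustic reflection and resolves any mismatch. The sketch below converts a round-trip echo time to distance with the standard time-of-flight relation d = c·t/2, then applies a resolution policy. The policy (average when the two estimates agree within `max_mismatch_m`, otherwise fall back to the acoustic estimate) is purely hypothetical; the abstract does not describe how the mismatch is resolved.

```python
SPEED_OF_SOUND_M_S = 343.0  # approximate speed of sound in dry air near 20 °C


def acoustic_depth(round_trip_s: float) -> float:
    """Distance to the reflecting surface from the echo's round-trip time."""
    return SPEED_OF_SOUND_M_S * round_trip_s / 2.0


def confirmed_depth(image_depth_m: float,
                    round_trip_s: float,
                    max_mismatch_m: float = 0.1) -> float:
    """Hypothetical mismatch resolution between image and acoustic depth."""
    echo_depth_m = acoustic_depth(round_trip_s)
    if abs(image_depth_m - echo_depth_m) <= max_mismatch_m:
        # Estimates agree: combine them.
        return (image_depth_m + echo_depth_m) / 2.0
    # Estimates disagree: this sketch arbitrarily trusts the acoustic one.
    return echo_depth_m
```

A 10 ms round trip corresponds to roughly 1.7 m, so an image-based estimate of 1.7 m would be confirmed by averaging, whereas an image-based estimate of 3.0 m would be overridden in this sketch.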
-
Publication No.: US11930337B2
Publication date: 2024-03-12
Application No.: US17360825
Filing date: 2021-06-28
Applicant: Apple Inc.
Inventor: Tomlinson Holman , Christopher T. Eubank , Joshua D. Atkins , Soenke Pelzer , Dirk Schroeder
IPC: G10L19/16 , G10L21/0208 , G10L21/0216 , H04R3/00 , H04R3/04 , H04R5/027 , H04R5/033 , H04R5/04
CPC classification number: H04R5/027 , G10L19/167 , G10L21/0216 , H04R3/005 , H04R3/04 , H04R5/033 , H04R5/04 , G10L2021/02082 , G10L2021/02166 , H04R2420/07
Abstract: An audio device can sense sound in a physical environment using a plurality of microphones to generate a plurality of microphone signals. Clean speech can be extracted from microphone signals. Ambience can be extracted from the microphone signals. The clean speech can be encoded at a first compression level. The ambience can be encoded at a second compression level that is higher than the first compression level. Other aspects are also described and claimed.