-
公开(公告)号:US20220167107A1
公开(公告)日:2022-05-26
申请号:US17666868
申请日:2022-02-08
Applicant: Apple Inc.
Inventor: Matthew S. Connolly , Christopher T. Eubank , Stephen E. Pinto , Tomlinson Holman
IPC: H04S7/00 , G06F3/16 , G10L19/008 , G10L19/16
Abstract: An audio asset library containing audio assets formatted in accordance with a file format for spatial audio includes asset metadata that enables simulated reality (SR) application developers to compose sounds for use in SR applications. The audio assets are formatted to include audio data encoding a sound capable of being composed into a SR application along with asset metadata describing not only how the sound was encoded, but also how a listener in SR environment experiences the sound. A SR developer platform is configured so that developers can compose sound for SR objects using audio assets stored in the audio library, including editing the asset metadata to include transformation parameters that support dynamic transformation of the asset metadata in the SR environment to alter how the SR listener experiences the composed sound. Other embodiments are also described and claimed.
-
公开(公告)号:US11295754B2
公开(公告)日:2022-04-05
申请号:US16940792
申请日:2020-07-28
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Lance Jabr , Matthew S. Connolly , Robert D. Silfvast , Sean A. Ramprashad , Carlos Avendano , Miquel Espi Marques
IPC: G10L21/0388 , G10L21/0208 , G10L19/008 , G10L21/0272
Abstract: A first device obtains, from the array, several audio signals and processes the audio signals to produce a speech signal and one or more ambient signals. The first device processes the ambient signals to produce a sound-object sonic descriptor that has metadata describing a sound object within an acoustic environment. The first device transmits, over a communication data link, the speech signal and the descriptor to a second electronic device that is configured to spatially reproduce the sound object using the descriptor mixed with the speech signal, to produce several mixed signals to drive several speakers.
-
公开(公告)号:US11272308B2
公开(公告)日:2022-03-08
申请号:US16645422
申请日:2018-09-25
Applicant: Apple Inc.
Inventor: Matthew S. Connolly , Christopher T. Eubank , Stephen E. Pinto , Tomlinson Holman
IPC: H04R25/00 , H04S7/00 , G06F3/16 , G10L19/008 , G10L19/16
Abstract: An audio asset library containing audio assets formatted in accordance with a file format for spatial audio includes asset metadata that enables simulated reality (SR) application developers to compose sounds for use in SR applications. The audio assets are formatted to include audio data encoding a sound capable of being composed into a SR application along with asset metadata describing not only how the sound was encoded, but also how a listener in SR environment experiences the sound. A SR developer platform is configured so that developers can compose sound for SR objects using audio assets stored in the audio library, including editing the asset metadata to include transformation parameters that support dynamic transformation of the asset metadata in the SR environment to alter how the SR listener experiences the composed sound. Other embodiments are also described and claimed.
-
公开(公告)号:US12010490B1
公开(公告)日:2024-06-11
申请号:US18149659
申请日:2023-01-03
Applicant: Apple Inc.
Inventor: Symeon Delikaris Manias , Mehrez Souden , Ante Jukic , Matthew S. Connolly , Sabine Webel , Ronald J. Guglielmone, Jr.
Abstract: An audio renderer can have a machine learning model that jointly processes audio and visual information of an audiovisual recording. The audio renderer can generate output audio channels. Sounds captured in the audiovisual recording and present in the output audio channels are spatially mapped based on the joint processing of the audio and visual information by the machine learning model. Other aspects are described.
-
公开(公告)号:US20230144903A1
公开(公告)日:2023-05-11
申请号:US18087214
申请日:2022-12-22
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Matthew S. Connolly , Stephen E. Pinto
CPC classification number: H04S7/304 , G06F3/165 , H04S3/008 , H04S2400/01 , H04S2400/03 , H04S2400/11
Abstract: Channels of audio data in a spatial audio object are associated with any one or more of a direction and a location of one or more recorded sounds, which channels are to be reproduced as spatial sound. A visualized spatial sound object represents a snapshot/thumbnail of the spatial sound. To preview the spatial sound (by experiencing its snapshot or thumbnail), a user manipulates the orientation of the visualized spatial sound object, and a weighted downmix of the channels is rendered for output as a spatial preview sound, e.g., a single output audio signal is provided to a spatial audio renderer; one or more of the channels that are oriented toward the user are emphasized in the preview sound, more than channels that are oriented away from the user. Other aspects are also described and claimed.
-
公开(公告)号:US20220180889A1
公开(公告)日:2022-06-09
申请号:US17677850
申请日:2022-02-22
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Lance Jabr , Matthew S. Connolly , Robert D. Silfvast , Sean A. Ramprashad , Carlos Avendano , Miquel Espi Marques
IPC: G10L21/0388 , G10L21/0208 , G10L19/008 , G10L21/0272
Abstract: A first device obtains, from the array, several audio signals and processes the audio signals to produce a speech signal and one or more ambient signals. The first device processes the ambient signals to produce a sound-object sonic descriptor that has metadata describing a sound object within an acoustic environment. The first device transmits, over a communication data link, the speech signal and the descriptor to a second electronic device that is configured to spatially reproduce the sound object using the descriptor mixed with the speech signal, to produce several mixed signals to drive several speakers.
-
公开(公告)号:US11128977B2
公开(公告)日:2021-09-21
申请号:US16645429
申请日:2018-09-26
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Matthew S. Connolly , Stephen E. Pinto
Abstract: Channels of audio data in a spatial audio object are associated with any one or more of a direction and a location of one or more recorded sounds, which channels are to be reproduced as spatial sound. A visualized spatial sound object represents a snapshot/thumbnail of the spatial sound. To preview the spatial sound (by experiencing its snapshot or thumbnail), a user manipulates the orientation of the visualized spatial sound object, and a weighted downmix of the channels is rendered for output as a spatial preview sound, e.g., a single output audio signal is provided to a spatial audio renderer; one or more of the channels that are oriented toward the user are emphasized in the preview sound, more than channels that are oriented away from the user. Other aspects are also described and claimed.
-
公开(公告)号:US20210035597A1
公开(公告)日:2021-02-04
申请号:US16940792
申请日:2020-07-28
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Lance Jabr , Matthew S. Connolly , Robert D. Silfvast , Sean A. Ramprashad , Carlos Avendano , Miquel Espi Marques
IPC: G10L21/0388 , G10L21/0272 , G10L19/008 , G10L21/0208
Abstract: A first device obtains, from the array, several audio signals and processes the audio signals to produce a speech signal and one or more ambient signals. The first device processes the ambient signals to produce a sound-object sonic descriptor that has metadata describing a sound object within an acoustic environment. The first device transmits, over a communication data link, the speech signal and the descriptor to a second electronic device that is configured to spatially reproduce the sound object using the descriptor mixed with the speech signal, to produce several mixed signals to drive several speakers.
-
公开(公告)号:US20200221248A1
公开(公告)日:2020-07-09
申请号:US16645429
申请日:2018-09-26
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Matthew S. Connolly , Stephen E. Pinto
Abstract: Channels of audio data in a spatial audio object are associated with any one or more of a direction and a location of one or more recorded sounds, which channels are to be reproduced as spatial sound. A visualized spatial sound object represents a snapshot/thumbnail of the spatial sound. To preview the spatial sound (by experiencing its snapshot or thumbnail), a user manipulates the orientation of the visualized spatial sound object, and a weighted downmix of the channels is rendered for output as a spatial preview sound, e.g., a single output audio signal is provided to a spatial audio renderer; one or more of the channels that are oriented toward the user are emphasized in the preview sound, more than channels that are oriented away from the user. Other aspects are also described and claimed.
-
-
-
-
-
-
-
-