-
Publication No.: US10362431B2
Publication date: 2019-07-23
Application No.: US15777058
Filing date: 2016-11-17
Inventor: Dirk Jeroen Breebaart, David Matthew Cooper, Mark F. Davis, David S. McGrath, Kristofer Kjoerling, Harald Mundt, Rhonda J. Wilson
IPC: H04S7/00, G10L19/008, H04S3/00, H04R5/033
Abstract: A method of encoding channel or object based input audio for playback, the method including the steps of: (a) initially rendering the channel or object based input audio into an initial output presentation; (b) determining an estimate of the dominant audio component from the channel or object based input audio and determining a series of dominant audio component weighting factors for mapping the initial output presentation into the dominant audio component; (c) determining an estimate of the dominant audio component direction or position; and (d) encoding the initial output presentation, the dominant audio component weighting factors, and the dominant audio component direction or position as the encoded signal for playback.
-
Publication No.: US20190179604A1
Publication date: 2019-06-13
Application No.: US16309230
Filing date: 2017-06-14
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Mark Alexander, Chunjian Li, Joshua Brandon Lando, Alan J. Seefeldt, Phillip C. Brown, Dirk Jeroen Breebaart
Abstract: Media input audio data corresponding to a media stream and microphone input audio data from at least one microphone may be received. A first level of at least one of a plurality of frequency bands of the media input audio data, as well as a second level of at least one of a plurality of frequency bands of the microphone input audio data, may be determined. Media output audio data and microphone output audio data may be produced by adjusting levels of one or more of the first and second plurality of frequency bands based on the perceived loudness of the microphone input audio data, of the microphone output audio data, of the media output audio data and the media input audio data. One or more processes may be modified upon receipt of a mode-switching indication.
-
Publication No.: US20170171687A1
Publication date: 2017-06-15
Application No.: US15375488
Filing date: 2016-12-12
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Dirk Jeroen Breebaart, Lianwu Chen, Lie Lu
CPC classification number: H04S7/30, H04S2400/11, H04S2400/13
Abstract: Example embodiments disclosed herein relate to audio object clustering with single channel quality preservation. A method of clustering audio objects is disclosed. The method includes determining cluster positions based on object positions of the audio objects and a reference speaker layout, the reference speaker layout indicating speakers located at different speaker positions. The method also includes determining object-to-cluster gains based on the determined cluster positions, the object positions and the reference speaker layout, an object-to-cluster gain defining a proportion of the respective audio object that is assigned to a cluster associated with one of the determined cluster positions. The method further includes clustering the audio objects based on the object-to-cluster gains and the cluster positions for generating cluster signals. Corresponding system, computer program product and device for clustering audio objects are also disclosed.
-
Publication No.: US20240282323A1
Publication date: 2024-08-22
Application No.: US18649738
Filing date: 2024-04-29
Inventor: Dirk Jeroen Breebaart, David Matthew Cooper, Leif Jonas Samuelsson
IPC: G10L19/02, G10L19/008, H04S7/00
CPC classification number: G10L19/0212, G10L19/008, G10L19/0204, H04S7/308, H04R2460/03, H04S2400/01, H04S2420/01, H04S2420/03, H04S2420/07
Abstract: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.
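As an illustration of what "multi-tap convolution matrix parameters" could mean in practice, the following is a minimal sketch that applies a matrix of short FIR filters to a set of base signals within one frequency band. The function name `apply_convolution_matrix`, the nested-list data layout, and the use of real-valued samples are assumptions made for this example; the abstract does not specify any of them.

```python
def apply_convolution_matrix(base, taps):
    """Transform base signals with a matrix of multi-tap (FIR) filters.

    base: list of base signals for one band, each a list of samples.
    taps: taps[i][j] is the list of FIR coefficients mapping base
          signal j to output signal i (hypothetical layout).
    Returns outputs with y[i][n] = sum_j sum_k taps[i][j][k] * base[j][n - k].
    """
    n_samples = len(base[0])
    outputs = []
    for row in taps:                      # one row per output signal
        y = [0.0] * n_samples
        for j, fir in enumerate(row):     # one FIR filter per base signal
            for k, coeff in enumerate(fir):
                # Tap k contributes a copy of base signal j delayed by k samples.
                for n in range(k, n_samples):
                    y[n] += coeff * base[j][n - k]
        outputs.append(y)
    return outputs


if __name__ == "__main__":
    first_presentation = [[1.0, 2.0, 3.0], [0.0, 1.0, 0.0]]
    matrix_taps = [
        [[1.0, 0.0], [0.0, 1.0]],   # output 0: base 0 plus delayed base 1
        [[0.5, 0.5], [0.0, 0.0]],   # output 1: two-tap average of base 0
    ]
    print(apply_convolution_matrix(first_presentation, matrix_taps))
```

With a single tap per filter this reduces to an ordinary mixing matrix; the additional taps let the transform also model delays and short filter responses within the band.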
-
Publication No.: US11721348B2
Publication date: 2023-08-08
Application No.: US17510205
Filing date: 2021-10-25
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Dirk Jeroen Breebaart
IPC: G10L19/008, G10L19/012, G10L19/00, G10L19/02
CPC classification number: G10L19/008, G10L19/00, G10L19/012, G10L19/0212
Abstract: Encoding/decoding an audio signal having one or more audio components, wherein each audio component is associated with a spatial location. A first audio signal presentation (z) of the audio components, a first set of transform parameters (w(f)), and signal level data (β2) are encoded and transmitted to the decoder. The decoder uses the first set of transform parameters (w(f)) to form a reconstructed simulation input signal intended for an acoustic environment simulation, and applies a signal level modification (α) to the reconstructed simulation input signal. The signal level modification is based on the signal level data (β2) and data (p2) related to the acoustic environment simulation. The attenuated reconstructed simulation input signal is then processed in an acoustic environment simulator. With this process, the decoder does not need to determine the signal level of the simulation input signal, thereby reducing processing load.
-
Publication No.: US11576004B2
Publication date: 2023-02-07
Application No.: US17688744
Filing date: 2022-03-07
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Grant A. Davidson, Kuan-Chieh Yen, Dirk Jeroen Breebaart
IPC: H04S7/00
Abstract: Methods and systems for designing binaural room impulse responses (BRIRs) for use in headphone virtualizers, and methods and systems for generating a binaural signal in response to a set of channels of a multi-channel audio signal, including by applying a BRIR to each channel of the set, thereby generating filtered signals, and combining the filtered signals to generate the binaural signal, where each BRIR has been designed in accordance with an embodiment of the design method. Other aspects are audio processing units configured to perform any embodiment of the inventive method. In accordance with some embodiments, BRIR design is formulated as a numerical optimization problem based on a simulation model (which generates candidate BRIRs) and at least one objective function (which evaluates each candidate BRIR), and includes identification of a best one of the candidate BRIRs as indicated by performance metrics determined for the candidate BRIRs by each objective function.
-
Publication No.: US11423917B2
Publication date: 2022-08-23
Application No.: US16882747
Filing date: 2020-05-26
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Dirk Jeroen Breebaart, David Matthew Cooper, Leif Jonas Samuelsson
IPC: G10L19/02, H04S7/00, G10L19/008
Abstract: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.
-
Publication No.: US20210295852A1
Publication date: 2021-09-23
Application No.: US17225133
Filing date: 2021-04-08
Inventor: Dirk Jeroen Breebaart, David Matthew Cooper, Leif Jonas Samuelsson, Jeroen Koppens, Rhonda J. Wilson, Heiko Purnhagen, Alexander Stahlmann
IPC: G10L19/008, G06F3/16, H04L29/06, H04S1/00, H04S7/00
Abstract: A method for encoding an input audio stream including the steps of obtaining a first playback stream presentation of the input audio stream intended for reproduction on a first audio reproduction system, obtaining a second playback stream presentation of the input audio stream intended for reproduction on a second audio reproduction system, determining a set of transform parameters suitable for transforming an intermediate playback stream presentation to an approximation of the second playback stream presentation, wherein the transform parameters are determined by minimization of a measure of a difference between the approximation of the second playback stream presentation and the second playback stream presentation, and encoding the first playback stream presentation and the set of transform parameters for transmission to a decoder.
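The parameter estimation described in this abstract can be sketched as an ordinary least-squares fit. The example below assumes, purely for illustration, that the intermediate presentation is a two-channel signal, that the difference measure is the summed squared error, and that the transform is a single real-valued 2x2 matrix for one time frame. The names `estimate_transform` and `apply_transform` and the small regularization term `reg` are hypothetical and not taken from the patent.

```python
def estimate_transform(base, target, reg=1e-9):
    """Least-squares 2x2 matrix W such that W applied to base approximates target.

    base, target: two-channel signals, each given as [channel0, channel1]
    with equal-length sample lists. reg is a small ridge term (an
    assumption of this sketch) that keeps the 2x2 system invertible.
    """
    # Gram matrix of the base presentation.
    g00 = sum(a * a for a in base[0]) + reg
    g11 = sum(b * b for b in base[1]) + reg
    g01 = sum(a * b for a, b in zip(base[0], base[1]))
    det = g00 * g11 - g01 * g01
    w = []
    for channel in target:
        # Cross-correlation of this target channel with each base channel.
        c0 = sum(t * a for t, a in zip(channel, base[0]))
        c1 = sum(t * b for t, b in zip(channel, base[1]))
        # Solve the 2x2 normal equations for this row of W.
        w.append([(c0 * g11 - c1 * g01) / det, (c1 * g00 - c0 * g01) / det])
    return w


def apply_transform(w, base):
    """Decoder side: approximate the second presentation from the base."""
    return [
        [row[0] * a + row[1] * b for a, b in zip(base[0], base[1])]
        for row in w
    ]


if __name__ == "__main__":
    first = [[1.0, 0.0, 2.0, -1.0], [0.0, 1.0, 1.0, 3.0]]
    true_w = [[0.5, 0.25], [-0.3, 0.8]]
    second = apply_transform(true_w, first)
    print(estimate_transform(first, second))
```

In this sketch the encoder would transmit the first presentation together with the four entries of `W`, and the decoder would call `apply_transform` to approximate the second presentation.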
-
Publication No.: US10659880B2
Publication date: 2020-05-19
Application No.: US16191123
Filing date: 2018-11-14
Inventor: Dirk Jeroen Breebaart, Mark David de Burgh, Nicholas Luke Appleton, Heiko Purnhagen, Mark William Gerrard, David Matthew Cooper
IPC: H04R5/02, H04R3/14, H04R5/04, H04S1/00, H04M1/03, H04M1/60, H04R3/04, H04S3/00, H04S7/00
Abstract: A method of processing audio data for replay on a mobile device with a first speaker and a second speaker, wherein the audio data comprises a respective audio signal for each of the first and second speakers, includes: determining a device orientation of the mobile device; if the determined device orientation is vertical orientation, applying a first processing mode to the audio signals for the first and second speakers; and if the determined device orientation is horizontal orientation, applying a second processing mode to the audio signals for the first and second speakers. Applying the first processing mode involves: determining respective mono audio signals in at least two frequency bands based on the audio signals for the first and second speakers; in a first one of the at least two frequency bands, routing a larger portion of the respective mono audio signal to one of the first and second speakers; and in a second one of the at least two frequency bands, routing a larger portion of the respective mono audio signal to the other one of the first and second speakers. Applying the second processing mode involves applying cross-talk cancellation to the audio signals for the first and second speakers.
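The first processing mode in this abstract (vertical orientation) can be sketched as follows. The example assumes the two speaker signals have already been split into two frequency bands by some filter bank, that the mono signal is a plain average of the two speaker signals, and that a fixed routing fraction `major` is used. The function name and these choices are hypothetical. The cross-talk cancellation of the second mode is not covered here.

```python
def vertical_mode(first_bands, second_bands, major=0.75):
    """Band-wise mono downmix with complementary routing to two speakers.

    first_bands, second_bands: per-band signals for the first and second
    speaker, each a list of two sample lists (band 0 and band 1).
    major: fraction of a band's mono signal sent to its preferred
    speaker (hypothetical parameter); the rest goes to the other one.
    """
    n_samples = len(first_bands[0])
    out_first = [0.0] * n_samples
    out_second = [0.0] * n_samples
    for band, (a, b) in enumerate(zip(first_bands, second_bands)):
        # Mono signal for this band, formed from both speaker signals.
        mono = [0.5 * (x + y) for x, y in zip(a, b)]
        # Band 0 favours the first speaker, band 1 favours the second.
        g_first = major if band == 0 else 1.0 - major
        g_second = 1.0 - g_first
        for i, m in enumerate(mono):
            out_first[i] += g_first * m
            out_second[i] += g_second * m
    return out_first, out_second


if __name__ == "__main__":
    first = [[1.0, 1.0], [0.0, 2.0]]
    second = [[1.0, -1.0], [2.0, 2.0]]
    print(vertical_mode(first, second))
```

A device implementation would select between this function and a cross-talk canceller based on the detected orientation; which band is routed to which speaker is a design choice that the abstract leaves open.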
-
Publication No.: US20190327575A1
Publication date: 2019-10-24
Application No.: US16309578
Filing date: 2017-06-20
Applicant: Dolby Laboratories Licensing Corporation
Inventor: C. Phillip Brown, Joshua Brandon Lando, Mark F. Davis, Alan J. Seefeldt, David Matthew Cooper, Dirk Jeroen Breebaart, Rhonda Wilson
IPC: H04S7/00
Abstract: A system and method of modifying a binaural signal using headtracking information. The system calculates a delay, a first filter response, and a second filter response, and applies these to the left and right components of the binaural signal according to the headtracking information. The system may also apply headtracking to parametric binaural signals. In this manner, headtracking may be applied to pre-rendered binaural audio.