-
公开(公告)号:US12119010B2
公开(公告)日:2024-10-15
申请号:US18366385
申请日:2023-08-07
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Dirk Jeroen Breebaart
IPC: G10L19/012 , G10L19/00 , G10L19/008 , G10L19/02
CPC classification number: G10L19/008 , G10L19/00 , G10L19/012 , G10L19/0212
Abstract: Encoding/decoding an audio signal having one or more audio components, wherein each audio component is associated with a spatial location. A first audio signal presentation (z) of the audio components, a first set of transform parameters (w(f)), and signal level data (β2) are encoded and transmitted to the decoder. The decoder uses the first set of transform parameters (w(f)) to form a reconstructed simulation input signal intended for an acoustic environment simulation, and applies a signal level modification (α) to the reconstructed simulation input signal. The signal level modification is based on the signal level data (β2) and data (p2) related to the acoustic environment simulation. The attenuated reconstructed simulation input signal is then processed in an acoustic environment simulator. With this process, the decoder does not need to determine the signal level of the simulation input signal, thereby reducing processing load.
-
公开(公告)号:US12002480B2
公开(公告)日:2024-06-04
申请号:US18351769
申请日:2023-07-13
Inventor: Dirk Jeroen Breebaart , David Matthew Cooper , Leif Jonas Samuelsson
IPC: G10L19/02 , G10L19/008 , H04S7/00
CPC classification number: G10L19/0212 , G10L19/008 , G10L19/0204 , H04S7/308 , H04R2460/03 , H04S2400/01 , H04S2420/01 , H04S2420/03 , H04S2420/07
Abstract: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.
-
公开(公告)号:US11950078B2
公开(公告)日:2024-04-02
申请号:US18309099
申请日:2023-04-28
Inventor: Leif Jonas Samuelsson , Dirk Jeroen Breebaart , David Matthew Cooper , Jeroen Koppens
CPC classification number: H04S1/002 , H04R5/04 , H04S3/00 , H04S7/303 , H04S3/008 , H04S3/02 , H04S2420/01 , H04S2420/03
Abstract: Methods for dialogue enhancing audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal presentation, to form a dialogue presentation of the dialogue components; and combining the dialogue presentation with said second audio signal presentation to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system, wherein at least one of said first and second audio signal presentation is a binaural audio signal presentation.
-
公开(公告)号:US20240105186A1
公开(公告)日:2024-03-28
申请号:US18487232
申请日:2023-10-16
Inventor: Dirk Jeroen Breebaart , David Matthew Cooper , Leif Jonas Samuelsson , Jeroen Koppens , Rhonda J. Wilson , Heiko Purnhagen , Alexander Stahlmann
CPC classification number: G10L19/008 , G06F3/16 , H04L65/70 , H04L65/75 , H04S1/007 , H04S7/305 , H04S2400/01 , H04S2400/03 , H04S2400/07
Abstract: A method for encoding an input audio stream including the steps of obtaining a first playback stream presentation of the input audio stream intended for reproduction on a first audio reproduction system, obtaining a second playback stream presentation of the input audio stream intended for reproduction on a second audio reproduction system, determining a set of transform parameters suitable for transforming an intermediate playback stream presentation to an approximation of the second playback stream presentation, wherein the transform parameters are determined by minimization of a measure of a difference between the approximation of the second playback stream presentation and the second playback stream presentation, and encoding the first playback stream presentation and the set of transform parameters for transmission to a decoder.
-
公开(公告)号:US11943605B2
公开(公告)日:2024-03-26
申请号:US17694506
申请日:2022-03-14
Inventor: Dirk Jeroen Breebaart , Antonio Mateos Sole , Heiko Purnhagen , Nicolas R. Tsingos
CPC classification number: H04S7/303 , H04R5/02 , H04S3/008 , H04S7/30 , H04S2400/11 , H04S2420/03
Abstract: Described herein is a method (30) of rendering an audio signal (17) for playback in an audio environment (27) defined by a target loudspeaker system (23), the audio signal (17) including audio data relating to an audio object and associated position data indicative of an object position. Method (30) includes the initial step (31) of receiving the audio signal (17). At step (32) loudspeaker layout data for the target loudspeaker system (23) is received. At step (33) control data is received that is indicative of a position modification to be applied to the audio object in the audio environment (27). At step (38) in response to the position data, loudspeaker layout data and control data, rendering modification data is generated. Finally, at step (39) the audio signal (17) is rendered with the rendering modification data to output the audio signal (17) with the audio object at a modified object position that is between loudspeakers within the audio environment (27).
-
公开(公告)号:US20230335142A1
公开(公告)日:2023-10-19
申请号:US18043905
申请日:2021-09-07
Inventor: Dirk Jeroen Breebaart , Michael Eckert , Heiko Purnhagen
IPC: G10L19/008 , G10L19/22
CPC classification number: G10L19/008 , G10L19/22
Abstract: A method comprising receiving a first input bit stream for a first parametrically coded input audio signal, the first input bit stream including data representing a first input core audio signal and a first set including at least one spatial parameter relating to the first parametrically coded input audio signal. A first covariance matrix of the first parametrically coded audio signal is determined based on the spatial parameter(s) of the first set. A modified set including at least one spatial parameter is determined based on the determined first covariance matrix, wherein the modified set is different from the first set. An output core audio signal is determined, which is based on, or constituted by, the first input core audio signal. An output bit stream for a parametrically coded output audio signal is generated, the output bit stream including data representing the output core audio signal and the modified set.
-
公开(公告)号:US11553296B2
公开(公告)日:2023-01-10
申请号:US17167442
申请日:2021-02-04
Applicant: Dolby Laboratories Licensing Corporation
Inventor: C. Phillip Brown , Joshua Brandon Lando , Mark F. Davis , Alan J. Seefeldt , David Matthew Cooper , Dirk Jeroen Breebaart , Rhonda Wilson
IPC: H04S7/00
Abstract: A system and method of modifying a binaural signal using headtracking information. The system calculates a delay, a first filter response, and a second filter response, and applies these to the left and right components of the binaural signal according to the headtracking information. The system may also apply headtracking to parametric binaural signals. In this manner, headtracking may be applied to pre-rendered binaural audio.
-
公开(公告)号:US11430463B2
公开(公告)日:2022-08-30
申请号:US17259787
申请日:2019-07-11
Inventor: Giulio Cengarle , Antonio Mateos Sole , Dirk Jeroen Breebaart
IPC: G10L21/034 , G10L21/028 , G10L25/18 , G10L25/51
Abstract: Various embodiments are disclosed for (possibly simultaneously) applying EQ and DRC to audio signals. In an embodiment, a method comprises: dividing an input audio signal into n frames, where n is a positive integer greater than one; dividing each frame of the input audio signal into Nb frequency bands, where Nb is a positive integer greater than one; for each frame n: computing an input level of the input audio signal in each band f, resulting in a input audio level distribution for the input audio signal; computing a gain for each band f based at least in part on a mapping of one or more properties of the input audio level distribution to a reference N audio level distribution computed from one or more reference audio signals; and applying each computed gain for each band f to each corresponding band f of the input audio signal.
-
公开(公告)号:US11354088B2
公开(公告)日:2022-06-07
申请号:US17248992
申请日:2021-02-16
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Mark Alexander , Chunjian Li , Joshua Brandon Lando , Alan J. Seefeldt , C. Phillip Brown , Dirk Jeroen Breebaart
Abstract: Media input audio data corresponding to a media stream and microphone input audio data from at least one microphone may be received. A first level of at least one of a plurality of frequency bands of the media input audio data, as well as a second level of at least one of a plurality of frequency bands of the microphone input audio data, may be determined. Media output audio data and microphone output audio data may be produced by adjusting levels of one or more of the first and second plurality of frequency bands based on the perceived loudness of the microphone input audio data, of the microphone output audio data, of the media output audio data and the media input audio data. One or more processes may be modified upon receipt of a mode-switching indication.
-
50.
公开(公告)号:US11212638B2
公开(公告)日:2021-12-28
申请号:US17012076
申请日:2020-09-04
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Kuan-Chieh Yen , Dirk Jeroen Breebaart , Grant A. Davidson , Rhonda Wilson , David M. Cooper , Zhiwei Shuang
IPC: H04S7/00 , G10L19/008 , H04S3/00
Abstract: In some embodiments, virtualization methods for generating a binaural signal in response to channels of a multi-channel audio signal, which apply a binaural room impulse response (BRIR) to each channel including by using at least one feedback delay network (FDN) to apply a common late reverberation to a downmix of the channels. In some embodiments, input signal channels are processed in a first processing path to apply to each channel a direct response and early reflection portion of a single-channel BRIR for the channel, and the downmix of the channels is processed in a second processing path including at least one FDN which applies the common late reverberation. Typically, the common late reverberation emulates collective macro attributes of late reverberation portions of at least some of the single-channel BRIRs. Other aspects are headphone virtualizers configured to perform any embodiment of the method.
-
-
-
-
-
-
-
-
-