-
公开(公告)号:US09830922B2
公开(公告)日:2017-11-28
申请号:US15117647
申请日:2015-02-23
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Lianwu Chen , Lie Lu , Dirk Jeroen Breebaart
Abstract: Embodiments of the present invention relate to audio object clustering by utilizing temporal variation of audio objects. There is provided a method of estimating temporal variation of an audio object for use in audio object clustering. The method comprises obtaining at least one segment of an audio track associated with the audio object, the at least one segment containing the audio object; estimating variation of the audio object over a time duration of the at least one segment based on at least one property of the audio object and adjusting, at least partially based on the estimated variation of the audio object, a contribution of the audio object to the determination of a centroid in the audio object clustering. Corresponding system and computer program product are disclosed.
-
22.
公开(公告)号:US09747909B2
公开(公告)日:2017-08-29
申请号:US14907542
申请日:2014-07-23
Inventor: Dirk Jeroen Breebaart , Lie Lu , Antonio Mateos Sole , Nicolas R. Tsingos
IPC: G10L19/00 , G10L19/02 , G10L19/025 , G10L19/008 , G10L19/06 , G10L19/26
CPC classification number: G10L19/025 , G10L19/00 , G10L19/008 , G10L19/02 , G10L19/06 , G10L19/26
Abstract: Embodiments are directed to a method for processing an input audio signal, comprising: splitting the input audio signal into at least two components, in which the first component is characterized by fast fluctuations in the input signal envelope, and a second component that is relatively stationary over time; processing the second, stationary component by a decorrelation circuit; and constructing an output signal by combining the output of the decorrelator circuit with the input signal and/or the first component signal.
-
公开(公告)号:US09654895B2
公开(公告)日:2017-05-16
申请号:US14909058
申请日:2014-07-24
Inventor: Dirk Jeroen Breebaart , Lie Lu , Nicolas R. Tsingos , Antonio Mateos Sole
IPC: H04R5/02 , H04S7/00 , H04S3/00 , G10L19/008 , G10L19/00 , G10L19/018
CPC classification number: H04S7/308 , G10L19/00 , G10L19/008 , G10L19/018 , G10L19/20 , H04S3/002 , H04S2400/11 , H04S2400/13 , H04S2400/15 , H04S2420/03 , H04S2420/07
Abstract: Diffuse or spatially large audio objects may be identified for special processing. A decorrelation process may be performed on audio signals corresponding to the large audio objects to produce decorrelated large audio object audio signals. These decorrelated large audio object audio signals may be associated with object locations, which may be stationary or time-varying locations. For example, the decorrelated large audio object audio signals may be rendered to virtual or actual speaker locations. The output of such a rendering process may be input to a scene simplification process. The decorrelation, associating and/or scene simplification processes may be performed prior to a process of encoding the audio data.
-
公开(公告)号:US12302082B2
公开(公告)日:2025-05-13
申请号:US18606040
申请日:2024-03-15
Inventor: Leif Jonas Samuelsson , Dirk Jeroen Breebaart , David Matthew Cooper , Jeroen Koppens
Abstract: Methods for dialogue enhancing audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal presentation, to form a dialogue presentation of the dialogue components; and combining the dialogue presentation with said second audio signal presentation to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system, wherein at least one of said first and second audio signal presentation is a binaural audio signal presentation.
-
公开(公告)号:US12212953B2
公开(公告)日:2025-01-28
申请号:US18349704
申请日:2023-07-10
Inventor: Dirk Jeroen Breebaart , Lie Lu , Nicolas R. Tsingos , Antonio Mateos Sole
IPC: G10L19/20 , G10L19/00 , G10L19/008 , G10L19/018 , H04S3/00 , H04S7/00
Abstract: Diffuse or spatially large audio objects may be identified for special processing. A decorrelation process may be performed on audio signals corresponding to the large audio objects to produce decorrelated large audio object audio signals. These decorrelated large audio object audio signals may be associated with object locations, which may be stationary or time-varying locations. For example, the decorrelated large audio object audio signals may be rendered to virtual or actual speaker locations. The output of such a rendering process may be input to a scene simplification process. The decorrelation, associating and/or scene simplification processes may be performed prior to a process of encoding the audio data.
-
公开(公告)号:US20230359430A1
公开(公告)日:2023-11-09
申请号:US18351357
申请日:2023-07-12
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Mark Alexander , Chunjian Li , Joshua Brandon Lando , Alan J. Seefeldt , C. Phillip Brown , Dirk Jeroen Breebaart
CPC classification number: G06F3/165 , H04B15/00 , H04R1/1041 , H04R1/1083 , H04R3/04 , H04R29/001 , H04R5/033 , H04R2430/01
Abstract: Media input audio data corresponding to a media stream and microphone input audio data from at least one microphone may be received. A first level of at least one of a plurality of frequency bands of the media input audio data, as well as a second level of at least one of a plurality of frequency bands of the microphone input audio data, may be determined. Media output audio data and microphone output audio data may be produced by adjusting levels of one or more of the first and second plurality of frequency bands based on the perceived loudness of the microphone input audio data, of the microphone output audio data, of the media output audio data and the media input audio data. One or more processes may be modified upon receipt of a mode-switching indication.
-
公开(公告)号:US11798567B2
公开(公告)日:2023-10-24
申请号:US17225133
申请日:2021-04-08
Inventor: Dirk Jeroen Breebaart , David Matthew Cooper , Leif Jonas Samuelsson , Jeroen Koppens , Rhonda J. Wilson , Heiko Purnhagen , Alexander Stahlmann
CPC classification number: G10L19/008 , G06F3/16 , H04L65/70 , H04L65/75 , H04S1/007 , H04S7/305 , H04S2400/01 , H04S2400/03 , H04S2400/07
Abstract: A method for encoding an input audio stream including the steps of obtaining a first playback stream presentation of the input audio stream intended for reproduction on a first audio reproduction system, obtaining a second playback stream presentation of the input audio stream intended for reproduction on a second audio reproduction system, determining a set of transform parameters suitable for transforming an intermediate playback stream presentation to an approximation of the second playback stream presentation, wherein the transform parameters are determined by minimization of a measure of a difference between the approximation of the second playback stream presentation and the second playback stream presentation, and encoding the first playback stream presentation and the set of transform parameters for transmission to a decoder.
-
公开(公告)号:US20230091218A1
公开(公告)日:2023-03-23
申请号:US18060232
申请日:2022-11-30
Applicant: Dolby Laboratories Licensing Corporation
Inventor: C. Phillip Brown , Joshua Brandon Lando , Mark F. Davis , Alan J. Seefeldt , David Matthew Cooper , Dirk Jeroen Breebaart , Rhonda Wilson
IPC: H04S7/00
Abstract: A system and method of modifying a binaural signal using headtracking information. The system calculates a delay, a first filter response, and a second filter response, and applies these to the left and right components of the binaural signal according to the headtracking information. The system may also apply headtracking to parametric binaural signals. In this manner, headtracking may be applied to pre-rendered binaural audio.
-
公开(公告)号:US11277707B2
公开(公告)日:2022-03-15
申请号:US16938561
申请日:2020-07-24
Inventor: Dirk Jeroen Breebaart , Antonio Mateos Sole , Heiko Purnhagen , Nicolas R. Tsingos
Abstract: Described herein is a method (30) of rendering an audio signal (17) for playback in an audio environment (27) defined by a target loudspeaker system (23), the audio signal (17) including audio data relating to an audio object and associated position data indicative of an object position. Method (30) includes the initial step (31) of receiving the audio signal (17). At step (32) loudspeaker layout data for the target loudspeaker system (23) is received. At step (33) control data is received that is indicative of a position modification to be applied to the audio object in the audio environment (27). At step (38) in response to the position data, loudspeaker layout data and control data, rendering modification data is generated. Finally, at step (39) the audio signal (17) is rendered with the rendering modification data to output the audio signal (17) with the audio object at a modified object position that is between loudspeakers within the audio environment (27).
-
30.
公开(公告)号:US11272311B2
公开(公告)日:2022-03-08
申请号:US17090772
申请日:2020-11-05
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Grant A. Davidson , Kuan-Chieh Yen , Dirk Jeroen Breebaart
IPC: H04S7/00
Abstract: Methods and systems for designing binaural room impulse responses (BRIRs) for use in headphone virtualizers, and methods and systems for generating a binaural signal in response to a set of channels of a multi-channel audio signal, including by applying a BRIR to each channel of the set, thereby generating filtered signals, and combining the filtered signals to generate the binaural signal, where each BRIR has been designed in accordance with an embodiment of the design method. Other aspects are audio processing units configured to perform any embodiment of the inventive method. In accordance with some embodiments, BRIR design is formulated as a numerical optimization problem based on a simulation model (which generates candidate BRIRs) and at least one objective function (which evaluates each candidate BRIR), and includes identification of a best one of the candidate BRIRs as indicated by performance metrics determined for the candidate BRIRs by each objective function.
-
-
-
-
-
-
-
-
-