-
Publication number: US12164832B2
Publication date: 2024-12-10
Application number: US18351357
Application date: 2023-07-12
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Mark Alexander, Chunjian Li, Joshua Brandon Lando, Alan J. Seefeldt, C. Phillip Brown, Dirk Jeroen Breebaart
Abstract: Media input audio data corresponding to a media stream and microphone input audio data from at least one microphone may be received. A first level of at least one of a plurality of frequency bands of the media input audio data, as well as a second level of at least one of a plurality of frequency bands of the microphone input audio data, may be determined. Media output audio data and microphone output audio data may be produced by adjusting levels of one or more of the first and second plurality of frequency bands based on the perceived loudness of the microphone input audio data, of the microphone output audio data, of the media output audio data and the media input audio data. One or more processes may be modified upon receipt of a mode-switching indication.
-
Publication number: US12069464B2
Publication date: 2024-08-20
Application number: US17625720
Application date: 2020-07-07
Inventor: Dirk Jeroen Breebaart, David Matthew Cooper, Giulio Cengarle, Brett G. Crockett, Rhonda J. Wilson
CPC classification number: H04S3/008, H04R5/04, H04S7/301, H04S2400/01, H04S2420/03
Abstract: A method for generating mastered audio content, the method comprising obtaining an input audio content comprising a number, M1, of audio signals, obtaining a rendered presentation of the input audio content, the rendered presentation comprising a number, M2, of audio signals, obtaining a mastered presentation generated by mastering the rendered presentation, comparing the mastered presentation with the rendered presentation to determine one or more indications of differences between the mastered presentation and the rendered presentation, and modifying one or more of the audio signals of the input audio content based on the indications of differences to generate the mastered audio content. With this approach, conventional, typically stereo, channel-based mastering tools can be used to provide a mastered version of any input audio content, including object-based immersive audio content.
-
Publication number: US20230360659A1
Publication date: 2023-11-09
Application number: US18351769
Application date: 2023-07-13
Inventor: Dirk Jeroen Breebaart, David Matthew Cooper, Leif Jonas Samuelsson
IPC: G10L19/02, H04S7/00, G10L19/008
CPC classification number: G10L19/0212, H04S7/308, G10L19/008, G10L19/0204, H04S2420/01, H04S2420/03, H04S2420/07, H04S2400/01, H04R2460/03
Abstract: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.
-
Publication number: US11736890B2
Publication date: 2023-08-22
Application number: US17372833
Application date: 2021-07-12
Inventor: Dirk Jeroen Breebaart, Lie Lu, Nicolas R. Tsingos, Antonio Mateos Sole
IPC: G10L19/20, G10L19/018, G10L19/00, H04S3/00, H04S7/00, G10L19/008
CPC classification number: H04S7/308, G10L19/00, G10L19/008, G10L19/018, G10L19/20, H04S3/002, H04S2400/11, H04S2400/13, H04S2400/15, H04S2420/03, H04S2420/07
Abstract: Diffuse or spatially large audio objects may be identified for special processing. A decorrelation process may be performed on audio signals corresponding to the large audio objects to produce decorrelated large audio object audio signals. These decorrelated large audio object audio signals may be associated with object locations, which may be stationary or time-varying locations. For example, the decorrelated large audio object audio signals may be rendered to virtual or actual speaker locations. The output of such a rendering process may be input to a scene simplification process. The decorrelation, associating and/or scene simplification processes may be performed prior to a process of encoding the audio data.
-
Publication number: US11705143B2
Publication date: 2023-07-18
Application number: US17887429
Application date: 2022-08-13
Inventor: Dirk Jeroen Breebaart, David Matthew Cooper, Leif Jonas Samuelsson
IPC: G10L19/02, H04S7/00, G10L19/008
CPC classification number: G10L19/0212, G10L19/008, G10L19/0204, H04S7/308, H04R2460/03, H04S2400/01, H04S2420/01, H04S2420/03, H04S2420/07
Abstract: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.
-
Publication number: US11641560B2
Publication date: 2023-05-02
Application number: US17465733
Application date: 2021-09-02
Inventor: Leif Jonas Samuelsson, Dirk Jeroen Breebaart, David Matthew Cooper, Jeroen Koppens
Abstract: Methods for dialogue enhancing audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal presentation, to form a dialogue presentation of the dialogue components; and combining the dialogue presentation with said second audio signal presentation to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system, wherein at least one of said first and second audio signal presentation is a binaural audio signal presentation.
-
Publication number: US20220253274A1
Publication date: 2022-08-11
Application number: US17660951
Application date: 2022-04-27
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Mark Alexander, Chunjian Li, Joshua Brandon Lando, Alan J. Seefeldt, C. Phillip Brown, Dirk Jeroen Breebaart
Abstract: Media input audio data corresponding to a media stream and microphone input audio data from at least one microphone may be received. A first level of at least one of a plurality of frequency bands of the media input audio data, as well as a second level of at least one of a plurality of frequency bands of the microphone input audio data, may be determined. Media output audio data and microphone output audio data may be produced by adjusting levels of one or more of the first and second plurality of frequency bands based on the perceived loudness of the microphone input audio data, of the microphone output audio data, of the media output audio data and the media input audio data. One or more processes may be modified upon receipt of a mode-switching indication.
-
Publication number: US10893375B2
Publication date: 2021-01-12
Application number: US16516121
Application date: 2019-07-18
Inventor: Dirk Jeroen Breebaart, David Matthew Cooper, Mark F. Davis, David S. McGrath, Kristofer Kjoerling, Harald Mundt, Rhonda J. Wilson
IPC: H04S7/00, G10L19/008, H04S3/00, H04R5/033
Abstract: A method of encoding channel or object based input audio for playback, the method including the steps of: (a) initially rendering the channel or object based input audio into an initial output presentation; (b) determining an estimate of the dominant audio component from the channel or object based input audio and determining a series of dominant audio component weighting factors for mapping the initial output presentation into the dominant audio component; (c) determining an estimate of the dominant audio component direction or position; and (d) encoding the initial output presentation, the dominant audio component weighting factors, the dominant audio component direction or position as the encoded signal for playback.
-
Publication number: US10595152B2
Publication date: 2020-03-17
Application number: US16009164
Application date: 2018-06-14
Inventor: Dirk Jeroen Breebaart, Lie Lu, Nicolas R. Tsingos, Antonio Mateos Sole
IPC: H04R5/02, H04S7/00, G10L19/008, G10L19/00, H04S3/00, G10L19/018, G10L19/20
Abstract: Diffuse or spatially large audio objects may be identified for special processing. A decorrelation process may be performed on audio signals corresponding to the large audio objects to produce decorrelated large audio object audio signals. These decorrelated large audio object audio signals may be associated with object locations, which may be stationary or time-varying locations. For example, the decorrelated large audio object audio signals may be rendered to virtual or actual speaker locations. The output of such a rendering process may be input to a scene simplification process. The decorrelation, associating and/or scene simplification processes may be performed prior to a process of encoding the audio data.
-
Publication number: US10425763B2
Publication date: 2019-09-24
Application number: US15109541
Application date: 2014-12-18
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Kuan-Chieh Yen, Dirk Jeroen Breebaart, Grant A. Davidson, Rhonda Wilson, David M. Cooper, Zhiwei Shuang
IPC: H04S7/00, H04S3/00, G10L19/008
Abstract: In some embodiments, virtualization methods for generating a binaural signal in response to channels of a multi-channel audio signal, which apply a binaural room impulse response (BRIR) to each channel including by using at least one feed-back delay network (FDN) to apply a common late reverberation to a downmix of the channels. In some embodiments, input signal channels are processed in a first processing path to apply to each channel a direct response and early reflection portion of a single-channel BRIR for the channel, and the downmix of the channels is processed in a second processing path including at least one FDN which applies the common late reverberation. Typically, the common late reverberation emulates collective macro attributes of late reverberation portions of at least some of the single-channel BRIRs. Other aspects are headphone virtualizers configured to perform any embodiment of the method.
-