-
公开(公告)号:US10021508B2
公开(公告)日:2018-07-10
申请号:US15357810
申请日:2016-11-21
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Sven Kordon , Johann-Markus Batke , Alexander Krueger , Mark R. P. Thomas
CPC classification number: H04S7/307 , H04R1/326 , H04R1/406 , H04R3/005 , H04R5/027 , H04R29/005 , H04R2201/401 , H04S3/002 , H04S2400/15 , H04S2420/11
Abstract: Spherical microphone arrays capture a three-dimensional sound field (P(Ωc, t)) for generating an Ambisonics representation (Anm(t)), where the pressure distribution on the surface of the sphere is sampled by the capsules of the array. The impact of the microphones on the captured sound field is removed using the inverse microphone transfer function. The equalization of the transfer function of the microphone array is a big problem because the reciprocal of the transfer function causes high gains for small values in the transfer function and these small values are affected by transducer noise. The invention minimizes that noise by using a Wiener filter processing in the frequency domain, which processing is automatically controlled per wave number by the signal-to-noise ratio of the microphone array.
-
公开(公告)号:US12273698B2
公开(公告)日:2025-04-08
申请号:US18255550
申请日:2021-12-02
Applicant: Dolby Laboratories Licensing Corporation
Abstract: Some methods may involve receiving a first content stream that includes first audio signals, rendering the first audio signals to produce first audio playback signals, generating first direct sequence spread spectrum (DSSS) signals, generating first modified audio playback signals by inserting the first DSSS signals into the first audio playback signals, and causing a loudspeaker system to play back the first modified audio playback signals, to generate first audio device playback sound. The method(s) may involve receiving microphone signals corresponding to at least the first audio device playback sound and to second through Nth audio device playback sound corresponding to second through Nth modified audio playback signals (including second through Nth DSSS signals) played back by second through Nth audio devices, extracting second through Nth DSSS signals from the microphone signals and estimating at least one acoustic scene metric based, at least partly, on the second through Nth DSSS signals.
-
公开(公告)号:US11315578B2
公开(公告)日:2022-04-26
申请号:US17047403
申请日:2019-04-15
Inventor: Nicolas R. Tsingos , Mark R. P. Thomas , Christof Fersch
IPC: G10L19/008 , H04S7/00
Abstract: Some disclosed methods involve encoding or decoding directional audio data. Some encoding methods may involve receiving a mono audio signal corresponding to an audio object and a representation of a radiation pattern corresponding to the audio object. The radiation pattern may include sound levels corresponding to plurality of sample times, a plurality of frequency bands and a plurality of directions. The methods may involve encoding the mono audio signal and encoding the source radiation pattern to determine radiation pattern metadata. Encoding the radiation pattern may involve determining a spherical harmonic transform of the representation of the radiation pattern and compressing the spherical harmonic transform to obtain encoded radiation pattern metadata.
-
公开(公告)号:US20240212693A1
公开(公告)日:2024-06-27
申请号:US18404520
申请日:2024-01-04
Inventor: Nicolas R. Tsingos , Mark R. P. Thomas , Christof Fersch
IPC: G10L19/008 , H04S7/00
CPC classification number: G10L19/008 , H04S7/302 , H04S2400/11 , H04S2420/01
Abstract: Some disclosed methods involve encoding or decoding directional audio data. Some encoding methods may involve receiving a mono audio signal corresponding to an audio object and a representation of a radiation pattern corresponding to the audio object. The radiation pattern may include sound levels corresponding to plurality of sample times, a plurality of frequency bands and a plurality of directions. The methods may involve encoding the mono audio signal and encoding the source radiation pattern to determine radiation pattern metadata. Encoding the radiation pattern may involve determining a spherical harmonic transform of the representation of the radiation pattern and compressing the spherical harmonic transform to obtain encoded radiation pattern metadata.
-
公开(公告)号:US20220328052A1
公开(公告)日:2022-10-13
申请号:US17727732
申请日:2022-04-23
Inventor: Nicolas R. Tsingos , Mark R. P. Thomas , Christof Fersch
IPC: G10L19/008 , H04S7/00
Abstract: Some disclosed methods involve encoding or decoding directional audio data. Some encoding methods may involve receiving a mono audio signal corresponding to an audio object and a representation of a radiation pattern corresponding to the audio object. The radiation pattern may include sound levels corresponding to plurality of sample times, a plurality of frequency bands and a plurality of directions. The methods may involve encoding the mono audio signal and encoding the source radiation pattern to determine radiation pattern metadata. Encoding the radiation pattern may involve determining a spherical harmonic transform of the representation of the radiation pattern and compressing the spherical harmonic transform to obtain encoded radiation pattern metadata.
-
公开(公告)号:US10721559B2
公开(公告)日:2020-07-21
申请号:US16270903
申请日:2019-02-08
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Mark R. P. Thomas , Jan-Hendrik Hanschke
Abstract: A microphone array for capturing sound field audio content may include a first set of directional microphones disposed on a first framework at a first radius from a center and arranged in at least a first portion of a first spherical surface. The microphone array may include a second set of directional microphones disposed on a second framework at a second radius from the center and arranged in at least a second portion of a second spherical surface. The second radius may be larger than the first radius. The directional microphones may capture information that allows for the extraction of Higher-Order Ambisonics (HOA) signals.
-
公开(公告)号:US12003673B2
公开(公告)日:2024-06-04
申请号:US17628732
申请日:2020-07-29
Inventor: Glenn N. Dickins , Christopher Graham Hines , David Gunawan , Richard J. Cartwright , Alan J. Seefeldt , Daniel Arteaga , Mark R. P. Thomas , Joshua B. Lando
CPC classification number: H04M9/082 , G10L15/22 , G10L2015/223
Abstract: An audio processing method may involve receiving output signals from each microphone of a plurality of microphones in an audio environment, the output signals corresponding to a current utterance of a person and determining, based on the output signals, one or more aspects of context information relating to the person, including an estimated current proximity of the person to one or more microphone locations. The method may involve selecting two or more loudspeaker-equipped audio devices based, at least in part, on the one or more aspects of the context information, determining one or more types of audio processing changes to apply to audio data being rendered to loudspeaker feed signals for the audio devices and causing one or more types of audio processing changes to be applied. In some examples, the audio processing changes have the effect of increasing a speech to echo ratio at one or more microphones.
-
公开(公告)号:US11968268B2
公开(公告)日:2024-04-23
申请号:US17630779
申请日:2020-07-28
Inventor: Glenn N. Dickins , Mark R. P. Thomas , Alan J. Seefeldt , Joshua B. Lando , Daniel Arteaga , Carlos Medaglia Dyonisio , David Gunawan , Richard J. Cartwright , Christopher Graham Hines
CPC classification number: H04L67/141 , H04R1/326 , H04R1/403 , H04R1/406 , H04R3/005 , H04R3/12 , H04S7/303
Abstract: An audio session management method may involve: determining, by an audio session manager, one or more first media engine capabilities of a first media engine of a first smart audio device, the first media engine being configured for managing one or more audio media streams received by the first smart audio device and for performing first smart audio device signal processing for the one or more audio media streams according to a first media engine sample clock; receiving, by the audio session manager and via a first application communication link, first application control signals from the first application; and controlling the first smart audio device according to the first media engine capabilities, by the audio session manager, via first audio session management control signals transmitted to the first smart audio device via a first smart audio device communication link and without reference to the first media engine sample clock.
-
公开(公告)号:US11477601B2
公开(公告)日:2022-10-18
申请号:US17286313
申请日:2019-10-16
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Charles Q. Robinson , Mark R. P. Thomas , Michael J. Smithers
IPC: H04S7/00
Abstract: Some disclosed methods involve multi-band bass management. Some such examples may involve applying multiple high-pass and low-pass filter frequencies for the purpose of bass input management. Some disclosed methods treat at least some low-frequency signals as audio objects that can be panned. Some disclosed methods involve panning low and high frequencies separately. Following high-pass rendering, a power audit may determine a low-frequency deficit factor that is to be reproduced by subwoofers or other low-frequency-capable loudspeakers.
-
公开(公告)号:US11322164B2
公开(公告)日:2022-05-03
申请号:US16963489
申请日:2019-01-17
Inventor: Kristofer Kjoerling , David S. McGrath , Heiko Purnhagen , Mark R. P. Thomas
IPC: G10L19/008
Abstract: The present document describes a method (400) for encoding a soundfield representation (SR) input signal (101, 301) describing a soundfield at a reference position, wherein the SR input signal (101, 301) comprises a plurality of channels for a plurality of different directivity patterns of the soundfield at the reference position. The method (400) comprises extracting (401) one or more audio objects (103, 303) from the SR input signal (101, 301). Furthermore, the method (400) comprises determining (402) a residual signal (102, 302) based on the SR input signal (101, 301) and based on the one or more audio objects (103, 303). The method (400) also comprises performing joint coding of the one or more audio objects (103, 303) and/or the residual signal (102, 302). In addition, the method (400) comprises generating (403) a bitstream (701) based on data generated in the context of joint coding of the one or more audio objects (103, 303) and/or the residual signal (102, 302).
-
-
-
-
-
-
-
-
-