Audio Object Clustering with Single Channel Quality Preservation

    公开(公告)号:US20170171687A1

    公开(公告)日:2017-06-15

    申请号:US15375488

    申请日:2016-12-12

    CPC classification number: H04S7/30 H04S2400/11 H04S2400/13

    Abstract: Example embodiments disclosed herein relate to audio object clustering with single channel quality preservation. A method of clustering audio objects is disclosed. The method includes determining cluster positions based on object positions of the audio objects and a reference speaker layout, the reference speaker layout indicating speakers located at different speaker positions. The method also includes determining object-to-cluster gains based on the determined cluster positions, the object positions and the reference speaker layout, an object-to-cluster gain defining a proportion of the respective audio object that is assigned to a cluster associated with one of the determined cluster positions. The method further includes clustering the audio objects based on the object-to-cluster gains and the cluster positions for generating cluster signals. Corresponding system, computer program product and device for clustering audio objects are also disclosed.

    Acoustic environment simulation
    15.
    发明授权

    公开(公告)号:US11721348B2

    公开(公告)日:2023-08-08

    申请号:US17510205

    申请日:2021-10-25

    CPC classification number: G10L19/008 G10L19/00 G10L19/012 G10L19/0212

    Abstract: Encoding/decoding an audio signal having one or more audio components, wherein each audio component is associated with a spatial location. A first audio signal presentation (z) of the audio components, a first set of transform parameters (w(f)), and signal level data (β2) are encoded and transmitted to the decoder. The decoder uses the first set of transform parameters (w(f)) to form a reconstructed simulation input signal intended for an acoustic environment simulation, and applies a signal level modification (α) to the reconstructed simulation input signal. The signal level modification is based on the signal level data (β2) and data (p2) related to the acoustic environment simulation. The attenuated reconstructed simulation input signal is then processed in an acoustic environment simulator. With this process, the decoder does not need to determine the signal level of the simulation input signal, thereby reducing processing load.

    Methods and systems for designing and applying numerically optimized binaural room impulse responses

    公开(公告)号:US11576004B2

    公开(公告)日:2023-02-07

    申请号:US17688744

    申请日:2022-03-07

    Abstract: Methods and systems for designing binaural room impulse responses (BRIRs) for use in headphone virtualizers, and methods and systems for generating a binaural signal in response to a set of channels of a multi-channel audio signal, including by applying a BRIR to each channel of the set, thereby generating filtered signals, and combining the filtered signals to generate the binaural signal, where each BRIR has been designed in accordance with an embodiment of the design method. Other aspects are audio processing units configured to perform any embodiment of the inventive method. In accordance with some embodiments, BRIR design is formulated as a numerical optimization problem based on a simulation model (which generates candidate BRIRs) and at least one objective function (which evaluates each candidate BRIR), and includes identification of a best one of the candidate BRIRs as indicated by performance metrics determined for the candidate BRIRs by each objective function.

    Audio decoder and decoding method
    17.
    发明授权

    公开(公告)号:US11423917B2

    公开(公告)日:2022-08-23

    申请号:US16882747

    申请日:2020-05-26

    Abstract: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.

    Methods, apparatus and systems for asymmetric speaker processing

    公开(公告)号:US10659880B2

    公开(公告)日:2020-05-19

    申请号:US16191123

    申请日:2018-11-14

    Abstract: A method of processing audio data for replay on a mobile device with a first speaker and a second speaker, wherein the audio data comprises a respective audio signal for each of the first and second speakers, includes: determining a device orientation of the mobile device; if the determined device orientation is vertical orientation, applying a first processing mode to the audio signals for the first and second speakers; and if the determined device orientation is horizontal orientation, applying a second processing mode to the audio signals for the first and second speakers. Applying the first processing mode involves: determining respective mono audio signals in at least two frequency bands based on the audio signals for the first and second speakers; in a first one of the at least two frequency bands, routing a larger portion of the respective mono audio signal to one of the first and second speakers; and in a second one of the at least two frequency bands, routing a larger portion of the respective mono audio signal to the other one of the first and second speakers. Applying the second processing mode involves applying cross-talk cancellation to the audio signals for the first and second speakers.

Patent Agency Ranking