Abstract:
According to the invention, a device for post-processing at least one channel signal of a plurality of channel signals of a multi-channel signal is described, the at least one channel signal being generated from a decoded downmix signal by a low-bit-rate audio coding/decoding system, the device comprising: a receiver for receiving the at least one channel signal generated from the decoded downmix signal, a time envelope of the decoded downmix signal, an interchannel time difference between the channel signal and the downmix signal, and a classification indication indicating a transient type of the downmix signal; and a post-processor for post-processing the at least one channel signal based on the time envelope of the decoded downmix signal weighted by a respective weighting factor and in dependence on the classification indication and the interchannel time difference.
Abstract:
The invention relates to an audio signal synthesizer comprising a transformer for transforming a down-mix audio signal into the frequency domain to obtain a transformed audio signal; a signal generator for generating a first auxiliary signal, for generating a second auxiliary signal, and for generating a third auxiliary signal upon the basis of the transformed audio signal; a de-correlator for generating a first de-correlated signal, and for generating a second de-correlated signal from the third auxiliary signal, the first de-correlated signal and the second de-correlated signal being at least partly de-correlated; and a combiner for combining the first auxiliary signal with the first de-correlated signal to obtain a first audio signal, and for combining the second auxiliary signal with the second de-correlated signal to obtain a second audio signal, the first audio signal and the second audio signal forming a multi-channel audio signal.
Abstract:
An audio rendering system is provided that comprises a plurality of loudspeakers arranged to approximate a desired spatial sound field within a predetermined reproduction region, wherein the loudspeakers are configured to approximate the sound field based on a weighted series of orthonormal basis functions for the reproduction region.
Abstract:
In accordance with an embodiment, a method of generating an encoded audio signal includes estimating a time-frequency energy of an input audio signal from a time-frequency filter bank, computing a global variance of the time-frequency energy, determining a post-processing method according to the global variance, and transmitting an encoded representation of the input audio signal along with an indication of the determined post-processing method.
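The decision step described above can be sketched as follows. This is a minimal illustration, not the claimed method: the threshold value, the two method labels, and the one-bit indication are hypothetical, since the abstract names neither the candidate post-processing methods nor the decision rule.

```python
from statistics import pvariance


def choose_post_processing(tf_energy, threshold=1.0):
    """Pick a post-processing method from the global variance.

    tf_energy: list of frames, each a list of per-band energies
    (assumed to come from a time-frequency filter bank).
    threshold: hypothetical decision boundary.
    """
    # Flatten the time-frequency grid and compute one variance over all tiles.
    flat = [energy for frame in tf_energy for energy in frame]
    global_variance = pvariance(flat)
    # Hypothetical two-way decision; the abstract does not name the methods.
    method = "method_a" if global_variance < threshold else "method_b"
    return global_variance, method


def method_indication(method):
    # Assumed one-bit indication sent along with the encoded signal.
    return {"method_a": 0, "method_b": 1}[method]
```

A caller would transmit `method_indication(method)` next to the encoded frame so that the decoder can apply the matching post-processing.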
Abstract:
A method for parametric spatial audio coding of a multi-channel audio signal comprising a plurality of audio channel signals is provided, the method comprising: calculating at least two different spatial coding parameters for an audio channel signal of the plurality of audio channel signals; selecting at least one spatial coding parameter of the at least two different spatial coding parameters associated with the audio channel signal on the basis of the values of the calculated spatial coding parameters; including a quantized representation of the selected spatial coding parameter in a parameter section of an audio bitstream; and setting a parameter type flag in the parameter section of the audio bitstream indicating the type of the selected spatial coding parameter included in the audio bitstream.
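As a rough illustration of the select, quantize, and flag steps, the sketch below assumes the two candidate parameters are an inter-channel time difference (ITD) and an inter-channel phase difference (IPD). The parameter names, value ranges, selection rule, and 4-bit uniform quantizer are all assumptions made for the example; the abstract does not specify them.

```python
import math

# Hypothetical parameter types: name -> (type flag, maximum absolute value).
PARAM_TYPES = {"ITD": (0, 1.0e-3), "IPD": (1, math.pi)}


def quantize(value, max_abs, bits=4):
    """Uniform quantizer mapping [-max_abs, max_abs] to an integer index."""
    levels = (1 << bits) - 1
    clipped = max(-max_abs, min(max_abs, value))
    return round((clipped + max_abs) / (2.0 * max_abs) * levels)


def write_parameter_section(itd, ipd):
    """Select one parameter by value, quantize it, and set the type flag."""
    values = {"ITD": itd, "IPD": ipd}
    # Assumed rule: keep the parameter that uses the larger share of its range.
    selected = max(values, key=lambda name: abs(values[name]) / PARAM_TYPES[name][1])
    flag, max_abs = PARAM_TYPES[selected]
    return {"type_flag": flag, "index": quantize(values[selected], max_abs)}
```

The decoder would read `type_flag` first and then interpret `index` with the range that belongs to that parameter type.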
Abstract:
A method for reconstructing at least one target signal comprises determining a first set of feature vectors from the input signal, the first set of feature vectors forming a non-negative input matrix; determining a second set of feature vectors, the second set of feature vectors forming a non-negative noise matrix; decomposing the input matrix into a sum of a first matrix and a second matrix, the first matrix representing a product of a non-negative bases matrix and a non-negative weight matrix, and the second matrix representing a combination of the noise matrix and a noise weight vector; and reconstructing the at least one target signal based on the non-negative bases matrix and the non-negative weight matrix.
Abstract:
An embodiment of the present invention provides a method for generating a downmixed signal, including: performing a time-frequency transform on a received left sound channel signal and a received right sound channel signal to obtain a frequency domain signal, and dividing the frequency domain signal into several frequency bands; calculating a sound channel energy ratio and a sound channel phase difference of each frequency band; calculating a phase difference between the downmixed signal and a first sound channel signal in each frequency band according to the sound channel energy ratio and the sound channel phase difference; and calculating a frequency domain downmixed signal according to the left sound channel signal, the right sound channel signal, and the phase difference between the downmixed signal and the first sound channel signal in each frequency band. The method improves the quality of stereo encoding and decoding.
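The per-band computation can be sketched as below, taking the left channel as the first sound channel. The formulas are assumptions chosen for illustration: the energy ratio is expressed as an amplitude ratio c, the channel phase difference is taken from the cross-spectrum, the downmix phase offset is the phase of 1 + c·exp(-j·IPD), and the downmix magnitude is the average of the two channel magnitudes. The abstract does not give the actual expressions.

```python
import cmath
import math


def downmix_band(left, right):
    """left, right: lists of complex frequency-domain bins of one band."""
    e_left = sum(abs(x) ** 2 for x in left)
    e_right = sum(abs(x) ** 2 for x in right)
    # Channel energy ratio as an amplitude ratio (assumed definition).
    # A silent left channel is a degenerate case and is mapped to c = 0.
    c = math.sqrt(e_right / e_left) if e_left > 0 else 0.0
    # Channel phase difference of the band, from the cross-spectrum.
    ipd = cmath.phase(sum(l * r.conjugate() for l, r in zip(left, right)))
    # Assumed phase difference between the downmix and the left channel:
    # the phase of 1 + c * exp(-j * ipd).
    theta = math.atan2(-c * math.sin(ipd), 1.0 + c * math.cos(ipd))
    # Downmix bins: average magnitude, left-channel phase rotated by theta.
    mix = [
        0.5 * (abs(l) + abs(r)) * cmath.exp(1j * (cmath.phase(l) + theta))
        for l, r in zip(left, right)
    ]
    return theta, mix
```

With this choice, two equal-energy channels that are 90 degrees apart give a downmix phase halfway between them, which avoids the cancellation a plain sum can suffer when the channels are out of phase.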
Abstract:
A method and apparatus for providing signal processing coefficients for processing an input signal at a predetermined signal processing sampling rate, wherein the input signal is received at an input signal sampling rate, the method comprising the steps of computing a correlation or covariance function based on the received input signal at the input signal sampling rate to provide correlation or covariance coefficients at the input signal sampling rate, re-sampling the computed correlation or covariance coefficients having the input signal sampling rate to provide correlation or covariance coefficients at the predetermined signal processing sampling rate, and calculating the signal processing coefficients based on the correlation or covariance coefficients at the predetermined signal processing sampling rate.
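One way to picture the three steps is the sketch below, which takes the signal processing coefficients to be linear-prediction coefficients. That choice, the linear interpolation used to re-sample the autocorrelation lags, and all function names are assumptions for illustration; a practical system would likely use a better interpolator.

```python
def autocorrelation(x, max_lag):
    """Autocorrelation of x for lags 0..max_lag at the input sampling rate."""
    return [
        sum(x[n] * x[n - k] for n in range(k, len(x)))
        for k in range(max_lag + 1)
    ]


def resample_lags(r, fs_in, fs_proc, num_lags):
    """Re-sample autocorrelation lags from fs_in to fs_proc.

    Lag m at fs_proc corresponds to fractional lag m * fs_in / fs_proc at
    fs_in; linear interpolation between neighbours is an assumed shortcut.
    """
    out = []
    for m in range(num_lags):
        pos = m * fs_in / fs_proc
        i = int(pos)
        if i >= len(r):
            raise ValueError("not enough lags computed at the input rate")
        frac = pos - i
        nxt = r[min(i + 1, len(r) - 1)]
        out.append((1.0 - frac) * r[i] + frac * nxt)
    return out


def levinson_durbin(r, order):
    """Solve for A(z) = 1 + a1 z^-1 + ... from autocorrelation values r."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k
    return a
```

The chain would be `levinson_durbin(resample_lags(autocorrelation(x, L), fs_in, fs_proc, p + 1), p)`, so the input signal itself is never re-sampled, only its short correlation sequence.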
Abstract:
The invention relates to a method for determining an encoding parameter for an audio channel signal of a multi-channel audio signal, the method comprising: determining a frequency transform of the audio channel signal; determining a frequency transform of a reference audio signal; determining inter-channel differences for at least each frequency sub-band of a subset of frequency sub-bands, each inter-channel difference indicating a phase difference or time difference between a band-limited signal portion of the audio channel signal and a band-limited signal portion of the reference audio signal in the respective frequency sub-band with which the inter-channel difference is associated; determining a first average based on positive values of the inter-channel differences and determining a second average based on negative values of the inter-channel differences; and determining the encoding parameter based on the first average and on the second average.
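The two-average step can be sketched as follows. The final decision rule (follow the sign that occurs in more sub-bands, and break ties by magnitude) is an assumption for illustration; the abstract states only that the encoding parameter is based on both averages.

```python
def encoding_parameter(inter_channel_differences):
    """Derive one encoding parameter from per-sub-band differences."""
    positives = [d for d in inter_channel_differences if d > 0]
    negatives = [d for d in inter_channel_differences if d < 0]
    # First average over positive values, second average over negative values.
    first_average = sum(positives) / len(positives) if positives else 0.0
    second_average = sum(negatives) / len(negatives) if negatives else 0.0
    # Assumed rule: follow the sign that occurs in more sub-bands.
    if len(positives) > len(negatives):
        return first_average
    if len(negatives) > len(positives):
        return second_average
    # Tie: keep the average with the larger magnitude.
    if abs(first_average) >= abs(second_average):
        return first_average
    return second_average
```

Averaging the positive and negative values separately keeps sub-bands with opposite signs from cancelling each other, which a single mean over all sub-bands would allow.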