Abstract:
The invention relates to an apparatus for improving a stereo audio signal of an FM stereo radio receiver. The apparatus comprises a parametric stereo (PS) parameter estimation stage. The parameter estimation stage is configured to determine one or more parametric stereo parameters based on the stereo audio signal in a frequency-variant or frequency-invariant manner. Preferably, these PS parameters are time- and frequency-variant. Moreover, the apparatus comprises an upmix stage. The upmix stage is configured to generate the improved stereo signal based on a first audio signal and the one or more parametric stereo parameters. The first audio signal is obtained from the stereo audio signal, e.g. by a downmix operation in a downmix stage. The PS parameter estimation stage may be part of a PS encoder. The upmix stage may be part of a PS decoder.
Abstract:
A parameter representation of a multi-channel signal having several original channels includes a parameter set, which, when used together with at least one down-mix channel allows a multi-channel reconstruction. An additional level parameter is calculated such that an energy of the at least one downmix channel weighted by the level parameter is equal to a sum of energies of the original channels. The additional level parameter is transmitted to a multi-channel reconstructor together with the parameter set or together with a down-mix channel. An apparatus for generating a multi-channel representation uses the level parameter to correct the energy of the at least one transmitted downmix channel before entering the down-mix signal into an upmixer or within the up-mixing process.
Abstract:
A method and a device are described for processing a stereo signal obtained from an encoder, which encodes an N-channel audio signal into spatial parameters (P) and a stereo down-mix comprising first and second stereo signals (L0, R0). A first signal and a third signal are added in order to obtain a first output signal (Low), wherein the first signal (L0wL) comprises the first stereo signal (L0) modified by a first complex function (g1), and the third signal (L0wR) comprises the second stereo signal (R0) modified by a third complex function (g3). A second signal and a fourth signal are added to obtain a second output signal (R0w). The fourth signal (R0wR) comprises the second stereo signal (R0) modified by a fourth complex function (g4), and the second signal (R0wL) comprises the first stereo signal (L0) modified by a second complex function (g2). The complex functions (g1, g2, g3, g4) are functions of the spatial parameters (P) and are chosen to be such that an energy value of the difference (L0wL,R0wL) between the first signal and the second signal is larger than or equal to the energy value of the sum (L0wL+R0wL) of the first and the second signal, and the energy value of the difference (R0wR−L0wR) between the fourth signal and the third signal is larger than or equal to the energy value of the sum (R0wR+L0wR) of the fourth signal and the third signal.
Abstract:
A parameter calculator calculates lower resolution parametric information and interpolation information. On a decoder-side, an upmixer is used for generating the output channels. The upmixer uses high resolution parametric information generated by a parameter interpolator using the low resolution parametric information and decoder-side derived interpolation information or encoder-generated interpolation information for selecting one of a plurality of different interpolation characteristics.
Abstract:
An audio encoder (109) has a hierarchical encoding structure and generates a data stream comprising one or more audio channels as well as parametric audio encoding data. The encoder (109) comprises an encoding structure processor (305) which inserts decoder tree structure data into the data stream. The decoder tree structure data comprises at least one data value indicative of a channel split characteristic for an audio channel at a hierarchical layer of the hierarchical decoder structure and may specifically specify the decoder tree structures to be applied by a decoder. A decoder (115) comprises a receiver (401) which receives the data stream and a decoder structure processor (405) for generating the hierarchical decoder structure in response to the decoder tree structure data. A decode processor (403) then generates output audio channels from the data stream using the hierarchical decoder structure.
Abstract:
For a multi-channel reconstruction of audio signals based on at least one base channel, an energy measure is used for compensating energy losses due to an predictive upmix. The energy measure can be applied in the encoder or the decoder. Furthermore, a decorrelated signal is added to output channels generated by an energy-loss introducing upmix procedure. The energy of the decorrelated signal is smaller than or equal to an energy error introduced by the predictive upmix. Thus, problems occurring for prediction based up-mix methods such as up-mixing signals that are coded with High Frequency Reconstruction techniques are solved, so that the correct correlation between the up-mixed channels is obtained or the up-mix is adapted to arbitrary down-mixes.
Abstract:
A method and a device are described for processing a stereo signal obtained from an encoder, which encodes an N-channel audio signal into spatial parameters (P) and a stereo down-mix comprising first and second stereo signals (LO, RO). A first signal and a third signal are added in order to obtain a first output signal (L0w), wherein the first signal (L0wL) comprises the first stereo signal (LO) modified by a first complex function (g1), and the third signal (L0wR) comprises the second stereo signal (RO) modified by a third complex function (g3). A second signal and a fourth signal are added to obtain a second output signal (R0w). The fourth signal (R0wR) comprises the second stereo signal (RO) modified by a fourth complex function (g4), and the second signal (R0wL) comprises the first stereo signal (LO) modified by a second complex function (g2). The complex functions (g1, g2, g3, g4) are functions of the spatial parameters (P) and are chosen to be such that an energy value of the difference (L0wL,R0wL) between the first signal and the second signal is larger than or equal to the energy value of the sum (L0wL+R0wL) of the first and the second signal, and the energy value of the difference (R0wR−L0wR) between the fourth signal and the third signal is larger than or equal to the energy value of the sum (R0wR+L0wR) of the fourth signal and the third signal.
Abstract:
For a multi-channel reconstruction of audio signals based on at least one base channel, an energy measure is used for compensating energy losses due to an predictive upmix. The energy measure can be applied in the encoder or the decoder. Furthermore, a decorrelated signal is added to output channels generated by an energy-loss introducing upmix procedure. The energy of the decorrelated signal is smaller than or equal to an energy error introduced by the predictive upmix. Thus, problems occurring for prediction based up-mix methods such as up-mixing signals that are coded with High Frequency Reconstruction techniques are solved, so that the correct correlation between the up-mixed channels is obtained or the up-mix is adapted to arbitrary down-mixes.
Abstract:
A multi-channel synthesizer for generating at least three output channels using an input signal having at least one base channel, the base channel being derived from the original multi-channel signal, the input signal further including at least two different up-mixing parameters, and an up-mixer mode indication indicating, in a first state that a first up-mixing rule is to be performed, and, indicating, in a second state, that a different second up-mixing rule is to be performed, uses an up-mixer for up-mixing the at least one base channel using the at least two different up-mixing parameters based on the first or the second up-mixing rule in response to the up-mixer mode indication so that the at least three output channels are obtained.
Abstract:
A parametric representation of a multi-channel audio signal having parameters suited to be used together with a monophonic downmix signal to calculate a reconstruction of the multi-channel audio signal can efficiently be derived in a stereo-backwards compatible way when a parameter combiner is used to generate the parametric representation by combining a one or more spatial parameters and a stereo parameter resulting in a parametric representation having a decoder usable stereo parameter and an information on the one or more spatial parameters that represents, together with the decoder usable stereo parameter, the one or more spatial parameters.