摘要:
In the case of coding a plurality of signals which are not independent of e another, a selection of the suitable type of coding is made as a function of a similarity measure. According to one aspect of the invention, the similarity measure is determined by firstly coding one of the signals according to the intensity-stereo method and then decoding it in order to create a signal affected by coding error, whereupon the latter signal and the associated non-coded signal are transformed into the frequency domain. In the frequency domain, a selection or evaluation of the actually audible spectral components, as well as of the signal affected by coding error and of the associated signal not affected by coding error, is undertaken using a listening threshold which is determined by a psycho-acoustic calculation. Intensity-stereo coding is undertaken in the case of a high similarity measure, whereas otherwise a separate coding of the channels is performed.
摘要:
In coding of an audio signal, coded signals with low quality and bit rate on the one hand and coded signals with high quality and bit rate on the other hand are transmitted to a decoder. At first, the audio signal is coded with low bit rate and is transmitted to the decoder before an additional coded signal is transmitted to the decoder, which either alone or together with the first coded signal upon decoding thereof provides a decoded signal with high quality within the decoder. In this manner, a low-quality decoded signal is generated first in the decoder before decoding of the high-quality signal is possible.
摘要:
A method for detecting a transient in a discrete-time audio signal is performed completely in the time domain and includes the step of segmenting the discrete-time audio signal so as to generate consecutive segments of the same length with unfiltered discrete-time audio signals xs(T−1). The discrete-time audio signal in a current segment is subsequently filtered. Then either the energy of the filtered discrete-time audio signal in the current segment can be compared with the energy of the filtered discrete-time audio signal in a preceding segment or a current relationship between the energy of the filtered discrete-time audio signal in the current segment and the energy of the unfiltered discrete-time audio signal in the current segment can be formed and this current relationship compared with a preceding corresponding relationship. On the basis of the one and/or the other of these comparisons it is detected whether a transient is present in the discrete-time audio signal.
摘要:
In a method of coding discrete time signals (X1) sampled with a first sampling rate, second time signals (x2) are generated using the first time signals having a bandwidth corresponding to a second sampling rate, with the second sampling rate being lower than the first sampling rate. The second time signals are coded in accordance with a first coding algorithm. The coded second signals (X2c) are decoded again in order to obtain coded/decoded second time signals (X2cd) having a bandwidth corresponding to the second sampling frequency. The first time signals, by frequency domain transformation, become first spectral values (X1). Second spectral values (X2cd) are generated from the coded/decoded second time signals, the second spectral values being a representation of the coded/decoded time signals in the frequency domain. To obtain weighted spectral values, the first spectral values are weighted by means of the second spectral values, with the first and second spectral values having the same frequency and time resolution. The weighted spectral values (Xb) are coded in accordance with a second coding algorithm in consideration of a psychoacoustic model and written into a bit stream. Weighting the first spectral values and the second spectral values comprises the subtraction of the second spectral values from the first spectral values in to obtain differential spectral values.
摘要:
A method of coding a time-discrete stereo signal, the stereo signal having a first and a second channel, permits scalable stereo coding. At first, a mono signal is formed from the stereo signal, which is then coded, whereupon the coded mono signal is transmitted to a bit stream. Thereafter, the coded mono singal is decoded again, whereupon stereo information is formed on the basis of the coded/decoded mono signal and the first and second channels, with such stereo information being coded and being also written into the bit stream in order to obtain a bit stream comprising a complete coded monolayer as well as a layer with coded stereo information.
摘要:
The present invention permits a combination of a scalable audio coder with the TNS technique. In a method for coding time signals sampled in a first sampling rate, second time signals are first generated whose sampling rate is smaller than the first sampling rate. The second time signals are then coded according to a first coding algorithm and written into a bit stream. The coded second time signals are, however, decoded again, and, like the first time signals, transformed into the frequency domain. From a spectral representation of the first time signals, TNS prediction coefficients are calculated. The transformed output signal of the coder/decoder with the first coding algorithm, like the spectral representation of the first time signal, undergoes a prediction over the frequency to obtain residual spectral values for both signals, though only the prediction coefficients calculated on the basis of the first time signals are used. These two signals are evaluated against each other. The evaluated residual spectral values are then coded by means of a second coding algorithm to obtain coded evaluated residual spectral values, which, together with the side information containing the calculated prediction coefficients, are written into the bit stream.
摘要:
A method for detecting a transient in a discrete-time audio signal is performed completely in the time domain and includes the step of segmenting the discrete-time audio signal as to generate consecutive segments of the same length with unfiltered discrete-time audio signals. The discrete-time audio signal in a current segment is filtered. Either the energy of the filtered discrete-time audio signal in the current segment is compared with the energy of the filtered discrete-time audio signal in a preceding segment or a current relationship between the energy of the filtered discrete-time audio signal in the current segment and the energy of the unfiltered discrete-time audio signal in the current segment is formed and this current relationship compared with a preceding corresponding relationship. Whether a transient is present in the discrete-time audio signal is detected using one and/or the other of these comparisons.
摘要:
The tonality of an audio signal is determined by a method which includes the steps of blockwise frequency transforming a digital input signal x(n) to create a real positive-value representation X(k) of the input signal, where k designates the index of a frequency line, and determining the tonality T of the signal component for the frequency line k according to the following equation: ##EQU1## where F.sub.1 is the filter function of a first digital filter with a first, differentiating characteristic, F.sub.2 is the filter function of a second digital filter with a second, flat or integrating characteristic or with a characteristic which is less strongly differentiating than the first characteristic, and d.sub.1 and d.sub.2 are integer constants which, depending on the filter parameters, are so chosen that the delays of the filters are compensated for in each case.
摘要:
A process for transmitting and/or storing digital signals of multiple chals. This process is suited, in particular, for transmitting the five channels of 3/2 stereophony as well as for transmitting two stereo channels and three additional commentary channels. In this manner, by way of illustration, television programs with multi-language audio signals can be transmitted. This process is distinguished in that by reduction of the to-be-transmitted data, only a bit rate of 384 kbit/s is required for transmission. The reduction of the data is achieved by the K input channels being imaged in segments onto the N.ltoreq.K virtual spectral data channels, by the spectral data channels being quantized, coded, and transmitted taking into consideration the principles of psychoacoustics, and by K output channels being reproduced from the transmitted bit stream with the aid of a transmitted list from the N.ltoreq.K spectral data channels.
摘要:
A watermark generator for providing a watermark signal in dependence on binary message data includes an information processor configured to provide, in dependence on information units of the binary message data, a first time-frequency domain representation, values of which represent the binary message data. The watermark generator also includes a differential encoder configured to derive a second time-frequency domain representation from the first time-frequency-domain representation, such that the second time-frequency-domain representation includes a plurality of values, wherein a difference between two values of the second time-frequency-domain representation represents a corresponding value of the first time-frequency-domain representation, in order to obtain a differential encoding of the values of the first time-frequency-domain representation. The watermark generator also includes a watermark signal provider configured to provide the watermark signal on the basis of the second time-frequency-domain representation.