摘要:
In a method for coding an audio signal to obtain a coded bit stream, discrete-time samples of the audio signal are transformed into the frequency domain to obtain spectral values. The spectral values are coded with a code table having a limited number of code words of different lengths to obtain spectral values coded by code words, the length of a code word assigned to a spectral value being that much shorter the higher the probability of occurrence of the spectral value is. A raster is then specified for the coded bit stream, the raster having equidistant raster points and the distance between the raster points depending on the code table(s) used. In order to obtain error-tolerant Huffman coding, priority code words, which represent particular spectral values which are psychoacoustically more important than other spectral values, are so arranged in the raster that the start of each priority code word coincides with a raster point.
摘要:
A method for detecting a transient in a discrete-time audio signal is performed completely in the time domain and includes the step of segmenting the discrete-time audio signal so as to generate consecutive segments of the same length with unfiltered discrete-time audio signals xs(T−1). The discrete-time audio signal in a current segment is subsequently filtered. Then either the energy of the filtered discrete-time audio signal in the current segment can be compared with the energy of the filtered discrete-time audio signal in a preceding segment or a current relationship between the energy of the filtered discrete-time audio signal in the current segment and the energy of the unfiltered discrete-time audio signal in the current segment can be formed and this current relationship compared with a preceding corresponding relationship. On the basis of the one and/or the other of these comparisons it is detected whether a transient is present in the discrete-time audio signal.
摘要:
In determining a coding block raster on which a decoded signal is based, a segment of the decoded signal is picked out first, said segment beginning at a certain output sampling value of the decoded signal. Said segment is then converted into a spectral representation, whereupon said spectral representation is then evaluated in relation to a predetermined criterion in order to obtain an evaluation result for the segment. This procedure is repeated for a plurality of different segments beginning at different output sampling values each, in order to obtain a plurality of evaluation results. Finally, the plurality of the evaluation results is searched in order to establish the evaluation result that has an extreme value as compared to the other evaluation results, in such a way that it can be assumed that the segment to which this evaluation result is allocated matches the coding block raster on which the decoded signal is based. This method can be used to determine the coding block raster for any decoded signal that has no explicit information about its coding block raster.
摘要:
Jointly processed stereophonic audio signal properties are identified using a stereophonic signal as reference signal and creating a signal for testing by processing the stereophonic signal, e.g. by coding and subsequently decoding it. Both signals are transformed into the frequency domain to create representative spectral data for the respective subbands. Correlation coefficients are determined for each subband both of the reference signal and also of the signal for testing on the basis of the spectral data of the channels of the reference signal or of the signal for testing. From the comparison of the correlation coefficients belonging to the same subband, jointly processed stereophonic audio signals are detected if at least one of the correlation coefficients of the signal for testing greatly exceeds the correlation coefficient of the reference signal for the same subband.
摘要:
In the case of coding a plurality of signals which are not independent of e another, a selection of the suitable type of coding is made as a function of a similarity measure. According to one aspect of the invention, the similarity measure is determined by firstly coding one of the signals according to the intensity-stereo method and then decoding it in order to create a signal affected by coding error, whereupon the latter signal and the associated non-coded signal are transformed into the frequency domain. In the frequency domain, a selection or evaluation of the actually audible spectral components, as well as of the signal affected by coding error and of the associated signal not affected by coding error, is undertaken using a listening threshold which is determined by a psycho-acoustic calculation. Intensity-stereo coding is undertaken in the case of a high similarity measure, whereas otherwise a separate coding of the channels is performed.
摘要:
Techniques for introducing information into a data stream first obtains the spectral values of the short-term spectrum of the audio signal. Separately, information to be introduced are combined with a spread sequence obtaining a spread information signal, whereupon a spectral representation of the spread information is generated, then weighted with an established psychoacoustic maskable noise energy to generate a weighted information signal, wherein energy of the introduced information is substantially equal to or below the psychoacoustic masking threshold. The weighted information signal and the spectral values of the short-term spectrum of the audio signal are then summed and afterwards processed again to obtain a processed data stream including audio information and information to be introduced. Because the information to be introduced are introduced without changing to the time domain, the block rastering underlying the short-term spectrum are not touched, thus introducing a watermark will not lead to tandem encoding effects.
摘要:
Techniques for introducing information into a data stream first obtains the spectral values of the short-term spectrum of the audio signal. Separately, information to be introduced are combined with a spread sequence obtaining a spread information signal, whereupon a spectral representation of the spread information is generated, then weighted with an established psychoacoustic maskable noise energy to generate a weighted information signal, wherein energy of the introduced information is substantially equal to or below the psychoacoustic masking threshold. The weighted information signal and the spectral values of the short-term spectrum of the audio signal are then summed and afterwards processed again to obtain a processed data stream including audio information and information to be introduced. Because the information to be introduced are introduced without changing to the time domain, the block rastering underlying the short-term spectrum are not touched, thus introducing a watermark will not lead to tandem encoding effects.
摘要:
An inventive method for introducing information into a data stream including data about spectral values representing a short-term spectrum of an audio signal first performs a processing of the data stream to obtain the spectral values of the short-term spectrum of the audio signal. Apart from that, the information to be introduced are combined with a spread sequence to obtain a spread information signal, whereupon a spectral representation of the spread information is generated which will then be weighted with an established psychoacoustic maskable noise energy to generate a weighted information signal, wherein the energy of the introduced information is substantially equal to or below the psychoacoustic masking threshold. The weighted information signal and the spectral values of the short-term spectrum of the audio signal will then be summed and afterwards processed again to obtain a processed data stream including both audio information and information to be introduced. By the fact that the information to be introduced are introduced into the data stream without changing to the time domain, the block rastering underlying the short-term spectrum will not be touched, so that introducing a watermark will not lead to tandem encoding effects.
摘要:
An apparatus for compiling a test comprises a database having a plurality of test tasks stored therein, each test task being associated with a task type, means for selecting test tasks from the database to obtain a multitude of selected test tasks, and means for outputting the selected test tasks of the test to a user. The means for selecting test tasks comprises means for selecting, for a task type, at least one test task from the database and for taking the selected test task over to the multitude of selected test tasks if a test task for the task type is available in the database, and an exception-handling logic configured to search the database, for a task type for which no test task is available in the database, for a replacement test task according to a given replacement rule and take same over to the multitude of selected tasks.
摘要:
An apparatus for scalable encoding a spectrum of a signal including audio and/or video information, with the spectrum comprising binary spectral values, includes a means for generating a first sub-scaling layer and a second sub-scaling layer in addition to a means for forming the encoded signal, with the means for forming being implemented so as to include the first sub-scaling layer and the second sub-scaling layer into the encoded signal that the first and the second sub-scaling layer are separately decodable from each other. In contrast to a full-scaling layer, a sub-scaling layer includes only the bits of a certain order of a part of the binary spectral values in the band, so that, by additionally decoding a sub-scaling layer, a more finely controllable and a more finely scalable precision gain may be achieved.