摘要:
For determining a quantizer step size for quantizing a signal including audio or video information, a first quantizer step size as well as an interference threshold are provided. Then, the actual interference introduced by the first quantizer step size is determined and compared with the interference threshold. Despite the fact that the comparison reveals that the actually introduced interference exceeds the threshold, a second, coarser quantizer step size is nevertheless used, which will then be used for quantization if it turns out that the interference introduced by the coarser, second quantizer step size falls below the threshold or falls below the interference introduced by the first quantizer step size. Thus, the quantization interference is reduced while the quantization is coarsened and, thus, the compression gain is increased.
摘要:
The present document relates to processing of multimedia data, notably the encoding, the transmission, the decoding and the rendering of multimedia data, e.g. audio files or bitstreams. In particular, the present document relates to the implementation of loudness control in multimedia players. A method for providing loudness related data to a media player is described. The method comprises the steps of providing a first loudness related value associated with an audio signal; wherein the first loudness related value has been determined according to a first procedure; of converting the first loudness related value into a second loudness related value using a reversible relation; wherein the second loudness related value is associated with a second procedure for determining loudness related values; of storing the second loudness related value in metadata associated with the audio signal; and of providing the metadata to the media player.
摘要:
Many portable playback devices cannot decode and playback encoded audio content having wide bandwidth and wide dynamic range with consistent loudness and intelligibility unless the encoded audio content has been prepared specially for these devices. This problem can be overcome by including with the encoded content some metadata that specifies a suitable dynamic range compression profile by either absolute values or differential values relative to another known compression profile. A playback device may also adaptively apply gain and limiting to the playback audio. Implementations in encoders, in transcoders and in decoders are disclosed.
摘要:
The invention discloses a method and an encoder for processing a digital audio stereo signal. A digital audio encoder for coding such audio signal comprises a predictive Temporal Noise Shaping (TNS) filter, a Mid-/Side (M/S) coding unit, a control unit for determining a first prediction gain related to the unmodified L/R signal processed by the TNS filter and for determining a second prediction gain related to the M/S-coded L/R signal processed by the TNS filter, wherein the control unit is adapted to disable TNS-filtering—i.e. to bypass the TNS filter—for a current signal frame, if the first and second prediction gains differ by more than a pre-determined mismatch range. Preferably, the first and second prediction gains are determined from signal energy ratios calculated for each channel of the stereo signal including the signal energies of both the TNS-processed (unmodified) L- respectively (unmodified) R-signal and the TNS-processed M/S coded L- respectively M/S coded R-signal divided by the respective signal energies before TNS processing. Furthermore, the control unit is preferably adapted to overrule the disabling of the TNS filter, if the input signal is a near-mono audio signal exhibiting only low energy either in its M- or S-band. In that case, operation of the TNS filter on the stereo audio signal is maintained.
摘要:
An apparatus for encoding an information signal having discrete values includes a quantizer having a quantizer border, wherein the quantizer is adapted so that a discrete value above the quantization border is quantized to a quantization index, which is different from a quantization index obtained by quantizing a discrete value below the quantization border, a controller for modifying the quantization border, wherein the quantizer having a first quantization border setting is adapted to generate a first set of quantization indices for the discrete values, and wherein the quantizer having a second modified quantization border setting is adapted to generate a second set of quantization indices, and an output interface for outputting an encoded information signal which is either based on the first set of quantization indices or the second set of quantization indices dependent on a decision function.
摘要:
The present invention teaches a new audio coding system that can code both general audio and speech signals well at low bit rates. A proposed audio coding system comprises linear prediction unit for filtering an input signal based on an adaptive filter; a transformation unit for transforming a frame of the filtered input signal into a transform domain; and a quantization unit for quantizing the transform domain signal. The quantization unit decides, based on input signal characteristics, to encode the transform domain signal with a model-based quantizer or a non-model-based quantizer. Preferably, the decision is based on the frame size applied by the transformation unit.
摘要:
An apparatus for processing a multi-channel signal includes a means for determining a similarity between a first one of two channels and a second one of the two channels. Furthermore, a means for performing a prediction filtering of the spectral coefficients is provided, which is formed to perform a prediction filtering with only a single prediction filter for both channels in case of high similarity between the first and the second channel, and to perform a prediction filtering with two separate prediction filters in case of a dissimilarity between the first and the second channel. With this, an introduction of stereo artifacts and a deterioration of the coding gain in stereo coding techniques are avoided.
摘要:
An apparatus for encoding an audio signal to obtain an encoded audio signal to be used by a decoder having a high frequency reconstruction module for performing a high frequency reconstruction for a frequency range above a crossover frequency includes, a core encoder for encoding a lower frequency band of the audio signal up to the crossover frequency, the crossover frequency being variable, and the core encoder being operable on a block-wise frame by frame basis, and a crossover frequency control module for estimating, dependent on a measure of the degree of difficulty for encoding the audio signal by the core encoder and/or a boarder between a tonal and a noise-like frequency range of the audio signal, the crossover frequency to be selected by the core encoder for a frame of a series of subsequent frames, so that the crossover frequency is variable adaptively over time for the series of subsequent frames.
摘要:
An apparatus for generating a data stream having a series of segments using data organized in subsequent data frames. The apparatus comprises a packetiser for packetising data from a data frame into the series of segments having a first segment and a second segment, where the second segment has interpretable data entities and has a data entity fragment, the data entity fragment including only a part of an interpretable data entity preceding an interpretable data entity. The apparatus comprises furthermore an information block adder for adding an information block associated with the second segment, the information block indicating an entry point into the second segment, the entry point indicating a start of the interpretable data entity following the data entity fragment.
摘要:
For determining a quantizer step size for quantizing a signal including audio or video information, a first quantizer step size as well as an interference threshold are provided. Then, the actual interference introduced by the first quantizer step size is determined and compared with the interference threshold. Despite the fact that the comparison reveals that the actually introduced interference exceeds the threshold, a second, coarser quantizer step size is nevertheless used, which will then be used for quantization if it turns out that the interference introduced by the coarser, second quantizer step size falls below the threshold or falls below the interference introduced by the first quantizer step size. Thus, the quantization interference is reduced while the quantization is coarsened and, thus, the compression gain is increased.