摘要:
An audio encoder for encoding audio samples has a first time domain aliasing introducing encoder configured to decode audio samples in a first encoding domain and having a first framing rule, a start window and a stop window. The audio encoder further has a second encoder configured to encode samples in a second encoding domain and having a predetermined frame size number of audio samples, and a coding warm-up period number of audio samples, the second encoder having a different second framing rule, a frame of the second encoder being an encoded representation of a number of successive audio samples that is equal to the predetermined frame size number of audio samples. The audio encoder further has a controller switching from the first to the second encoder and for modifying the second framing rule or for modifying the start or the stop window of the first encoder.
摘要:
An audio decoder has an arithmetic decoder for providing decoded spectral values on the basis of an arithmetically-encoded representation and a frequency-domain-to-time-domain converter for providing a time-domain audio representation. The arithmetic decoder selects a mapping rule describing a mapping of a code value onto a symbol code in dependence on a context state described by a numeric current context value which is determined in dependence on previously decoded spectral values. The arithmetic decoder obtains a plurality of context subregion values on the basis of previously decoded spectral values and derives a numeric current context value associated with one or more spectral values to be decoded in dependence on stored context subregion values. The arithmetic decoder computes the norm of a vector formed by a plurality of previously decoded spectral values in order to obtain a common context subregion value. An audio encoder uses a similar concept.
摘要:
An audio decoder includes an arithmetic decoder for providing a plurality of decoded spectral values on the basis of an arithmetically encoded representation of the spectral values, and a frequency-domain-to-time-domain converter for providing a time-domain audio representation using the decoded spectral values. The arithmetic decoder selects a mapping rule describing a mapping of a code value onto a symbol code in dependence on a context state described by a numeric current context value. The arithmetic decoder determines the numeric current context value in dependence on a plurality of previously decoded spectral values. The arithmetic decoder evaluates a hash table, entries of which define both significant state values and boundaries of intervals of numeric context values, in order to select the mapping rule. A mapping rule index value is individually associated to a numeric context value being a significant state value.
摘要:
An audio decoder for providing a decoded audio information includes an arithmetic decoder for providing a plurality of decoded spectral values on the basis of an arithmetically-encoded representation of the spectral values and a frequency-domain-to-time-domain converter for providing a time-domain audio representation using decoded spectral values. The arithmetic decoder is configured to select a mapping rule describing a mapping of a code value onto a symbol code in dependence on a context state. The arithmetic decoder is configured to determine a numeric current context value describing the current context state in dependence on a plurality of previously decoded spectral values and also in dependence on whether a spectral value to be decoded is in a first predetermined frequency region or in a second predetermined frequency region. An audio encoder provides an encoded audio information on the basis of an input audio information.
摘要:
An embodiment of an apparatus for generating audio subband values in audio subband channels has an analysis windower for windowing a frame of time-domain audio input samples being in a time sequence extending from an early sample to a later sample using an analysis window function having a sequence of window coefficients to obtain windowed samples. The analysis window function has a first group of window coefficients and a second group of window coefficients. The first group of window coefficients is used for windowing later time-domain samples and the second group of window coefficients is used for windowing an earlier time-domain samples. The apparatus further has a calculator for calculating the audio subband values using the windowed samples.
摘要:
In an embodiment, bitstream elements of sub-frames are encoded differentially to a global gain value so that a change of the global gain value results in an adjustment of an output level of the decoded representation of the audio content. Concurrently, the differential coding saves bits. Even further, the differential coding enables the lowering of the burden of globally adjusting the gain of an encoded bitstream. In another embodiment, a global gain control across CELP coded frames and transform coded frames is achieved by co-controlling the gain of the codebook excitation of the CELP codec, along with a level of the transform or inverse transform of the transform coded frames. In another embodiment, the gain value determination in CELP coding is performed in the weighted domain of the excitation signal.
摘要:
An audio decoder for providing a decoded audio information on the basis of an encoded audio information includes a window-based signal transformer configured to map a time-frequency representation, which is described by the encoded audio information, to a time-domain representation. The window-based signal transformer is configured to select a window, out of a plurality of windows including windows of different transition slopes and windows of different transform length, on the basis of a window information. The audio decoder includes a window selector configured to evaluate a variable-codeword-length window information in order to select a window for a processing of a given portion of the time-frequency representation associated with a given frame of the audio information.
摘要:
An apparatus calculates a number of spectral envelopes to be derived by a spectral band replication (SBR) encoder, wherein the SBR encoder is adapted to encode an audio signal using a plurality of sample values within a predetermined number of subsequent time portions in an SBR frame extending from an initial time to a final time, the predetermined number of subsequent time portions being arranged in a time sequence given by the audio signal. The apparatus has a decision value calculator for determining a decision value, the decision value measuring a deviation in spectral energy distributions of a pair of neighboring time portions. The apparatus further has a detector for detecting a violation of a threshold by the decision value and a processor for determining a first envelope border between the pair of neighboring time portions when the violation of the threshold is detected.
摘要:
In a method for encoding a symbol it is determined whether the symbol can be encoded by a codeword of a first codebook. In case this is true, the appropriate codeword for the symbol is selected from the first codebook. Otherwise, a codeword is selected from the first codebook indicating that the symbol cannot be encoded by a codeword of the first codebook and the symbol is split into a plurality of first sub-symbols and for at least one of the first sub-symbols a codeword is selected from a second codebook. Also a corresponding method for decoding is described.
摘要:
An audio decoder for providing a decoded representation of an audio content on the basis of an encoded representation of the audio content comprises a linear-prediction-domain decoder core configured to provide a time-domain representation of an audio frame on the basis of a set of linear-prediction domain parameters associated with the audio frame and a frequency-domain decoder core configured to provide a time-domain representation of an audio frame on the basis of a set of frequency-domain parameters, taking into account a transform window out of a set comprising a plurality of different transform windows. The audio decoder comprises a signal combiner configured to overlap-and-add-time-domain representations of subsequent audio frames encoded in different domains, in order to smoothen a transition between the time-domain representations of the subsequent frames. The set of transform windows comprises one or more windows specifically adapted for a transition between a frequency-domain core mode and a linear-prediction-domain core mode.