摘要:
A multi-reference quantization device and method for quantizing an input LPC filter, comprises a plurality of differential quantizers using respective, different references, and a selector of a reference amongst the different references of the differential quantizers using a reference selection criterion. The input LPC filter is differentially quantized by the differential quantizer using the selected reference. A device and method for inverse quantizing a multi-reference differentially quantized LPC filter extracted from a bitstream, comprises an extractor from the bitstream of information about a reference amongst a plurality of possible references used for quantizing the multi-reference differentially quantized LPC filter, and a differential inverse quantizer using the reference corresponding to the extracted reference information to inverse quantize the multi-reference differentially quantized LPC filter.
摘要:
In a method and device for low-frequency emphasis, where the spectrum of a sound signal is transformed in a frequency domain and comprises transform coefficients grouped in a number of blocks, a maximum energy for one block having a position index is calculated. Also, a factor having a position index smaller than the position index of the block with maximum energy is calculated for each block. For each block, an energy of the block is calculated, the factor is computed from the calculated maximum energy and the computed energy of the block, and a gain is determined from the factor and applied to the transform coefficients of the block.
摘要:
A perceptual weighting device for producing a perceptually weighted signal in response to a wideband signal comprises a signal pre-emphasis filter, a synthesis filter calculator, and a perceptual weighting filter. The signal pre-emphasis filter enhances the high frequency content of the wideband signal to thereby produce a pre-emphasized signal. The signal pre-emphasis filter has a transfer function of the form: P(z)=1−μz−1, wherein μ is a pre-emphasis factor having a value located between 0 and 1. The synthesis filter calculator is responsive to the pre-emphasized signal for producing synthesis filter coefficients. Finally, the perceptual weighting filter processes the pre-emphasized signal in relation to the synthesis filter coefficients to produce the perceptually weighted signal. The perceptual weighting filter has a transfer function, with fixed denominator, of the form: W(z)=A(z/γ1)/(1−γ2z−1) where 0
摘要:
A system and method code an object-based audio signal comprising audio objects in response to audio streams with associated metadata. In the system and method, a metadata processor codes the metadata and generates information about bit-budgets for the coding of the metadata of the audio objects. An encoder codes the audio streams while a bit-budget allocator is responsive to the information about the bit-budgets for the coding of the metadata of the audio objects from the metadata processor to allocate bitrates for the coding of the audio streams by the encoder.
摘要:
A method and device for detecting an attack in a sound signal to be coded wherein the sound signal is processed in successive frames each including a number of sub-frames. The device comprises a first-stage attack detector for detecting the attack in a last sub-frame of a current frame, and a second-stage attack detector for detecting the attack in one of the sub-frames of the current frame, including the sub-frames preceding the last sub-frame. No attack is detected when the current frame is not an active frame previously classified to be coded using a generic coding mode. A method and device for coding an attack in a sound signal are also provided. The coding device comprises the above mentioned attack detecting device and an encoder of the sub-frame comprising the detected attack using a transition coding mode using a glottal-shape codebook populated with glottal impulse shapes.
摘要:
A stereo sound signal encoding method and system for time domain down mixing right and left channels of an input stereo sound signal into primary and secondary channels, determine normalised correlations of the left channel and right channel in relation to a monophonic signal version of the sound. A long-term correlation difference is determined on the basis of the normalised correlation of the left channel and the normalized correlation of the right channel. The long-term correlation difference is converted into a factor β, and the left and right channels are mixed to produce the primary and secondary channels using the factor β, wherein the factor β determines respective contributions of the left and right channels upon production of the primary and secondary channels.
摘要:
A device and method for quantizing a gain of a fixed contribution of an excitation in a frame, including sub-frames, of a coded sound signal, wherein the gain of the fixed excitation contribution is estimated in a sub-frame using a parameter representative of a classification of the frame. The gain of the fixed excitation contribution is then quantized in the sub-frame using the estimated gain. The device and method is used in jointly quantizing gains of adaptive and fixed contributions of an excitation in a frame of a coded sound signal. For retrieving a quantized gain of a fixed contribution of an excitation in a sub-frame of a frame, the gain of the fixed excitation contribution is estimated using a parameter representative of a classification of the frame, a gain codebook supplies a correction factor in response to a received, gain codebook index, and a multiplier multiplies the estimated gain by the correction factor to provide a quantized gain of the fixed excitation contribution.
摘要:
The present disclosure relates to a device and method for reducing quantization noise in a signal contained in a time-domain excitation decoded by a time-domain decoder. The decoded time-domain excitation is converted into a frequency-domain excitation. A weighting mask is produced for retrieving spectral information lost in the quantization noise. The frequency-domain excitation is modified to increase spectral dynamics by application of the weighting mask. The modified frequency-domain excitation is converted into a modified time-domain excitation. The method and device can be used for improving music content rendering of linear-prediction (LP) based codecs. Optionally, a synthesis of the decoded time-domain excitation may be classified into one of a first set of excitation categories and a second set of excitation categories, the second set including INACTIVE or UNVOICED categories, the first set including an OTHER category.
摘要:
A device and a method for quantizing, in a super-frame including a sequence of frames, LPC filters calculated during the frames of the sequence. The LPC filter quantizing device and method comprises: an absolute quantizer for first quantizing one of the LPC filters using absolute quantization; and at least one quantizer of the other LPC filters using a quantization mode selected from the group consisting of absolute quantization and differential quantization relative to at least one previously quantized filter amongst the LPC filters. For inverse quantizing, at least the first quantized LPC filter is received and an inverse quantizer inverse quantizes the first quantized LPC filter using absolute inverse quantization. If any quantized LPC filter other than the first quantized LPC filter is received, an inverse quantizer inverse quantizes this quantized LPC filter using one of absolute inverse quantization and differential inverse quantization relative to at least one previously received quantized LPC filter.
摘要:
A frequency-domain noise shaping method and device interpolates a spectral shape and a time-domain envelope of a quantization noise in a windowed and transform-coded audio signal. In the method and device, transform coefficients of the windowed and transform-coded audio signal are split into a plurality of spectral bands. For each spectral band, a first gain representing a spectral shape of the quantization noise at a first transition between a first time window and a second time window is calculated, a second gain representing a spectral shape of the quantization noise at a second transition between the second time window and a third time window is calculated, and the transform coefficients of the second time window are filtered based on the first and second gains, to interpolate between the first and second transitions the spectral shape and the time-domain envelope of the quantization noise.