摘要:
In this invention, the design of the Huffman table can be done offline with a large input sequence database. The range of the quantization indices (or differential indices) for Huffman coding is identified. For each value of range, all the input signal which have the same range will be gathered and the probability distribution of each value of the quantization indices (or differential indices) within the range is calculated. For each value of range, one Huffman table is designed according to the probability. And in order to improve the bits efficiency of the Huffman coding, apparatus and methods to reduce the range of the quantization indices (or differential indices) are also introduced.
摘要:
A stereo signal encoding device is provided that enables a lower bitrate without decreasing quality when applying an intermittent transmission technique to a stereo signal. A stereo encoding unit generates first stereo encoded data by encoding the stereo signal when the stereo signal of the current frame is an audio section. A stereo DTX encoding unit is a means for encoding the stereo signal when the stereo signal of the current frame is a non-audio section. The stereo DTX encoding unit generates second stereo encoded data by encoding each of a monaural signal spectral parameter that is a spectral parameter of a monaural signal generated using the first channel signal and the second channel signal, first channel signal information relating to the first channel signal, and second channel signal information relating to the second channel signal.
摘要:
An encoding device is provided for improving decoded signal quality. A local search unit conducts a local search on a plurality of sub-bands generated by dividing spectrum data, and calculates lattice vectors for the spectra in the plurality of sub-bands. A multi-rate indexing unit uses the lattice vectors to perform multi-rate indexing on each of the sub-bands, and generates indexing information showing the results thereof. A band selection unit determines certain sub-bands from amongst the plurality of sub-bands in a plurality of encoding layers as perceptually important sub-band groups, where these are: within a selection range of sub-bands wherein the total number of encoding bits allocated to each of the plurality of sub-bands in the indexing information is equal to or less than an already set value, and within a sub-band selection range with the highest total energy of each of the plurality of sub-bands.
摘要:
By copying to a high-frequency band portion (extension band) a low-frequency band portion in which peaking has been set to a sufficiently low state, this encoding device is capable of preventing generation of a spectrum with overly high peaking in the high-frequency band portion, and of generating a high-quality extension band spectrum. This device comprises: a maximum value search unit which searches, in each of multiple sub-bands obtained by dividing the low-frequency band portion of an audio signal and/or music signal below a prescribed frequency, for the maximum value of the amplitude of a first spectrum obtained by decoding first encoded data, which is encoded data in the low-frequency band portion; and an amplitude normalization unit which obtains a normalized spectrum by normalizing, at the maximum values of the amplitude of each sub-band, the first spectrum contained in each sub-band.
摘要:
A voice coding device capable of preventing overall quality degradation even when the bit rate for coding is lowered. The voice coding device codes a wide band signal in a first layer, and codes an extended band signal whose frequency band is located in higher frequency than the wide band signal in an extended band layer. An adaptive band selection unit (301) selects a frequency band to be excluded from a coding object in the extended band layer or a frequency band whose energy is to be attenuated in the extended band layer. A band-limited signal generation unit (302) excludes, within the frequency band of an input signal, the frequency band selected by the adaptive band selection unit (301) from the coding object, or attenuates the energy of the frequency band selected by the adaptive band selection unit (301).
摘要:
Provided is a decoding device which suppresses generation of an abnormal sound caused by a layer switch. The decoding device includes: a first layer decoding unit (202) which performs a decoding process on first layer encoded data so as to generate a first layer decoding signal; a second layer decoding unit (203) which performs a decoding process on second layer encoded data so as to generate a first layer decoding error signal; an adder (204) which adds the first layer decoding signal and the first layer decoding error signal so as to generate a second layer decoding signal; a switching unit (205) which performs switching between the first layer signal and the second layer decoding signal for output according to layer information; and a post-filter (206) which selects a control parameter corresponding to the respective layer information and performs a control parameter smoothing process so as to generate a smoothed control parameter and performs a filter process on the decoding signal from the switching unit (205) by using the generated smoothed control parameter.
摘要:
A decoding device is capable of flexibly calculating high-band spectrum data with a high accuracy in accordance with an encoding band selected by an upper-node layer of the encoding side. In this device: a first layer decoder decodes first layer encoded information to generate a first layer decoded signal; a second layer decoder decodes second layer encoded information to generate a second layer decoded signal; a spectrum decoder performs a band extension process by using the second layer decoded signal and the first layer decoded signal up-sampled in an up-sampler so as to generate an all-band decoded signal; and a switch outputs the first layer decoded signal or the all-band decoded signal according to the control information generated in a controller.
摘要:
There is disclosed an encoding device capable of improving similarity between the high frequency band spectrum of the original signal and a new spectrum to be generated while realizing a low bit rate when encoding a wide-band signal spectrum. The encoding device has sub-band amplitude calculation units (122, 128) for calculating the amplitude of the respective sub-bands for the high frequency band spectrum obtained from the wide-band signal. A search unit (124) and a gain codebook (125) select some sub-bands from a plurality of sub-bands and only the gain of the selected sub-bands is subjected to encoding. An interpolation unit (126) expresses the gain of the sub-band not selected, by mutually interpolating the selected gains.
摘要:
There is provided an audio encoding device capable of maintaining continuity of spectrum energy and preventing degradation of audio quality even when a spectrum of a low range of an audio signal is copied at a high range a plurality of times. The audio encoding device (100) includes: an LPC quantization unit (102) for quantizing an LPC coefficient; an LPC decoding unit (103) for decoding the quantized LPC coefficient; an inverse filter unit (104) for flattening the spectrum of the input audio signal by the inverse filter configured by using the decoding LPC coefficient; a frequency region conversion unit (105) for frequency-analyzing the flattened spectrum; a first layer encoding unit (106) for encoding the low range of the flattened spectrum to generate first layer encoded data; a first layer decoding unit (107) for decoding the first layer encoded data to generate a first layer decoded spectrum, and a second layer encoding unit (108) for encoding.
摘要:
A sound encoding device enabling the amount of delay to be kept small and the distortion between frames to be mitigated. In the sound encoding device, a window multiplication part (211) of a long analysis section (21) multiplies a long analysis frame signal of analysis length M1 by an analysis window, the resultant signal multiplied by the analysis window is outputted to an MDCT section (212), and the MDCT section (212) performs MDCT of the input signal to obtain the transform coefficients of the long analysis frame and outputs it to a transform coefficient encoding section (30). The window multiplication part (221) of a short analysis section (22) multiplies a short analysis frame signal of analysis length M2 (M2