Abstract:
A vector joint encoding/decoding method and a vector joint encoder/decoder are provided, more than two vectors are jointly encoded, and an encoding index of at least one vector is split and then combined between different vectors, so that encoding idle spaces of different vectors can be recombined, thereby facilitating saving of encoding bits, and because an encoding index of a vector is split and then shorter split indexes are recombined, thereby facilitating reduction of requirements for the bit width of operating parts in encoding/decoding calculation.
Abstract:
A voice signal encoding and decoding method, device, and codec system are provided. The coding method includes: encoding an input voice signal to obtain a broadband code stream, where the broadband code stream includes a core layer bit stream and an extension enhancement layer bit stream (101); compressing the core layer bit stream to obtain a compressed code stream (102); and packing the compressed code stream and the extension enhancement layer bit stream to obtain a packed code stream (103). The core layer bit stream compressed, and the compressed code stream and the extension enhancement layer bit stream are packed, thereby reducing transmission bandwidth occupied by the input voice signal. Since the broadband voice encoding is performed on the input voice signal, a broadband voice code stream is transmitted by using narrowband transmission bandwidth, thereby improving the cost performance of voice signal transmission.
Abstract:
The present disclosure relates to a signal analyzer for processing an overlapped input signal frame comprising 2N subsequent input signal values. The signal analyzer comprises: a windower adapted to window the overlapped input signal frame to obtain a windowed signal, wherein the windower is adapted to zero M+N/2 subsequent input signal values of the overlapped input signal frame, wherein M is equal or greater than 1 and smaller than N/2; and a transformer adapted to transform the remaining 3N/2−M subsequent windowed signal values of the windowed signal using N−M sets of transform parameters to obtain a transformed-domain signal comprising N−M transformed-domain signal values.
Abstract:
In a method to decode signals, a computing device decodes spectral coefficients of a current frame are grouped into a plurality of sub-bands. The computing device classifies a sub-band as a bit allocation unsaturated sub-band based on an average quantity of allocated bits per spectral coefficient of a sub-band of the plurality of sub-bands and a threshold. The computing device obtains a noise filling gain based on an envelope of the sub-band, and obtains a reconstructed spectral coefficient of the sub-band by performing noise filling based on the noise filling gain. The computing device then obtains a frequency domain audio signal based on spectral coefficients in the sub-band obtained by decoding and the reconstructed spectral coefficient.
Abstract:
In a method to decode signals, a computing device decodes spectral coefficients of a current frame are grouped into a plurality of sub-bands. The computing device classifies a sub-band as a bit allocation unsaturated sub-band based on an average quantity of allocated bits per spectral coefficient of a sub-band of the plurality of sub-bands and a threshold. The computing device obtains a noise filling gain based on an envelope of the sub-band, and obtains a reconstructed spectral coefficient of the sub-band by performing noise filling based on the noise filling gain. The computing device then obtains a frequency domain audio signal based on spectral coefficients in the sub-band obtained by decoding and the reconstructed spectral coefficient.
Abstract:
A method includes detecting whether there is a very short pitch lag in a speech or audio signal that is shorter than a conventional minimum pitch limitation using a combination of time domain and frequency domain pitch detection techniques. The pitch detection techniques include using pitch correlations in a time domain and detecting a lack of low frequency energy in the speech or audio signal in a frequency domain. The detected very short pitch lag is coded using a pitch range from a predetermined minimum very short pitch limitation that is smaller than the conventional minimum pitch limitation.
Abstract:
A audio signal encoding method and apparatus includes: obtaining an audio signal comprising a plurality of sub-bands, wherein each sub-band has an index; obtaining a spectrum energy of each sub-band of at least a part of the plurality of sub-bands; obtaining a highest index of a sub-band to be allocated bits according to the spectrum energy and a ratio factor, wherein the ratio factor is greater than 0 and less than 1; allocating at least one bit for a sub-band having an index no greater than the highest index; and encoding a spectrum coefficient of the sub-band having the index no greater than the highest index with the allocated at least one bit. In this manner, the signal bandwidth is effectively coded and decoded by centralizing the bits.
Abstract:
A method and device for decoding a signal, where the method includes: obtaining an average quantity of allocated bits per spectral coefficient of a sub-band of a current frame of the audio signal, wherein the sub-band includes a plurality of spectral coefficients; obtaining a noise filling gain for the sub-band when the average quantity of allocated bits per spectral coefficient is less than a classification threshold; reconstructing, according to the noise filling gain, at least some of the spectral coefficients to generate reconstructed spectral coefficients when the average quantity of allocated bits per spectral coefficient is less than a classification threshold; obtaining a frequency domain signal according to the reconstructed spectral coefficients; and generating a time domain signal based on the frequency domain signal. Therefore, a sub-band with unsaturated bit allocation in a frequency domain signal may be obtained by classification, thereby improving signal decoding quality.
Abstract:
A method for predicting a bandwidth extension frequency band signal includes demultiplexing a received bitstream to obtain a frequency domain signal; determining whether a highest frequency bin, to which a bit is allocated, of the frequency domain signal is less than a preset start frequency bin of a bandwidth extension frequency band; predicting an excitation signal of the bandwidth extension frequency band according to the determination; and predicting the bandwidth extension frequency band signal according to the predicted excitation signal of the bandwidth extension frequency band and a frequency envelope of the bandwidth extension frequency band.
Abstract:
A method and an apparatus for detecting correctness of a pitch period, where the method for detecting correctness of a pitch period includes determining, according to an initial pitch period of an input signal in a time domain, a pitch frequency bin of the input signal, where the initial pitch period is obtained by performing open-loop detection on the input signal, determining, based on an amplitude spectrum of the input signal in a frequency domain, a pitch period correctness decision parameter, associated with the pitch frequency bin, of the input signal, and determining correctness of the initial pitch period according to the pitch period correctness decision parameter. Hence, the method and apparatus for detecting correctness of the pitch period improve, based on a relatively less complex algorithm, accuracy of detecting correctness of the pitch period.