Abstract:
Provided is a method of encoding an audio/speech signal, the method including determining a variable length of a frame, that is, a processing unit of an input signal, in accordance with a position of an attack in the input signal; transforming each frame of the input signal to a frequency domain and dividing the frame into a plurality of sub frequency bands; and, if a signal of a sub frequency band is determined to be encoded in the frequency domain, encoding the signal of the sub frequency band in the frequency domain, and if the signal of the sub frequency band is determined to be encoded in a time domain, inverse transforming the signal of the sub frequency band to the time domain and encoding the inverse transformed signal in the time domain. According to the present invention, the audio/speech signal may be efficiently encoded by controlling time resolution and frequency resolution.
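The first step above, choosing a frame length from the position of an attack, might be sketched as follows. This is only an illustration under stated assumptions: the block-energy ratio detector, the names `find_attack` and `choose_frame_length`, and the default threshold are hypothetical and are not taken from the abstract, which does not say how the attack is located.

```python
def find_attack(samples, block, ratio=8.0):
    """Return the sample index of the first block whose energy exceeds
    `ratio` times the energy of the preceding block, or None.
    The energy-ratio criterion is an assumption made for this sketch."""
    energies = [
        sum(x * x for x in samples[i:i + block])
        for i in range(0, len(samples) - block + 1, block)
    ]
    for k in range(1, len(energies)):
        # Small constant avoids a zero threshold on digital silence.
        if energies[k] > ratio * (energies[k - 1] + 1e-12):
            return k * block
    return None


def choose_frame_length(samples, long_len, block):
    """Pick a frame length so that the frame ends just before an attack.
    With no attack inside the candidate long frame, keep the long frame."""
    pos = find_attack(samples[:long_len], block)
    return long_len if pos is None else pos
```

Ending the frame at the attack keeps the transient out of a long analysis frame, which is one plausible way to trade frequency resolution for time resolution around onsets.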
Abstract:
Provided are a method and apparatus for encoding and decoding a stereo signal or a multi-channel signal. According to the method and apparatus, a stereo signal or a multi-channel signal can be encoded and/or decoded by generating parameters based on a mono signal.
Abstract:
Provided is a method of encoding an audio signal. The method includes generating a modified signal of a time domain to compensate for a frequency resolution in frame units, analysis-windowing the modified signal of the time domain by using a window type designed to have an overlapping section of less than 50%, and generating transform coefficients of a frequency domain by transforming the analysis-windowed signal of the time domain.
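A window with an overlapping section of less than 50% can be illustrated with a flat-top window whose sine slopes cover only part of the frame. The sketch below is an assumption about one such shape, not the window defined in the patent; `low_overlap_window`, `hop`, and `overlap` are hypothetical names. It covers only the windowing step, not the generation of the modified time-domain signal.

```python
import math


def low_overlap_window(hop, overlap):
    """Build a flat-top window of length hop + overlap.

    Consecutive windows are spaced `hop` samples apart, so they share
    exactly `overlap` samples. Requiring overlap < hop keeps the
    overlapping section below 50% of the window length.
    """
    if not 0 < overlap < hop:
        raise ValueError("overlap must satisfy 0 < overlap < hop")
    # Rising sine slope; the falling slope is its mirror image, so the
    # squared slopes sum to one across the overlapping section.
    rise = [math.sin(math.pi * (n + 0.5) / (2 * overlap)) for n in range(overlap)]
    flat = [1.0] * (hop - overlap)
    fall = rise[::-1]
    return rise + flat + fall
```

For example, `low_overlap_window(8, 2)` has 10 samples, of which 2 overlap with the next window, an overlapping section of 20%. Because the tail of one window and the head of the next are power complementary, the squared windows sum to one in the shared region.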
Abstract:
Disclosed are a method and an apparatus for high frequency decoding for bandwidth extension. The method for high frequency decoding for bandwidth extension comprises the steps of: decoding an excitation class; transforming a decoded low frequency spectrum on the basis of the excitation class; and generating a high frequency excitation spectrum on the basis of the transformed low frequency spectrum. The method and apparatus for high frequency decoding for bandwidth extension according to an embodiment can transform a restored low frequency spectrum and generate a high frequency excitation spectrum, thereby improving the restored sound quality without an excessive increase in complexity.
Abstract:
Provided are a signal processing method and apparatus for enhancing sound quality. The signal processing method performed by a signal transmitting apparatus includes determining, based on a plurality of parameters, a valid bandwidth so as to encode an input signal; performing pre-processing on the input signal, based on the valid bandwidth; and encoding the pre-processed input signal, based on the valid bandwidth, and the signal processing method performed by a signal receiving apparatus includes decoding a bitstream or a packet received via a transmission channel; determining a valid bandwidth, based on a plurality of parameters used in the decoding; and performing post-processing on a decoded signal, based on the valid bandwidth.
Abstract:
Disclosed is an electronic apparatus. The electronic apparatus includes a storage for storing a plurality of filters trained in a plurality of convolutional neural networks (CNNs) respectively and a processor configured to acquire a first spectrogram corresponding to a damaged audio signal, input the first spectrogram to a CNN corresponding to each frequency band to apply the plurality of filters trained in the plurality of CNNs respectively, acquire a second spectrogram by merging output values of the CNNs to which the plurality of filters are applied, and acquire an audio signal reconstructed based on the second spectrogram.
Abstract:
A voice signal processing method according to an embodiment of the present disclosure for overcoming the problem includes: acquiring a real-time near-end noise signal; acquiring a far-end voice signal according to an incoming call; measuring subjective speech quality and perceptual-objective speech quality of test signals generated based on a reference signal and the real-time near-end noise signal; selecting at least one speech quality enhancement method based on the subjective speech quality and the perceptual-objective speech quality, and determining parameters that are to be applied to the selected at least one speech quality enhancement method; and enhancing speech quality of the far-end voice signal by using the selected at least one speech quality enhancement method, based on the determined parameters, wherein the test signals are generated by mixing the acquired real-time near-end noise signal with the reference signal whose speech quality is enhanced by applying a combination of parameter values to speech quality enhancement methods.
Abstract:
A high-band encoding/decoding method and device for bandwidth extension are provided. A high-band encoding method comprises the steps of: generating sub band-specific bit allocation information on the basis of a low-band envelope; determining, on the basis of the sub band-specific bit allocation information, the sub band requiring an envelope update in a high band; and generating, for the determined sub band, refinement data relating to the envelope update. A high-band decoding method comprises the steps of: generating sub band-specific bit allocation information on the basis of a low-band envelope; determining, on the basis of the sub band-specific bit allocation information, the sub band requiring an envelope update in a high band; and decoding, for the determined sub band, refinement data relating to the envelope update, thereby updating the envelope.
Abstract:
A method and an apparatus for packet loss concealment, and a decoding method and an apparatus employing the same, are provided. A method for time domain packet loss concealment includes: checking whether a current frame is either an erased frame or a good frame after the erased frame; when the current frame is either the erased frame or the good frame after the erased frame, obtaining signal characteristics; selecting one of a phase matching tool and a smoothing tool based on a plurality of parameters including the signal characteristics; and performing packet loss concealment processing on the current frame based on the selected tool.
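The control flow of the concealment method (check the frame type, then select one of two tools) can be sketched as a small dispatcher. The abstract does not state which parameters drive the selection, so the `stationary` flag, the `phase_match_score` value, the threshold, and the names `FrameInfo` and `select_plc_tool` are all hypothetical placeholders.

```python
from dataclasses import dataclass


@dataclass
class FrameInfo:
    erased: bool              # current frame was lost
    prev_erased: bool         # previous frame was lost (current may be good)
    stationary: bool          # hypothetical signal characteristic
    phase_match_score: float  # hypothetical parameter in [0, 1]


def select_plc_tool(frame, threshold=0.5):
    """Return the name of the concealment tool for this frame, or None.

    Concealment applies only to an erased frame or to a good frame that
    directly follows an erased one. The selection rule below is an
    assumption made for illustration.
    """
    if not (frame.erased or frame.prev_erased):
        return None  # ordinary good frame: no concealment needed
    if frame.stationary and frame.phase_match_score >= threshold:
        return "phase_matching"
    return "smoothing"
```

Returning a tool name rather than running the tool keeps the decision separate from the signal processing, so the selection rule can be replaced without touching either tool.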
Abstract:
A playout delay adjustment method includes: adjusting a playout delay surplus based on a difference value between a first playout delay obtained in a first scheme and a second playout delay obtained in a second scheme and determining an adaptation type of a current frame according to whether a previous frame is an active frame; and when the determined adaptation type is signal-based adaptation, performing time scale modification (TSM) according to an adaptation scheme determined according to a comparison result between the first playout delay and the second playout delay and a comparison result between a target delay and the first playout delay.