Abstract:
Provided are an apparatus and a method for integrally encoding and decoding a speech signal and a audio signal. The encoding apparatus may include: an input signal analyzer to analyze a characteristic of an input signal; a first conversion encoder to convert the input signal to a frequency domain signal, and to encode the input signal when the input signal is a audio characteristic signal; a Linear Predictive Coding (LPC) encoder to perform LPC encoding of the input signal when the input signal is a speech characteristic signal; and a bitstream generator to generate a bitstream using an output
Abstract:
Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encode the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).
Abstract:
A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. The USAC may perform encoding or decoding by overlapping between frames based on a folding point when mode switching occurs. The USAC may process different window sequences for each situation to perform encoding or decoding, and thereby may improve a coding efficiency.
Abstract:
The present invention relates to a method and an apparatus for processing a signal, which are used to effectively reproduce an audio signal, and more particularly, to a method for generating a filter for an audio signal, which are used for implementing a filtering for input audio signals with a low computational complexity and a parameterization apparatus therefor.To this end, provided are a method for generating a filter of an audio signal, including: receiving at least one proto-type filter coefficient for filtering each subband signal of an input audio signal; converting the proto-type filter coefficient into a plurality of subband filter coefficients; truncating each of the subband filter coefficients based on filter order information obtained by at least partially using characteristic information extracted from the corresponding subband filter coefficients, the length of at least one truncated subband filter coefficients being different from the length of truncated subband filter coefficients of another subband; and generating FFT filter coefficients by fast Fourier transforming (FFT) the truncated subband filter coefficients by a predetermined block size in the corresponding subband and a parameterization unit using the same.
Abstract:
Provided is a method and apparatus for generating a side information bitstream of a multi-object audio signal. The apparatus for generating a side information bitstream of a multi-object audio signal includes a spatial cue information input unit configured to receive spatial cue information generated in an encoder of the multi-object audio signal, a preset information input unit configured to receive preset information for the multi-object audio signal, and a side information bitstream generator configured to generate the side information bitstream based on the spatial cue information and the preset information. The side information bitstream includes a header region and a frame region, and the preset information is included in the frame region.
Abstract:
A method of playing a sound source is provided. The method includes identifying a playback area for at least one higher order ambisonic (HOA) sound source existing in a target space, determining a coordinate calculation standard for playing the HOA sound source based on a positional relationship between the playback area of the HOA sound source and a listener existing in the target space, and calculating a head related transfer function (HRTF) rendering angle of the HOA sound source according to the determined coordinate calculation standard and playing the HOA sound source.
Abstract:
An encoding apparatus and a decoding apparatus in a transform between a Modified Discrete Cosine Transform (MDCT)-based coder and a different coder are provided. The encoding apparatus may encode additional information to restore an input signal encoded according to the MIDCT-based coding scheme, when switching occurs between the MDCT-based coder and the different coder. Accordingly, an unnecessary bitstream may be prevented from being generated, and minimum additional information may be encoded.
Abstract:
A rendering method of an object-based audio signal and an apparatus for performing the same are provided. The rendering method of an object-based audio signal includes obtaining a rendered audio signal, performing clipping prevention on the rendered audio signal using a first limiter, mixing a signal output by the first limiter using a mixer, and performing clipping prevention on the mixed signal using a second limiter.
Abstract:
Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encode the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).
Abstract:
A method of rendering object-based audio and an electronic device performing the method are disclosed. The method includes identifying metadata of object-based audio, identifying an audio source distance between the object-based audio and a listener using the metadata, determining a minimum distance of the object-based audio to apply attenuation according to the audio source distance, based on a reference distance of the object-based audio in the metadata, and rendering the object-based audio using the audio source distance and the minimum distance.