Abstract:
Provided are an apparatus and method for coding and decoding a multi-object audio signal. The apparatus includes a down-mixer for down-mixing the audio signals into one down-mixed audio signal and extracting supplementary information including header information and spatial cue information for each of the audio signals, a coder for coding the down-mixed audio signal, and a supplementary information coder for generating the supplementary information as a bit stream. The header information includes identification information for each of the audio signals and channel information for the audio signals.
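The down-mixing and supplementary-information steps can be illustrated with a minimal sketch. It assumes mono objects of equal length, and it uses a per-object energy ratio in dB as a stand-in for the spatial cue; the abstract does not specify which cue is used, so `level_cues_db`, `make_side_info`, and the dictionary layout are hypothetical names chosen for illustration only.

```python
import math

def downmix(objects):
    """Sum equal-length object signals sample by sample into one signal."""
    return [sum(samples) for samples in zip(*objects)]

def energy(signal):
    """Total energy of a signal (sum of squared samples)."""
    return sum(x * x for x in signal)

def level_cues_db(objects, eps=1e-12):
    """Energy of each object relative to the first object, in dB.

    This is an assumed, illustrative cue, not the one defined in the patent.
    """
    ref = energy(objects[0]) + eps
    return [10.0 * math.log10((energy(o) + eps) / ref) for o in objects]

def make_side_info(objects, channel_counts):
    """Bundle header fields (object id, channel count) with the cues."""
    header = [{"object_id": i, "channels": c}
              for i, c in enumerate(channel_counts)]
    return {"header": header, "level_cues_db": level_cues_db(objects)}

objects = [[1.0, 0.0, -1.0, 0.0], [0.5, 0.5, 0.5, 0.5]]
mono = downmix(objects)
side = make_side_info(objects, [1, 1])
```

In a real coder the down-mixed signal would then go to an audio coder and the side information would be serialized into a bit stream; both steps are omitted here.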
Abstract:
Disclosed herein is a context-adaptive entropy model for end-to-end optimized image compression. The entropy model exploits two types of contexts: a bit-consuming context and a bit-free context. The contexts are classified according to whether the corresponding context requires the allocation of additional bits. Based on these contexts, the entropy model may more accurately estimate the distribution of each latent representation using a more generalized form of entropy model, thus improving compression performance.
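Why a more accurate distribution estimate saves bits can be seen in a small sketch. It assumes each quantized latent is modeled by a Gaussian integrated over a unit-width bin, which is a common choice in learned compression but is not stated in the abstract. The mean and scale values below are made up to stand in for parameters that a context would predict.

```python
import math

def gaussian_cdf(x, mean, scale):
    """Cumulative distribution function of a Gaussian."""
    return 0.5 * (1.0 + math.erf((x - mean) / (scale * math.sqrt(2.0))))

def bits_for_symbol(y, mean, scale, floor=1e-9):
    """Estimated code length of integer y: -log2 of its bin probability."""
    p = gaussian_cdf(y + 0.5, mean, scale) - gaussian_cdf(y - 0.5, mean, scale)
    return -math.log2(max(p, floor))

latents = [3, 4, 3, 5]

# Context-free model: one fixed, broad distribution for every latent.
fixed = sum(bits_for_symbol(y, 0.0, 4.0) for y in latents)

# Context-adaptive model: mean and scale assumed to be predicted from contexts.
adaptive = sum(bits_for_symbol(y, 4.0, 1.0) for y in latents)
```

With these illustrative numbers, `adaptive` is smaller than `fixed` because the adapted distribution places more probability mass on the values that actually occur.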
Abstract:
The present research relates to controlling the rendering of multi-object or multi-channel audio signals. It provides a method and apparatus for controlling the rendering of multi-object or multi-channel audio signals based on spatial cues during decoding of those signals. To this end, the proposed method controls rendering in the spatial cue domain during the decoding of the multi-object or multi-channel audio signals.
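Controlling rendering in the spatial cue domain can be sketched as modifying a cue before it is turned into channel gains, rather than processing the decoded waveforms. The sketch assumes a two-channel output and a single level-difference cue in dB (left over right); `control_db` is a hypothetical user control, and the abstract does not define the cue set or the control law.

```python
import math

def gains_from_cld(cld_db):
    """Convert a level difference in dB into power-normalized channel gains."""
    ratio = 10.0 ** (cld_db / 10.0)          # left/right power ratio
    left = math.sqrt(ratio / (1.0 + ratio))
    right = math.sqrt(1.0 / (1.0 + ratio))
    return left, right

def render(downmix, cld_db, control_db=0.0):
    """Up-mix a mono down-mix to stereo after offsetting the cue."""
    left_gain, right_gain = gains_from_cld(cld_db + control_db)
    left = [left_gain * x for x in downmix]
    right = [right_gain * x for x in downmix]
    return left, right

# Shift the image toward the left channel by adding 6 dB to the cue.
left, right = render([1.0, -1.0, 0.5], cld_db=0.0, control_db=6.0)
```

Because the gains are power-normalized, the offset changes the position of the source without changing the total output power.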
Abstract:
Provided are an apparatus and method for coding and decoding a multi-object audio signal with multiple channels. The apparatus includes: a multi-channel encoding unit for down-mixing an audio signal including a plurality of channels, generating a spatial cue for the audio signal including the plurality of channels, and generating first rendering information including the generated spatial cue; and a multi-object encoding unit for down-mixing an audio signal including a plurality of objects, which includes the down-mixed signal from the multi-channel encoding unit, generating a spatial cue for the audio signal including the plurality of objects, and generating second rendering information including the generated spatial cue, wherein the multi-channel encoding unit generates a spatial cue for the audio signal including the plurality of objects regardless of a Coder-DECoder (CODEC) scheme that limits the multi-channel encoding unit.
Abstract:
The present invention discloses an encoding apparatus using Discrete Cosine Transform (DCT) scanning, which includes a mode selection means for selecting an optimal mode for intra prediction; an intra prediction means for performing intra prediction on input video based on the mode selected by the mode selection means; a DCT and quantization means for performing DCT and quantization on residual coefficients of a block output from the intra prediction means; and an entropy encoding means for performing entropy encoding on the DCT coefficients acquired from the DCT and quantization by using a scanning mode decided based on pixel similarity of the residual coefficients.
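The idea of choosing a scanning mode from the similarity of the residual can be sketched as follows. The decision rule here is an assumption for illustration: when residual values vary less along rows than along columns, the transform energy tends to sit in the first column, so a column-first (vertical) scan is chosen, and vice versa. The abstract does not give the actual similarity measure or thresholds, and `choose_scan` and `scan` are hypothetical names.

```python
def zigzag_order(n):
    """Return (row, col) pairs of an n x n block in zigzag order."""
    cells = [(r, c) for r in range(n) for c in range(n)]
    # Walk anti-diagonals; alternate direction on odd and even diagonals.
    return sorted(
        cells,
        key=lambda rc: (rc[0] + rc[1],
                        rc[0] if (rc[0] + rc[1]) % 2 else rc[1]),
    )

def choose_scan(residual):
    """Pick a scan mode from horizontal vs. vertical residual differences."""
    n = len(residual)
    horiz = sum(abs(residual[r][c] - residual[r][c + 1])
                for r in range(n) for c in range(n - 1))
    vert = sum(abs(residual[r][c] - residual[r + 1][c])
               for r in range(n - 1) for c in range(n))
    if horiz < vert:
        return "vertical"      # rows are flat: energy expected in column 0
    if vert < horiz:
        return "horizontal"    # columns are flat: energy expected in row 0
    return "zigzag"

def scan(coeffs, mode):
    """Serialize a square coefficient block in the chosen scan order."""
    n = len(coeffs)
    if mode == "horizontal":
        order = [(r, c) for r in range(n) for c in range(n)]
    elif mode == "vertical":
        order = [(r, c) for c in range(n) for r in range(n)]
    else:
        order = zigzag_order(n)
    return [coeffs[r][c] for r, c in order]

mode = choose_scan([[1, 1], [5, 5]])
serialized = scan([[1, 2], [3, 4]], mode)
```

A scan that visits the likely nonzero coefficients first leaves a longer run of trailing zeros, which is what the entropy coder benefits from.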