Abstract:
An object-based audio system, an object-based audio providing method, and an object-based audio playback method using a preset are provided. The object-based audio system includes a reference information providing unit to provide reference information used to refer to a storage location of an object-based audio file, and a preset information providing unit to provide preset information used to control at least one audio object forming the object-based audio file.
Abstract:
Provided are an object-based three-dimensional (3-D) audio service system using preset audio scenes and a method thereof. The system and the method let a user watch and listen to an object-based 3-D audio service easily and conveniently, without the inconvenience of having to control the object audio signal of each sound source individually. The system includes: audio input means for inputting an audio signal; preset audio scene generating means for extracting object audio signals from the audio signal inputted through the audio input means and generating more than one set of 3-D audio scene information by arranging the extracted object audio signals in a 3-D space and editing features of each object; and encoding means for encoding and multiplexing the audio signal and the 3-D audio scene information for each object audio signal.
Abstract:
Provided are an apparatus and a method for multi-stage, multi-dimensional transformation of a plurality of unit blocks that can improve compression efficiency of video data by collecting Discrete Cosine Transform (DCT) coefficients of neighboring blocks and performing an additional transformation based on the DCT coefficients of an original picture and a differential picture. The method includes the steps of: performing a DCT on inputted picture data and selecting R blocks of a predetermined size from the DCT picture data, where R is a natural number equal to or greater than 2; arranging the DCT coefficients of each of the selected R blocks in one dimension according to frequency; and performing a one-dimensional transformation again on the DCT coefficients arranged in one dimension.
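The three steps above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names (`dct_1d`, `dct_2d`, `second_stage`) are hypothetical, the second-stage transform is assumed to be another DCT (the abstract only says "one-dimensional transformation"), and the block size and R are taken from the input.

```python
import math

def dct_1d(x):
    """Orthonormal DCT-II of a 1-D sequence."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * s)
    return out

def dct_2d(block):
    """Separable 2-D DCT: transform the rows, then the columns."""
    rows = [dct_1d(r) for r in block]
    cols = [dct_1d(list(c)) for c in zip(*rows)]
    return [list(r) for r in zip(*cols)]

def second_stage(blocks):
    """For each frequency position, gather one coefficient from each of the
    R blocks into a 1-D vector and transform that vector again.
    Assumption: the second transform is also a DCT."""
    coeffs = [dct_2d(b) for b in blocks]            # first stage, per block
    size = len(blocks[0])
    result = {}
    for u in range(size):
        for v in range(size):
            same_freq = [c[u][v] for c in coeffs]   # one value per block
            result[(u, v)] = dct_1d(same_freq)      # second stage
    return result

# R = 4 identical flat 2x2 blocks: the inter-block redundancy collapses
# into a single second-stage coefficient.
flat = [[10.0, 10.0], [10.0, 10.0]]
stage2 = second_stage([flat, flat, flat, flat])
```

When neighboring blocks are similar, their same-frequency coefficients are strongly correlated, so the second transform concentrates the energy into fewer values, which is the compression gain the abstract refers to.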
Abstract:
The present invention discloses an encoding apparatus using Discrete Cosine Transform (DCT) scanning, which includes: a mode selection means for selecting an optimal mode for intra prediction; an intra prediction means for performing intra prediction on input video based on the mode selected by the mode selection means; a DCT and quantization means for performing DCT and quantization on residual coefficients of a block outputted from the intra prediction means; and an entropy encoding means for performing entropy encoding on the DCT coefficients acquired from the DCT and quantization by using a scanning mode decided based on pixel similarity of the residual coefficients.
Abstract:
Provided are a multi-object audio encoding and decoding method and an apparatus thereof. The multi-object encoding method includes generating a down-mix signal and a residual signal by down-mixing a foreground audio object and a background audio object, and generating a bitstream including the down-mix signal and the residual signal.
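One way to picture a down-mix plus residual pair is sketched below. The abstract does not say how the residual is computed, so everything here is an assumption made for illustration: the down-mix is a plain sample-wise sum, the foreground object is predicted from the down-mix with a single least-squares gain `c`, and the residual is the prediction error. The names `encode` and `decode` are hypothetical, and `c` is carried as extra side information that the abstract does not mention.

```python
def encode(fg, bg):
    """Down-mix two objects and compute a residual for the foreground.
    Assumption: residual = foreground minus a scaled copy of the down-mix."""
    downmix = [f + b for f, b in zip(fg, bg)]
    energy = sum(x * x for x in downmix)
    # Least-squares gain that best predicts the foreground from the down-mix.
    c = sum(f * x for f, x in zip(fg, downmix)) / energy if energy else 0.0
    residual = [f - c * x for f, x in zip(fg, downmix)]
    return downmix, residual, c

def decode(downmix, residual, c):
    """Recover both objects from the down-mix, the residual, and the gain."""
    fg = [c * x + r for x, r in zip(downmix, residual)]
    bg = [x - f for x, f in zip(downmix, fg)]
    return fg, bg

foreground = [1.0, -2.0, 0.5]
background = [0.25, 0.25, -1.0]
d, r, gain = encode(foreground, background)
fg_out, bg_out = decode(d, r, gain)
```

In this sketch the residual is what makes the separation exact: without it, the decoder could only approximate the foreground as `c` times the down-mix.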
Abstract:
Provided are a system and a method for transmitting/receiving object-based audio. The system includes: a pre-processing unit for receiving audio signals from diverse external sources and creating an object-based audio signal through a pre-processing procedure; an object-based audio editing unit for editing the object-based audio signal from the pre-processing unit and organizing an audio scene; an object-based audio coding unit for coding/multiplexing information on the object-based audio signal and the audio scene from the object-based audio editing unit and creating object-based audio contents; and a transmitting unit for transmitting the object-based audio contents from the object-based audio coding unit.
Abstract:
Provided is an apparatus for evaluating the audio quality of a multi-channel audio codec, including: a preprocessing unit for synthesizing binaural signals based on multi-channel audio signals transmitted through the multiple channels of a multi-channel audio reproduction system; an output variable calculator for calculating an interaural cross-correlation coefficient distortion (IACCDist) and other output variables of the binaural signals; and an artificial neural network circuit for outputting a grade of the perceived quality based on the IACCDist and the other output variables calculated by the output variable calculator.
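The output variable named above builds on the interaural cross-correlation coefficient (IACC), commonly defined as the maximum absolute value of the normalized cross-correlation between the left-ear and right-ear signals over a small range of lags. The sketch below computes that coefficient. The abstract does not define IACCDist itself, so the `iacc_dist` function, which takes the absolute difference between the IACC of a reference pair and that of a coded pair, is only an assumed stand-in, and both function names are hypothetical.

```python
import math

def iacc(left, right, max_lag):
    """Maximum absolute normalized cross-correlation over lags in
    [-max_lag, max_lag], measured in samples."""
    norm = math.sqrt(sum(x * x for x in left) * sum(y * y for y in right))
    if norm == 0:
        return 0.0
    n = len(left)
    best = 0.0
    for lag in range(-max_lag, max_lag + 1):
        acc = sum(left[i] * right[i + lag] for i in range(n) if 0 <= i + lag < n)
        best = max(best, abs(acc) / norm)
    return best

def iacc_dist(ref_left, ref_right, test_left, test_right, max_lag):
    """Assumed distortion measure: absolute change in IACC between the
    reference and the coded binaural signals."""
    return abs(iacc(ref_left, ref_right, max_lag)
               - iacc(test_left, test_right, max_lag))

left = [0.0, 1.0, 0.0, -1.0, 0.0, 0.0]
delayed = [0.0, 0.0, 1.0, 0.0, -1.0, 0.0]   # same signal, one sample later
coefficient = iacc(left, delayed, max_lag=2)
```

In practice the lag range is usually chosen to cover about one millisecond in each direction, so `max_lag` depends on the sampling rate.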
Abstract:
The present research relates to controlling the rendering of multi-object or multi-channel audio signals. It provides a method and an apparatus for controlling the rendering of multi-object or multi-channel audio signals based on spatial cues while the signals are being decoded. To this end, the suggested method controls rendering in the spatial cue domain during the decoding process.
Abstract:
Provided are an apparatus and a method for encoding/decoding moving pictures based on adaptive scanning. The apparatus and method can increase the compression rate by performing intra prediction on blocks of a predetermined size and by scanning the coefficients, obtained by applying the Discrete Cosine Transform (DCT) and quantization to a residue signal, differently according to the intra prediction mode. The moving picture encoding apparatus includes: a mode selector for selecting and outputting a prediction mode; a predictor for predicting pixel values of pixels to be encoded of an input video based on the prediction mode to thereby output a residue signal block; a transform/quantization unit for performing DCT on the residue signal block and quantizing the transformed residue signal block; and an encoder for adaptively scanning and encoding the quantized residue signal block based on the prediction mode.
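A mode-dependent scan of this kind can be sketched as follows. The abstract does not state which scan goes with which prediction mode, so the mapping used here is an assumption: a row-by-row (horizontal) scan after vertical prediction, a column-by-column (vertical) scan after horizontal prediction, and a zigzag scan otherwise. The function names and the string mode labels are hypothetical.

```python
def zigzag_order(n):
    """Zigzag traversal of an n x n block as (row, col) pairs."""
    order = []
    for s in range(2 * n - 1):
        diag = [(i, s - i) for i in range(n) if 0 <= s - i < n]
        # Even diagonals run from bottom-left to top-right, odd ones the reverse.
        order.extend(diag[::-1] if s % 2 == 0 else diag)
    return order

def scan(block, mode):
    """Flatten a quantized coefficient block using a mode-dependent order.
    Assumed mapping: vertical prediction -> horizontal scan,
    horizontal prediction -> vertical scan, anything else -> zigzag."""
    n = len(block)
    if mode == "vertical":
        order = [(r, c) for r in range(n) for c in range(n)]   # row by row
    elif mode == "horizontal":
        order = [(r, c) for c in range(n) for r in range(n)]   # column by column
    else:
        order = zigzag_order(n)
    return [block[r][c] for r, c in order]

# Coefficients concentrated in the first row, as this sketch assumes
# for vertically predicted blocks.
block = [
    [9, 5, 3, 1],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]
adaptive = scan(block, "vertical")
fallback = scan(block, "dc")
```

With this block, the row-by-row scan places all four nonzero coefficients first and leaves one unbroken run of zeros, whereas the zigzag scan interleaves zeros between them. Shorter, more predictable zero runs are what let the entropy coder spend fewer bits.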
Abstract:
A method of generating and consuming a 3D audio scene with extended sound-source spatiality describes the shape and size attributes of the sound source. The method includes the steps of: generating an audio object; and generating 3D audio scene description information that includes the attributes of the sound source of the audio object.