Abstract:
An electronic apparatus acquires input data to be input into a text-to-speech (TTS) module for outputting a voice, acquires a voice signal corresponding to the input data through the TTS module, detects an error in the acquired voice signal based on the input data, corrects the input data based on the detection result, and acquires, through the TTS module, a corrected voice signal corresponding to the corrected input data.
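The synthesize, check, correct, and re-synthesize flow described above can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say how the error is detected or how the input is corrected, so `toy_tts`, `toy_detect`, `toy_correct`, and `NUMBER_WORDS` are hypothetical stand-ins, and the "voice signal" is modelled as the list of words the engine managed to pronounce.

```python
NUMBER_WORDS = {"2": "two", "3": "three"}  # hypothetical lookup used by the toy corrector


def speak_with_correction(input_data, tts, detect_error, correct):
    """Synthesize once, check the result against the input, and retry if needed."""
    voice = tts(input_data)
    error = detect_error(voice, input_data)
    if error is None:
        return input_data, voice
    corrected = correct(input_data, error)   # correct the input, not the audio
    return corrected, tts(corrected)         # second pass through the same TTS


def toy_tts(text):
    # Assumption: a toy engine that silently skips digit tokens.
    return [w for w in text.split() if not w.isdigit()]


def toy_detect(voice, text):
    # Report every input word that is absent from the synthesized result.
    missing = [w for w in text.split() if w not in voice]
    return missing or None


def toy_correct(text, missing):
    # Spell out the words the engine could not pronounce.
    return " ".join(
        NUMBER_WORDS.get(w, w) if w in missing else w for w in text.split()
    )


text, voice = speak_with_correction(
    "call me in 2 minutes", toy_tts, toy_detect, toy_correct
)
```

Passing the three steps in as functions keeps the control flow separate from whichever detection and correction methods a real system would use.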
Abstract:
An electronic apparatus, when a text sentence is input, obtains prosody information of the text sentence, segments the text sentence into a plurality of sentence elements, inputs the plurality of sentence elements and the prosody information of the text sentence to a text-to-speech (TTS) module to obtain, in parallel, speech for each sentence element in which the prosody information is reflected, and merges the speech obtained in parallel for the plurality of sentence elements to output speech for the text sentence.
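A minimal sketch of the segment, synthesize-in-parallel, and merge flow, under stated assumptions: `get_prosody`, `segment`, and `toy_tts` are hypothetical placeholders (the abstract does not define the prosody features, the segmentation rule, or the TTS interface), and a string stands in for audio. The point it illustrates is that the prosody is computed once from the whole sentence and shared by every element, so the pieces remain consistent when merged.

```python
from concurrent.futures import ThreadPoolExecutor


def get_prosody(sentence):
    # Assumption: prosody reduced to a sentence-level style tag.
    is_question = sentence.rstrip().endswith("?")
    return {"style": "question" if is_question else "statement"}


def segment(sentence):
    # Assumption: sentence elements are comma-separated clauses.
    return [part.strip() for part in sentence.split(",") if part.strip()]


def toy_tts(element, prosody):
    # Stand-in for a real TTS call; returns a tagged string instead of audio.
    return f"<{prosody['style']}:{element}>"


def speak(sentence, tts=toy_tts):
    prosody = get_prosody(sentence)          # computed from the full sentence
    elements = segment(sentence)
    with ThreadPoolExecutor() as pool:
        # Executor.map returns results in input order, so merging is a join.
        chunks = list(pool.map(lambda e: tts(e, prosody), elements))
    return "".join(chunks)


merged = speak("If it rains, will you stay home?")
```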
Abstract:
An apparatus for and a method of processing a multi-channel audio signal using space information are provided. The apparatus includes: a main coding unit that down-mixes a multi-channel audio signal by applying space information to surround components included in the multi-channel audio signal, generates side information using the multi-channel audio signal or a stereo signal resulting from the down-mixing, codes the stereo signal and the side information, and transmits the coded result as a coding signal; and a main decoding unit that receives the coding signal, decodes the stereo signal and the side information from the received coding signal, up-mixes the decoded stereo signal using the decoded side information, and restores the multi-channel audio signal.
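The down-mix and up-mix round trip can be illustrated with a toy four-channel example. Everything specific here is an assumption made for illustration: the abstract does not define the space information or the side information, so a single scalar `surround_gain` stands in for the former and per-channel energy ratios for the latter, and the actual coding of the stereo signal is omitted.

```python
import math


def energy(samples):
    return sum(v * v for v in samples)


def encode(ch, surround_gain=1 / math.sqrt(2)):
    """ch maps "L", "R", "Ls", "Rs" to equal-length lists of samples."""
    # Down-mix: each surround channel is weighted and folded into its side.
    left = [l + surround_gain * s for l, s in zip(ch["L"], ch["Ls"])]
    right = [r + surround_gain * s for r, s in zip(ch["R"], ch["Rs"])]
    # Side information: each channel's energy relative to its down-mix.
    side = {}
    for name, mix in (("L", left), ("Ls", left), ("R", right), ("Rs", right)):
        e = energy(mix)
        side[name] = energy(ch[name]) / e if e else 0.0
    return left, right, side


def decode(left, right, side):
    # Up-mix: rescale each stereo channel so the restored channel has the
    # energy recorded in the side information (waveform shape is not recovered).
    def scaled(mix, name):
        k = math.sqrt(side[name])
        return [k * v for v in mix]

    return {
        "L": scaled(left, "L"), "Ls": scaled(left, "Ls"),
        "R": scaled(right, "R"), "Rs": scaled(right, "Rs"),
    }


channels = {
    "L": [1.0, 0.0, -1.0], "R": [0.5, 0.5, 0.5],
    "Ls": [0.0, 1.0, 0.0], "Rs": [0.2, -0.2, 0.2],
}
left, right, side = encode(channels)
restored = decode(left, right, side)
```

In this sketch the decoder recovers each channel's energy but not its waveform, which is the usual trade-off when a few parameters describe several channels.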
Abstract:
An apparatus and method for encoding/decoding a multi-channel signal are provided. The encoding apparatus may insert, in a bitstream of the multi-channel signal, information indicating whether a phase parameter is encoded, the phase parameter indicating phase information of a plurality of channels included in the multi-channel signal. The decoding apparatus may determine, based on that information, whether to up-mix a mono signal using the phase parameter.
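As a rough sketch of the signalling idea, the example below writes a one-byte flag ahead of the payload and lets the decoder branch on it. The layout is an assumption for illustration only, not the bitstream syntax of the apparatus: one complex coefficient per channel, a mono down-mix stored as two doubles, and the phase parameter taken to be the inter-channel phase difference.

```python
import cmath
import struct

HEADER = "<B2d"  # flag byte, then real and imaginary parts of the mono value


def encode(left: complex, right: complex, with_phase: bool) -> bytes:
    mono = (left + right) / 2
    data = struct.pack(HEADER, int(with_phase), mono.real, mono.imag)
    if with_phase:
        # Assumed phase parameter: phase of left relative to right.
        ipd = cmath.phase(left * right.conjugate())
        data += struct.pack("<d", ipd)
    return data


def decode(data: bytes):
    flag, re, im = struct.unpack_from(HEADER, data)
    mono = complex(re, im)
    if not flag:
        # No phase parameter in the stream: up-mix without phase.
        return mono, mono
    (ipd,) = struct.unpack_from("<d", data, struct.calcsize(HEADER))
    # Split the phase difference symmetrically between the two channels.
    return mono * cmath.exp(1j * ipd / 2), mono * cmath.exp(-1j * ipd / 2)


plain = encode(1 + 0j, 1j, with_phase=False)
coded = encode(1 + 0j, 1j, with_phase=True)
left_out, right_out = decode(coded)
```

Because the flag comes first, the decoder knows how many bytes follow and never tries to read a phase parameter that was not transmitted.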
Abstract:
Adaptive time/frequency-based audio encoding and decoding apparatuses and methods are provided. The encoding apparatus includes a transformation and mode determination unit to divide an input audio signal into a plurality of frequency-domain signals and to select a time-based encoding mode or a frequency-based encoding mode for each frequency-domain signal, an encoding unit to encode each frequency-domain signal in its selected encoding mode, and a bitstream output unit to output encoded data, division information, and encoding mode information for each frequency-domain signal. In the apparatuses and methods, acoustic characteristics and a voicing model are applied simultaneously to a frame, which is the unit of audio compression processing. As a result, a compression method effective for both music and voice can be produced, and the compression method can be used in mobile terminals that require audio compression at a low bit rate.
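The per-band mode decision can be sketched as below. The selection criterion is purely an assumption: the abstract does not state how the mode is chosen, so a crest-factor test with an arbitrary `threshold` is used as a placeholder, and `select_modes` is a hypothetical name. What the sketch does reflect from the text is the shape of the output: for each frequency-domain signal, the division information, the chosen mode, and the data to be encoded.

```python
def select_modes(spectrum, edges, threshold=2.0):
    """Split one frame's coefficients into bands and pick a mode per band.

    edges must be strictly increasing indices into spectrum.
    """
    records = []
    for lo, hi in zip(edges, edges[1:]):
        band = spectrum[lo:hi]
        mags = [abs(c) for c in band]
        mean = sum(mags) / len(mags)
        crest = max(mags) / mean if mean else 0.0
        # Placeholder rule: peaky (tonal) bands go to the frequency-based
        # mode, flat bands go to the time-based mode.
        mode = "frequency" if crest > threshold else "time"
        records.append({"range": (lo, hi), "mode": mode, "coeffs": band})
    return records


frame = [1, 1, 1, 1, 0, 8, 0, 0]
plan = select_modes(frame, edges=[0, 4, 8])
```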
Abstract:
A method, medium, and system for generating a 3-dimensional (3D) stereo signal in a decoder by using a surround data stream. According to the method, medium, and system, a head-related transfer function (HRTF) is applied in a quadrature mirror filter (QMF) domain, so that the 3D stereo signal is generated from the surround data stream.