Abstract:
A method of matching a voice for each object included in a video, includes: separating a plurality of voices in a video; determining a dissimilarity between the plurality of voices; selecting a partial duration in an entire duration of the video as a matching duration, based on the dissimilarity between the plurality of voices; matching, within the matching duration, the plurality of voices with a plurality of objects in the video respectively, based on mouth movements of the plurality of objects; and matching the plurality of voices with the plurality of objects respectively in the entire duration of the video, based on results of the matching between the plurality of voices and the plurality of objects within the matching duration.
Abstract:
A sound output device according to an embodiment of the present disclosure includes: a speaker configured to generate sound; a guide tube formed in a shape of a hollow tube, the guide tube configured to receive the sound generated from the speaker, through an end of the guide tube, and output the received sound; and a waveguide disposed between the speaker and the guide tube. The waveguide includes: a throat tube configured to connect the speaker and the guide tube to each other, and formed in a shape of a hollow tube, and at least one path change structure configured to adjust a predetermined frequency band of the sound in a process of transmitting the sound generated from the speaker to the guide tube.
Abstract:
A display apparatus and a method for controlling the display apparatus are disclosed. The display apparatus includes: a communication interface comprising communication circuitry, a display, a plurality of speakers, and a processor, and the processor is configured to: control the communication interface to receive an acoustic content corresponding to a sound signal of a plurality of channels corresponding to a specified frequency bandwidth, extend a frequency bandwidth of at least one channel of the plurality of channels based on a specified parametric equalizer (PEQ), perform correction of reducing a gain in middle frequency and high frequency bands of at least one channel of the plurality of channels with the extended frequency bandwidth, and output each sound signal of the plurality of corrected channels to the plurality of corresponding speakers so that a sound image of the sound signal is formed in a center area of the display.
Abstract:
A display apparatus including a display panel configured to display an image in a front direction; a main speaker provided on a rear surface of the display panel; and an auxiliary speaker provided on the rear surface of the display panel, the auxiliary speaker being configured to output a sound in a rear direction opposing the front direction.
Abstract:
An apparatus for and a method of processing a multi-channel audio signal using space information. The apparatus includes: a main coding unit down mixing a multi-channel audio signal by applying space information to surround components included in the multi-channel audio signal, generating side information using the multi-channel audio signal or a stereo signal of a down-mixed result, coding the stereo signal and the side information, and transmitting the coded result as a coding signal; and a main decoding unit receiving the coding signal, decoding the stereo signal and the side information using the received coding signal, up mixing the decoded stereo signal using the decoded side information, and restoring the multi-channel audio signal.
Abstract:
An apparatus for and a method of processing a multi-channel audio signal using space information. The apparatus includes: a main coding unit down mixing a multi-channel audio signal by applying space information to surround components included in the multi-channel audio signal, generating side information using the multi-channel audio signal or a stereo signal of a down-mixed result, coding the stereo signal and the side information, and transmitting the coded result as a coding signal; and a main decoding unit receiving the coding signal, decoding the stereo signal and the side information using the received coding signal, up mixing the decoded stereo signal using the decoded side information, and restoring the multi-channel audio signal.
Abstract:
An audio processing apparatus may obtain second audio signals corresponding to channels included in a second channel group from first audio signals corresponding to channels included in a first channel group, downsample at least one third audio signal corresponding to at least one channel identified based on a correlation with the second channel group from among the channels included in the first channel group, by using an artificial intelligence (AI) model, and generate a bitstream including the second audio signals corresponding to the channels included in the second channel group and the downsampled at least one third audio signal. The first channel group includes a channel group of an original audio signal, and the second channel group is constructed by combining at least two channels from among the channels included in the first channel group.
Abstract:
An apparatus for outputting an audio signal includes: a channel processor configured to generate two or more channel signals from audio data; a signal processor configured to render the generated two or more channel signals; and a directional speaker configured to reproduced a rendered channel signal as an audible sound. The signal processor may include a frequency converter configured to generate a channel signal of a frequency domain by converting the generated two or more channel signals through frequency conversion, and a re-panner configured to change a channel gain of at least one of the generated channel signals by as much as an adjustment value for the channel gain, wherein the adjustment value is monotonically changed as a frequency of the channel signal of the frequency domain increases.
Abstract:
According to various embodiments of the disclosure, an audio processing apparatus includes at least one processor configured to execute one or more instructions to obtain a second audio signal down-mixed from at least one first audio signal, obtain information related to error removal for the at least one first audio signal, de-mix the at least one first audio signal from the down-mixed second audio signal, and reconstruct the at least one first audio signal by applying the information related to the error removal for the at least one first audio signal to the at least one first audio signal de-mixed from the second audio signal. The information related to the error removal having been generated using at least one of an original signal power of the at least one first audio signal or a second signal power of the at least one first audio signal after decoding.
Abstract:
A sound output device according to an embodiment of the present disclosure includes a speaker configured to generate sound; a guide tube formed in a shape of a hollow tube, the guide tube configured to receive the sound generated from the speaker, through an end of the guide tube, and output the received sound; and a waveguide disposed between the speaker and the guide tube. The waveguide includes a throat tube configured to connect the speaker and the guide tube to each other, and formed in a shape of a hollow tube, and at least one path change structure configured to adjust a predetermined frequency band of the sound in a process of transmitting the sound generated from the speaker to the guide tube.