Abstract:
An augmented broadcasting metadata (ABM) transmission apparatus is provided, which includes a metadata generation unit to generate ABM which is necessary for augmented content to be overlapped with broadcasting content; and a metadata transmission unit to transmit the ABM to a user terminal.
Abstract:
The present invention relates to a method for estimating network jitter, which has the effect of more precisely estimating network jitter by using time information corresponding to a transmission time, which is transmitted from a transport layer in a transmitting end to a receiving end.
Abstract:
An augmented broadcasting stream transmission device and method and an augmented broadcasting service providing device and method capable of ensuring that augmented broadcasting metadata arrive at a receive terminal in a time more rapid as compared to a corresponding video frame by a predetermined time are provided.
Abstract:
Disclosed is a method and apparatus for label encoding in a multi-sound event interval. The method includes identifying an event interval in which a plurality of sound events occurs in a sound signal, separating a sound source into sound event signals corresponding to each sound event by performing sound source separation on the event interval, determining energy information for each of the sound event signals, and performing label encoding based on the energy information.
Abstract:
Disclosed are a training method for a learning model for recognizing an acoustic signal, a method of recognizing an acoustic signal using the learning model, and devices for performing the methods. The method of recognizing an acoustic signal using a learning model includes identifying an acoustic signal including an acoustic event or acoustic scene, determining an acoustic feature of the acoustic signal, dividing the determined acoustic feature for each of a plurality of frequency band intervals, and determining the acoustic event or acoustic scene included in the acoustic signal by inputting the divided acoustic features to a trained learning model.
Abstract:
A method of detecting a sound event includes receiving sound signals using one or more directional microphones, extracting a time interval of each of the sound signals, extracting time information and an azimuth of a sound event included in the sound signals during the extracted time interval, mixing the sound signals received from the directional microphones using the extracted time interval, and determining a direction of the sound event generated at a specific time from a mixed sound signal obtained through the mixing using the extracted time information and azimuth of the sound event.
Abstract:
Disclosed is a bitstream generation method performed by an acoustic data transmission (ADT) encoder, the method including receiving a first audio signal, receiving additional information converted into a bitstream, and transmitting a second audio signal obtained by inserting the bitstream into the first audio signal, to an ADT decoder.
Abstract:
The system for providing targeting augmented contents includes: an augmented metadata generation apparatus that generates augmented metadata designating specific space and time of broadcast contents as an augmented area; a broadcast content providing apparatus that transmits the augmented metadata to a first broadcast terminal apparatus and transmits the augmented metadata and the broadcast contents to a second broadcast terminal apparatus; a first broadcast terminal apparatus that transmits augmented contents displayed in the augmented area in which the augmented metadata are designated to the augmented content providing apparatus; an augmented content providing apparatus that transmits the augmented contents to a second broadcast terminal apparatus; and a second broadcast terminal apparatus that receives the broadcast contents and the augmented metadata from the broadcast content providing apparatus and receives the augmented contents from the augmented content providing apparatus based on the augmented metadata.
Abstract:
Provided are a method and apparatus for transmitting and receiving media data, which can provide D-layer timing information, which is transmitted from a media transmission service based on an MMT system and required for timely synchronization playout time of the media and media. The apparatus for transmitting the media data comprises a packetizer for generating a delivery layer packet (D-layer packet), which packetizes encapsulation layer data (E-layer data) to include timing information, wherein the timing information comprises sampling time information and transmission process delay information.
Abstract:
Disclosed is a media content reception method for providing augmenting media contents using graphic objects, including: receiving metadata including information representing each event of any one of broadcast contents or moving picture contents, and any one of the broadcast contents or the moving picture contents and graphic object related information associated with events, the events including at least one of a specific scene, a specific situation, and a specific phenomenon of any one of the broadcast contents or the moving picture contents; analyzing the received metadata; designating the graphic objects to correspond to each event within any one of the broadcast contents or the moving picture contents based on the analyzed metadata; and displaying the designated graphic objects to meet each event at the time of playing any one of the broadcast contents or the moving picture contents.