Abstract:
A method for providing multimodal translation of a content in a source language is provided. The method includes receiving a user input with respect to a translation request of text included in the content, in response to receiving the user input, acquiring a multimodal input from the content, the multimodal input including location information related to the content other multimodal inputs, generating scene information representing the multimodal input related to the content by using a fusion layer based on the location information and the other multimodal inputs, identifying a candidate word set in a target language, determining at least one candidate word from the candidate word set based on the scene information, and translating the text included in the content into the target language using a translation model based on the determined at least one candidate word.
Abstract:
An electronic device and method are provided. The electronic device includes a directional coupler, a sense pair connected to the directional coupler, and an analog-to-digital converter (ADC) connected to the sense pair. The ADC directly digitizes a signal current received from the sense pair.
Abstract:
An apparatus and method for parsing a human body image may be implemented by acquiring a depth image including a human body, and detecting a plurality of points in the acquired depth image by conducting a minimum energy skeleton scan on the depth image.
Abstract:
An apparatus and method for parsing a human body image may be implemented by acquiring a depth image including a human body, and detecting a plurality of points in the acquired depth image by conducting a minimum energy skeleton scan on the depth image.
Abstract:
Provided is a device and method for estimating a head pose which may obtain an excellent head pose recognition result free from the influence of an illumination change, the device including a head area extracting unit to extract a head area from an input depth image, a head pitch angle estimating unit to estimate a pitch angle of a head in the head area, a head yaw angle estimating unit to estimate a yaw angle of the head in the head area, and a head pose displaying unit to display a head pose based on the estimated pitch angle of the head and the estimated yaw angle of the head.
Abstract:
An apparatus for detecting a body part from a user image may include an image acquirer to acquire a depth image, an extractor to extract the user image from a foreground of the acquired depth image, and a body part detector to detect the body part from the user image, using a classifier trained based on at least one of a single-user image sample and a multi-user image sample. The single-user image may be an image representing non-overlapping users, and the multi-user image may be an image representing overlapping users.
Abstract:
A device and a method for image processing include an image processing device that may extract a foreground moving object from a depth map of a three-dimensional (3D) image that may include an image depth map acquirer to obtain the depth map of a successive 3D image over a period of time, a moving object segmenter to segment a moving object from the obtained depth map, and a moving object tracker to identify and track the segmented moving object.