摘要:
A processor-implemented method including generating a depth-aware feature of an image dependent on image features extracted from image data of the image and generating image data, representing information corresponding to one or more segmentations of the image, based on the depth-aware feature and a depth-aware representation, the depth-aware representation being depth-related information and visual-related information for the image.
摘要:
A processor-implemented method includes: determining a probability that a pixel of an input image belongs to each of a plurality of preset categories; and determining a category of the pixel to be a category corresponding to either one or both of a plurality of category areas and a category determined based on the probability that the pixel belongs to each of the preset categories, based on a result of comparing, to a preset threshold value, a probability that the pixel belongs to the category corresponding to the category areas.
摘要:
A processor-implemented method includes acquiring, by a processor, a first undirected graph and a second undirected graph, generating, by the processor, a first lattice for the first undirected graph and a second lattice for the second undirected graph; matching, by the processor, the first lattice and the second lattice based on a first global structure of the first lattice and a second global structure of the second lattice, the first global structure corresponding to nodes of the first undirected graph and the second global structure corresponding to nodes of the second undirected graph, and processing the first undirected graph and the second undirected graph based on a result of the matching of the first lattice and the second lattice.
摘要:
A method to analyze a facial image includes: inputting a facial image to a residual network including residual blocks that are sequentially combined and arranged in a direction from an input to an output; processing the facial image using the residual network; and acquiring an analysis map from an output of an N-th residual block among the residual blocks using a residual deconvolution network, wherein the residual network transfers the output of the N-th residual block to the residual deconvolution network, and N is a natural number that is less than a number of all of the residual blocks, and wherein the residual deconvolution network includes residual deconvolution blocks that are sequentially combined, and the residual deconvolution blocks correspond to respective residual blocks from a first residual block among the residual blocks to the N-th residual block.
摘要:
A processor-implemented method includes: obtaining a video feature of a video comprising a plurality of video frames; determining a target object representation of the video based on the video feature using a neural network; and generating a panorama segmentation result of the video based on the target object representation.
摘要:
An image capturing apparatus and an image capturing method are provided. The image capturing apparatus includes an image capturing unit configured to capture an image; and a controller connected to the image capturing unit, wherein the controller is configured to obtain a background image with depth information, position a three-dimensional (3D) virtual image representing a target object in the background image based on the depth information, and control the image capturing unit to capture the target object based on a difference between the target object viewed from the image capturing apparatus and the 3D virtual image in the background image.
摘要:
A method and apparatus for detecting a three-dimensional (3D) point cloud point of interest (POI), the apparatus comprising a 3D point cloud data acquirer to acquire 3D point cloud data, a shape descriptor to generate a shape description vector describing a shape of a surface in which a pixel point of a 3D point cloud and a neighboring point of the pixel point are located, and a POI extractor to extract a POI based on the shape description vector is disclosed.
摘要:
A processor implemented method of processing a facial expression image, the method includes controlling a camera to capture a first facial expression image and a second facial expression image, acquiring a first expression feature of the first facial expression image, acquiring a second expression feature of the second facial expression image, generating a new expression feature dependent on differences between the acquired first expression feature and the acquired second expression feature, and adjusting a target facial expression image based on the new expression feature.
摘要:
A gaze tracking method and apparatus, and a gaze tracking neural network training method and apparatus are provided. The gaze tracking apparatus includes one or more processors and a memory, and the one or more processors obtain output position information from an input face image of a user using a neural network model, determines a position adjustment parameter for the user, and predicts gaze position information of the user by adjusting the output position information based on the position adjustment parameter.
摘要:
An apparatus and corresponding method are provided to match images and include assigning depth candidate values to a pixel in a first image, and reassigning third depth candidate values to a first pixel in the first image based on first depth candidate values assigned to the first pixel and second depth candidate values assigned to a second pixel adjacent to the first pixel. The apparatus and method also include determining one of the third depth candidate values to be a depth value of the first pixel, and matching the first pixel and a third pixel in a second image corresponding to the determined depth value of the first pixel.