Abstract:
A method, non-transitory computer-readable medium, and apparatus for localizing a region of interest using a hand gesture are disclosed. For example, the method acquires an image containing the hand gesture from an ego-centric video, detects pixels that correspond to one or more hands in the image using a hand segmentation algorithm, identifies a hand enclosure in the detected pixels, localizes the region of interest based on the hand enclosure, and performs an action based on an object in the region of interest.
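The enclosure and localization steps in the abstract above can be illustrated with a minimal sketch. It assumes the hand segmentation algorithm (not specified in the abstract) has already produced a binary mask, and it treats the "hand enclosure" as any background area fully surrounded by hand pixels. The function name `localize_enclosure` and the flood-fill approach are hypothetical choices for illustration, not the disclosed method.

```python
from collections import deque


def localize_enclosure(hand_mask):
    """Return (top, left, bottom, right) of the area enclosed by hand pixels.

    hand_mask is a list of rows; truthy values mark hand pixels.
    Returns None when the hand pixels do not enclose any area.
    Assumption: the enclosure is found by flood-filling the background
    from the image border; whatever the fill cannot reach is enclosed.
    """
    rows, cols = len(hand_mask), len(hand_mask[0])
    outside = [[False] * cols for _ in range(rows)]
    queue = deque()

    # Seed the fill with every non-hand pixel on the image border.
    for r in range(rows):
        for c in range(cols):
            on_border = r in (0, rows - 1) or c in (0, cols - 1)
            if on_border and not hand_mask[r][c]:
                outside[r][c] = True
                queue.append((r, c))

    # 4-connected flood fill over non-hand pixels.
    while queue:
        r, c = queue.popleft()
        for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            if (0 <= nr < rows and 0 <= nc < cols
                    and not hand_mask[nr][nc] and not outside[nr][nc]):
                outside[nr][nc] = True
                queue.append((nr, nc))

    # Non-hand pixels the fill never reached lie inside the enclosure.
    enclosed = [(r, c) for r in range(rows) for c in range(cols)
                if not hand_mask[r][c] and not outside[r][c]]
    if not enclosed:
        return None
    rs = [r for r, _ in enclosed]
    cs = [c for _, c in enclosed]
    return (min(rs), min(cs), max(rs), max(cs))
```

The returned bounding box could then be used to crop the image so that a downstream step acts on the object inside it.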
Abstract:
A method, non-transitory computer-readable medium, and apparatus for adaptively sampling an ego-centric video to extract features for performing an analysis are disclosed. For example, the method captures the ego-centric video, determines a spatio-temporal location of interest within the ego-centric video, applies adaptive sampling centered around the spatio-temporal location of interest to obtain one or more spatio-temporal patches, extracts one or more features using the one or more spatio-temporal patches, and performs an analysis based on the one or more features.
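One way to picture sampling that is "centered around" a spatio-temporal location is to draw patch centers from a distribution that is densest at that location. The sketch below assumes a Gaussian density, a (frame, row, column) coordinate order, and a hypothetical function name `sample_patch_centers`; the abstract does not state which sampling scheme is used.

```python
import random


def sample_patch_centers(center, video_shape, n_patches=50,
                         sigma=(2.0, 10.0, 10.0), seed=0):
    """Draw patch centers that cluster around a location of interest.

    center      -- (frame, row, col) of the location of interest
    video_shape -- (num_frames, height, width) of the video
    sigma       -- assumed standard deviation per axis; smaller values
                   concentrate samples more tightly around the center
    Returns a list of n_patches (frame, row, col) tuples inside the video.
    """
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    centers = []
    for _ in range(n_patches):
        point = []
        for coord, spread, size in zip(center, sigma, video_shape):
            value = int(round(rng.gauss(coord, spread)))
            # Clamp to the valid index range of this axis.
            point.append(min(max(value, 0), size - 1))
        centers.append(tuple(point))
    return centers
```

A spatio-temporal patch would then be cut around each returned center, and features extracted from those patches would feed the analysis step.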
Abstract:
A method, non-transitory computer-readable medium, and apparatus for localizing a region of interest using a dynamic hand gesture are disclosed. For example, the method captures an ego-centric video containing the dynamic hand gesture, analyzes a frame of the ego-centric video to detect pixels that correspond to a fingertip using a hand segmentation algorithm, temporally analyzes one or more frames of the ego-centric video to compute a path of the fingertip in the dynamic hand gesture, localizes the region of interest based on the path of the fingertip in the dynamic hand gesture, and performs an action based on an object in the region of interest.
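The fingertip-path steps in the abstract above can be sketched as follows. The sketch assumes each frame has already been segmented into a binary hand mask, takes the topmost hand pixel as the fingertip, and uses the bounding box of the traced path as the region of interest. These heuristics and the names `find_fingertip`, `fingertip_path`, and `roi_from_path` are hypothetical; the abstract does not specify them.

```python
def find_fingertip(hand_mask):
    """Return (row, col) of the topmost hand pixel, or None if no hand.

    Assumption: in an ego-centric view the pointing finger extends
    upward, so the first hand pixel in row-major order is the tip.
    """
    for r, row in enumerate(hand_mask):
        for c, value in enumerate(row):
            if value:
                return (r, c)
    return None


def fingertip_path(frames):
    """Collect the fingertip location from each frame, in temporal order.

    Frames in which no hand pixel is detected are skipped.
    """
    path = []
    for mask in frames:
        tip = find_fingertip(mask)
        if tip is not None:
            path.append(tip)
    return path


def roi_from_path(path):
    """Return (top, left, bottom, right) bounding the path, or None."""
    if not path:
        return None
    rs = [r for r, _ in path]
    cs = [c for _, c in path]
    return (min(rs), min(cs), max(rs), max(cs))
```

For a gesture that traces a rough rectangle around an object, `roi_from_path(fingertip_path(frames))` yields the box that a later step could crop and act on.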