Abstract:
A computer-implemented method of tracking a target object in an object recognition system includes acquiring a plurality of images with a camera and simultaneously tracking the target object and dynamically building online map data from the plurality of images. Tracking of the target object is based on the online map data and the offline map data. In one aspect, tracking the target object includes enabling only one of the online map data and offline map data for tracking based on whether tracking is successful. In another aspect, tracking the target object includes fusing the online map data with the offline map data to generate a fused online model.
Abstract:
Disclosed are a system, apparatus, and method for detecting objects. Input image frames may be received and distinct regions created within each frame. Descriptors may be extracted from the regions according to their associated probability. Extracted descriptors may be matched to reference descriptors. Votes or confidence is cast for particular regions according to region properties. The region properties may be determined from center voting methods based on vector intersection to other vectors or intersections with a region. The probability of selecting particular regions can increase with each vote or increase in confidence for a region. In response to updating probabilities, additional regions may be selected and additional descriptors may be extracted. Additional voting iterations can update the probability of selecting a next region. An object pose may be estimated in response to meeting one or more thresholds.
Abstract:
Disclosed are a system, apparatus, and method for detecting objects. Input image frames may be received and distinct regions created within each frame. Descriptors may be extracted from the regions according to their associated probability. Extracted descriptors may be matched to reference descriptors. Votes or confidence is cast for particular regions according to region properties. The region properties may be determined from center voting methods based on vector intersection to other vectors or intersections with a region. The probability of selecting particular regions can increase with each vote or increase in confidence for a region. In response to updating probabilities, additional regions may be selected and additional descriptors may be extracted. Additional voting iterations can update the probability of selecting a next region. An object pose may be estimated in response to meeting one or more thresholds.
Abstract:
Exemplary methods, apparatuses, and systems for performing object detection on a mobile device are disclosed. A reference dataset comprising a set of reference keyframes for an object captured in a plurality of different lighting environments is obtained. An image of the object in a current lighting environment is captured. Reference keyframes are grouped into respective subsets according to one or more of: a reference keyframe camera position and orientation (pose), a reference keyframe lighting environment, or a combination thereof. Feature points of the image are compared with feature points of the reference keyframes in each of the respective subsets. A candidate subset of reference keyframes from the respective subsets is selected in response to the comparing feature points. A reference keyframe from the candidate subset of reference keyframes is selected for triangulation with the image of the object.