Abstract:
To determine real-world information about objects moving in a scene, the camera capturing the scene is typically calibrated to the scene. Automatic scene calibration can be accomplished using people that are found moving about in the scene. During a calibration period, a video content analysis system processing video frames from a camera can identify blobs that are associated with people. Using an estimated height of a typical person, the video content analysis system can use the location of the person's head and feet to determine a mapping between the person's location in the 2-D video frame and the person's location in the 3-D real world. This mapping can be used to determine a cost for estimated extrinsic parameters for the camera. Using a hierarchical global estimation mechanism, the video content analysis system can determine the estimated extrinsic parameters with the lowest cost.
Abstract:
To determine real-world information about objects moving in a scene, the camera capturing the scene is typically calibrated to the scene. Automatic scene calibration can be accomplished using people that are found moving about in the scene. During a calibration period, a video content analysis system processing video frames from a camera can identify blobs that are associated with people. Using an estimated height of a typical person, the video content analysis system can use the location of the person's head and feet to determine a mapping between the person's location in the 2-D video frame and the person's location in the 3-D real world. This mapping can be used to determine a cost for estimated extrinsic parameters for the camera. Using a hierarchical global estimation mechanism, the video content analysis system can determine the estimated extrinsic parameters with the lowest cost.
Abstract:
A method for detecting and tracking a target object is described. The method includes performing motion-based tracking for a current video frame by comparing a previous video frame and the current video frame. The method also includes selectively performing object detection in the current video frame based on a tracked parameter.
Abstract:
A method performed by an electronic device is described. The method includes determining a local motion pattern by determining a set of local motion vectors within a region of interest between a previous frame and a current frame. The method also includes determining a global motion pattern by determining a set of global motion vectors between the previous frame and the current frame. The method further includes calculating a separation metric based on the local motion pattern and the global motion pattern. The separation metric indicates a motion difference between the local motion pattern and the global motion pattern. The method additionally includes tracking an object based on the separation metric.
Abstract:
A method of generating metadata includes using at least one digital image to select a plurality of objects, wherein the at least one digital image depicts the plurality of objects in relation to a physical space. The method also includes, by at least one processor and based on information indicating positions of the selected objects in a location space, producing metadata that identifies one among a plurality of candidate geometrical arrangements of the selected objects.
Abstract:
A method performed by an electronic device is described. The method includes interleaving multiple input image channels to produce an interleaved multi-channel input. The method also includes loading the interleaved multi-channel input to a single-instruction multiple data (SIMD) processor. The method further includes convolving the interleaved multi-channel input with a multi-channel filter.
Abstract:
A method for object classification by an electronic device is described. The method includes obtaining an image frame that includes an object. The method also includes determining samples from the image frame. Each of the samples represents a multidimensional feature vector. The method further includes adding the samples to a training set for the image frame. The method additionally includes pruning one or more samples from the training set to produce a pruned training set. One or more non-support vector negative samples are pruned first. One or more non-support vector positive samples are pruned second if necessary to avoid exceeding a sample number threshold. One or more support vector samples are pruned third if necessary to avoid exceeding the sample number threshold. The method also includes updating classifier model weights based on the pruned training set.
Abstract:
A method performed by an electronic device is described. The method includes determining a local motion pattern by determining a set of local motion vectors within a region of interest between a previous frame and a current frame. The method also includes determining a global motion pattern by determining a set of global motion vectors between the previous frame and the current frame. The method further includes calculating a separation metric based on the local motion pattern and the global motion pattern. The separation metric indicates a motion difference between the local motion pattern and the global motion pattern. The method additionally includes tracking an object based on the separation metric.
Abstract:
A method for image scanning by an electronic device is described. The method includes obtaining an image pyramid including a plurality of scale levels and at least a first pyramid level for a frame. The method also includes providing a scanning window. The method further includes scanning at least two of the plurality of scale levels of the frame at a plurality of scanning window locations. A number of scanning window locations is equal for each scale level of the at least two scale levels of the first pyramid level.
Abstract:
A method for object classification by an electronic device is described. The method includes obtaining an image frame that includes an object. The method also includes determining samples from the image frame. Each of the samples represents a multidimensional feature vector. The method further includes adding the samples to a training set for the image frame. The method additionally includes pruning one or more samples from the training set to produce a pruned training set. One or more non-support vector negative samples are pruned first. One or more non-support vector positive samples are pruned second if necessary to avoid exceeding a sample number threshold. One or more support vector samples are pruned third if necessary to avoid exceeding the sample number threshold. The method also includes updating classifier model weights based on the pruned training set.