Abstract:
A method for stitching images by an electronic device is described. The method includes obtaining at least two images. The method also includes selecting a stitching scheme from a set of stitching schemes based on one or more content measures of the at least two images. The set of stitching schemes includes a first stitching scheme, a second stitching scheme, and a third stitching scheme. The method further includes stitching the at least two images based on a selected stitching scheme.
Abstract:
A method for stitching images by an electronic device is described. The method includes obtaining at least two images. The method also includes selecting a stitching scheme from a set of stitching schemes based on one or more content measures of the at least two images. The set of stitching schemes includes a first stitching scheme, a second stitching scheme, and a third stitching scheme. The method further includes stitching the at least two images based on a selected stitching scheme.
Abstract:
Techniques are described in which a device is configured to determine an overlap region between a first image and a second image, determine a first histogram based on color data included in the first image that corresponds to the overlap region, and determine a second histogram based on color data included in the second image that corresponds to the overlap region. The processor is further configured to determine, based on the first and second histograms, a mapping function that substantially maps the second histogram to the first histogram and apply the mapping function to the second image to generate a normalized second image with respect to the first image.
Abstract:
A method includes receiving, at a device, a plurality of image frames corresponding to a video stream. The plurality of image frames include a first image frame having a first resolution and a second image frame having a second resolution that is lower than the first resolution. The method also includes detecting, at the device, a trigger by analyzing the second image frame. The method further includes designating, at the device, the first image frame as an action frame based on the trigger.
Abstract:
Systems, apparatuses, and methods to relate images of words to a list of words are provided. A trellis based word decoder analyses a set of OCR characters and probabilities using a forward pass across a forward trellis and a reverse pass across a reverse trellis. Multiple paths may result, however, the most likely path from the trellises has the highest probability with valid links. A valid link is determined from the trellis by some dictionary word traversing the link. The most likely path is compared with a list of words to find the word closest to the most.
Abstract:
Certain aspects of the present disclosure relate to techniques for low-complexity encoding (compression) of broad class of signals, which are typically not well modeled as sparse signals in either time-domain or frequency-domain. First, the signal can be split in time-segments that may be either sparse in time domain or sparse in frequency domain, for example by using absolute second order differential operator on the input signal. Next, different encoding strategies can be applied for each of these time-segments depending in which domain the sparsity is present.
Abstract:
An attribute is computed based on pixel intensities in an image of the real world, and thereafter used to identify at least one input for processing the image to identify at least a first maximally stable extremal region (MSER) therein. The at least one input is one of (A) a parameter used in MSER processing or (B) a portion of the image to be subject to MSER processing. The attribute may be a variance of pixel intensities, or computed from a histogram of pixel intensities. The attribute may be used with a look-up table, to identify parameter(s) used in MSER processing. The attribute may be a stroke width of a second MSER of a subsampled version of the image. The attribute may be used in checking whether a portion of the image satisfies a predetermined test, and if so including the portion in a region to be subject to MSER processing.
Abstract:
Embodiments disclosed facilitate robust, accurate, and reliable recovery of words and/or characters in the presence of non-uniform lighting and/or shadows. In some embodiments, a method to recover text from image may comprise: expanding a Maximally Stable Extremal Region (MSER) in an image, the neighborhood comprising a plurality of sub-blocks; thresholding a subset of the plurality of sub-blocks in the neighborhood, the subset comprising sub-blocks with text, wherein each sub-block in the subset is thresholded using a corresponding threshold associated with the sub-block; and obtaining a thresholded neighborhood.
Abstract:
An electronic device and method use a camera to capture an image of an environment outside followed by identification of regions therein. A subset of the regions is selected, based on attributes of the regions, such as aspect ratio, height, and variance in stroke width. Next, a number of angles that are candidates for use as skew of the image are determined (e.g. one angle is selected for each region. based on peakiness of a histogram of the region, evaluated at different angles). Then, an angle that is most common among these candidates is identified as the angle of skew of the image. The just-described identification of skew angle is performed prior to classification of any region as text or non-text. After skew identification, at least all regions in the subset are rotated by negative of the skew angle, to obtain skew-corrected regions for use in optical character recognition.
Abstract:
In several aspects of described embodiments, an electronic device and method use a camera to capture an image or a frame of video of an environment outside the electronic device followed by identification of blocks of regions in the image. Each block that contains a region is checked, as to whether a test for presence of a line of pixels is met. When the test is met for a block, that block is identified as pixel-line-present. Pixel-line-present blocks are used to identify blocks that are adjacent. One or more adjacent block(s) may be merged with a pixel-line-present block when one or more rules are found to be satisfied, resulting in a merged block. The merged block is then subject to the above-described test, to verify presence of a line of pixels therein, and when the test is satisfied the merged block is processed normally, e.g. classified as text or non-text.