Abstract:
An electronic device and method capture multiple images of a scene of real world at a several zoom levels, the scene of real world containing text of one or more sizes. Then the electronic device and method extract from each of the multiple images, one or more text regions, followed by analyzing an attribute that is relevant to OCR in one or more versions of a first text region as extracted from one or more of the multiple images. When an attribute has a value that meets a limit of optical character recognition (OCR) in a version of the first text region, the version of the first text region is provided as input to OCR.
Abstract:
An electronic device and method identify regions that are likely to be text in a natural image or video frame, followed by processing as follows: lines that are nearly vertical are automatically identified in a selected text region, oriented relative to the vertical axis within a predetermined range −max_theta to +max_theta, followed by determination of an angle θ of the identified lines, followed by use of the angle θ to perform perspective correction by warping the selected text region. After perspective correction in this manner, each text region is processed further, to recognize text therein, by performing OCR on each block among a sequence of blocks obtained by slicing the potential text region. Thereafter, the result of text recognition is used to display to the user, either the recognized text or any other information obtained by use of the recognized text.
Abstract:
In several aspects of described embodiments, an electronic device and method use a camera to capture an image or a frame of video of an environment outside the electronic device followed by identification of blocks of regions in the image. Each block that contains a region is checked, as to whether a test for presence of a line of pixels is met. When the test is met for a block, that block is identified as pixel-line-present. Pixel-line-present blocks are used to identify blocks that are adjacent. One or more adjacent block(s) may be merged with a pixel-line-present block when one or more rules are found to be satisfied, resulting in a merged block. The merged block is then subject to the above-described test, to verify presence of a line of pixels therein, and when the test is satisfied the merged block is processed normally, e.g. classified as text or non-text.