Method of improving contrast for text extraction and recognition applications
    11.
    发明授权
    Method of improving contrast for text extraction and recognition applications 有权
    提高文本提取和识别应用对比度的方法

    公开(公告)号:US09171224B2

    公开(公告)日:2015-10-27

    申请号:US14023306

    申请日:2013-09-10

    CPC classification number: G06K9/36

    Abstract: An electronic device and method receive (for example, from a memory), a grayscale image of a scene of real world captured by a camera of a mobile device. The electronic device and method also receive a color image from which the grayscale image is generated, wherein each color pixel is stored as a tuple of multiple components. The electronic device and method determine a new intensity for at least one grayscale pixel in the grayscale image, based on at least one component of a tuple of a color pixel located in correspondence to the at least one grayscale pixel. The determination may be done conditionally, by checking whether a local variance of intensities is below a predetermined threshold in a subset of grayscale pixels located adjacent to the at least one grayscale pixel, and selecting the component to provide most local variance of intensities.

    Abstract translation: 接收(例如,从存储器)的电子设备和方法,由移动设备的照相机捕获的真实世界场景的灰度图像。 电子设备和方法还接收生成灰度图像的彩色图像,其中每个颜色像素被存储为多个分量的元组。 电子设备和方法基于与至少一个灰度像素相对应的彩色像素的元组的至少一个分量,确定灰度图像中的至少一个灰度像素的新强度。 可以通过在位于与至少一个灰度像素相邻的灰度级像素的子集中检查强度的局部方差是否低于预定阈值,并且选择该组件以提供最大局部的强度方差来有条件地进行确定。

    AUTOMATIC CORRECTION OF SKEW IN NATURAL IMAGES AND VIDEO
    12.
    发明申请
    AUTOMATIC CORRECTION OF SKEW IN NATURAL IMAGES AND VIDEO 有权
    自动校正自然图像和视频

    公开(公告)号:US20140022406A1

    公开(公告)日:2014-01-23

    申请号:US13831237

    申请日:2013-03-14

    Abstract: An electronic device and method use a camera to capture an image of an environment outside followed by identification of regions therein. A subset of the regions is selected, based on attributes of the regions, such as aspect ratio, height, and variance in stroke width. Next, a number of angles that are candidates for use as skew of the image are determined (e.g. one angle is selected for each region. based on peakiness of a histogram of the region, evaluated at different angles). Then, an angle that is most common among these candidates is identified as the angle of skew of the image. The just-described identification of skew angle is performed prior to classification of any region as text or non-text. After skew identification, at least all regions in the subset are rotated by negative of the skew angle, to obtain skew-corrected regions for use in optical character recognition.

    Abstract translation: 电子设备和方法使用相机来捕获外部环境的图像,然后识别其中的区域。 基于区域的属性,例如纵横比,高度和笔画宽度方差,选择区域的子集。 接下来,确定作为图像的偏斜使用的候选者的多个角度(例如,基于区域的直方图的峰值,以不同的角度进行评估,针对每个区域选择一个角度)。 然后,在这些候选中最常见的角度被识别为图像的偏斜角。 在将任何区域分类为文本或非文本之前执行刚刚描述的倾斜角的识别。 在偏斜识别之后,子集中的至少所有区域以歪斜角度的相位旋转,以获得用于光学字符识别的偏斜校正区域。

    Trellis based word decoder with reverse pass

    公开(公告)号:US09639783B2

    公开(公告)日:2017-05-02

    申请号:US14698528

    申请日:2015-04-28

    CPC classification number: G06K9/72 G06K2209/01

    Abstract: Systems, apparatuses, and methods to relate images of words to a list of words are provided. A trellis based word decoder analyses a set of OCR characters and probabilities using a forward pass across a forward trellis and a reverse pass across a reverse trellis. Multiple paths may result, however, the most likely path from the trellises has the highest probability with valid links. A valid link is determined from the trellis by some dictionary word traversing the link. The most likely path is compared with a list of words to find the word closest to the most.

    Parameter selection and coarse localization of interest regions for MSER processing
    14.
    发明授权
    Parameter selection and coarse localization of interest regions for MSER processing 有权
    MSER处理的兴趣区域的参数选择和粗定位

    公开(公告)号:US09183458B2

    公开(公告)日:2015-11-10

    申请号:US13796729

    申请日:2013-03-12

    Abstract: An attribute is computed based on pixel intensities in an image of the real world, and thereafter used to identify at least one input for processing the image to identify at least a first maximally stable extremal region (MSER) therein. The at least one input is one of (A) a parameter used in MSER processing or (B) a portion of the image to be subject to MSER processing. The attribute may be a variance of pixel intensities, or computed from a histogram of pixel intensities. The attribute may be used with a look-up table, to identify parameter(s) used in MSER processing. The attribute may be a stroke width of a second MSER of a subsampled version of the image. The attribute may be used in checking whether a portion of the image satisfies a predetermined test, and if so including the portion in a region to be subject to MSER processing.

    Abstract translation: 基于现实世界的图像中的像素强度来计算属性,然后用于识别用于处理图像的至少一个输入以识别其中的至少第一最大稳定的极值区域(MSER)。 至少一个输入是(A)在MSER处理中使用的参数或(B)要进行MSER处理的图像的一部分中的一个。 该属性可以是像素强度的方差,或者从像素强度的直方图计算。 该属性可以与查找表一起使用,以识别在MSER处理中使用的参数。 属性可以是图像的子采样版本的第二MSER的笔画宽度。 该属性可以用于检查图像的一部分是否满足预定的测试,并且如果包括要进行MSER处理的区域中的部分。

    PROCESSING TEXT IMAGES WITH SHADOWS
    15.
    发明申请
    PROCESSING TEXT IMAGES WITH SHADOWS 有权
    用SHADOWS处理文本图像

    公开(公告)号:US20150193667A1

    公开(公告)日:2015-07-09

    申请号:US14150682

    申请日:2014-01-08

    Abstract: Embodiments disclosed facilitate robust, accurate, and reliable recovery of words and/or characters in the presence of non-uniform lighting and/or shadows. In some embodiments, a method to recover text from image may comprise: expanding a Maximally Stable Extremal Region (MSER) in an image, the neighborhood comprising a plurality of sub-blocks; thresholding a subset of the plurality of sub-blocks in the neighborhood, the subset comprising sub-blocks with text, wherein each sub-block in the subset is thresholded using a corresponding threshold associated with the sub-block; and obtaining a thresholded neighborhood.

    Abstract translation: 所公开的实施例有助于在存在非均匀照明和/或阴影的情况下对字和/或字符的鲁棒,准确和可靠的恢复。 在一些实施例中,从图像中恢复文本的方法可以包括:在图像中扩展最大稳定的极大区域(MSER),所述邻域包括多个子块; 阈值化所述邻域中的所述多个子块的子集,所述子集包括具有文本的子块,其中使用与所述子块相关联的对应阈值对所述子集中的每个子块进行阈值化; 并获得阈值邻域。

    Automatic correction of skew in natural images and video
    16.
    发明授权
    Automatic correction of skew in natural images and video 有权
    自动修正自然图像和视频中的偏斜

    公开(公告)号:US09076242B2

    公开(公告)日:2015-07-07

    申请号:US13831237

    申请日:2013-03-14

    Abstract: An electronic device and method use a camera to capture an image of an environment outside followed by identification of regions therein. A subset of the regions is selected, based on attributes of the regions, such as aspect ratio, height, and variance in stroke width. Next, a number of angles that are candidates for use as skew of the image are determined (e.g. one angle is selected for each region. based on peakiness of a histogram of the region, evaluated at different angles). Then, an angle that is most common among these candidates is identified as the angle of skew of the image. The just-described identification of skew angle is performed prior to classification of any region as text or non-text. After skew identification, at least all regions in the subset are rotated by negative of the skew angle, to obtain skew-corrected regions for use in optical character recognition.

    Abstract translation: 电子设备和方法使用相机来捕获外部环境的图像,然后识别其中的区域。 基于区域的属性,例如纵横比,高度和笔画宽度方差,选择区域的子集。 接下来,确定作为图像的偏斜使用的候选者的多个角度(例如,基于区域的直方图的峰值,以不同的角度进行评估,针对每个区域选择一个角度)。 然后,在这些候选中最常见的角度被识别为图像的偏斜角。 在将任何区域分类为文本或非文本之前执行刚刚描述的倾斜角的识别。 在偏斜识别之后,子集中的至少所有区域以歪斜角度的相位旋转,以获得用于光学字符识别的偏斜校正区域。

Patent Agency Ranking