Processing text images with shadows
    1.
    Granted invention patent
    Processing text images with shadows (status: active)

    Publication number: US09460357B2

    Publication date: 2016-10-04

    Application number: US14150682

    Filing date: 2014-01-08

    Abstract: Embodiments disclosed facilitate robust, accurate, and reliable recovery of words and/or characters in the presence of non-uniform lighting and/or shadows. In some embodiments, a method to recover text from an image may comprise: expanding a Maximally Stable Extremal Region (MSER) in an image to obtain a neighborhood associated with the MSER, the neighborhood comprising a plurality of sub-blocks; thresholding a subset of the plurality of sub-blocks in the neighborhood, the subset comprising sub-blocks with text, wherein each sub-block in the subset is thresholded using a corresponding threshold associated with the sub-block; and obtaining a thresholded neighborhood.
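
    The per-sub-block thresholding described in this abstract can be illustrated with a minimal sketch. It is not the patented method: the use of Otsu's threshold per sub-block, the contrast test that decides which sub-blocks "contain text", and the names `otsu_threshold`, `threshold_neighborhood`, `block`, and `min_contrast` are all assumptions made for illustration. The neighborhood is assumed to be an already-expanded grayscale region given as a list of rows.

```python
def otsu_threshold(values):
    """Return the 8-bit threshold t that best separates values <= t from values > t."""
    hist = [0] * 256
    for v in values:
        hist[v] += 1
    total = len(values)
    sum_all = sum(i * h for i, h in enumerate(hist))
    best_t, best_score = 0, -1.0
    w0 = 0      # number of pixels at or below t
    sum0 = 0    # sum of intensities at or below t
    for t in range(256):
        w0 += hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        sum0 += t * hist[t]
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        score = w0 * w1 * (m0 - m1) ** 2   # proportional to between-class variance
        if score > best_score:
            best_score, best_t = score, t
    return best_t


def threshold_neighborhood(img, block=4, min_contrast=40):
    """Binarize each sub-block with its own threshold; dark pixels become 1.

    Assumption: a sub-block "contains text" when its intensity range is at
    least min_contrast. Other sub-blocks are left as background (0).
    """
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y0 in range(0, h, block):
        for x0 in range(0, w, block):
            cells = [(y, x)
                     for y in range(y0, min(y0 + block, h))
                     for x in range(x0, min(x0 + block, w))]
            vals = [img[y][x] for y, x in cells]
            if max(vals) - min(vals) < min_contrast:
                continue  # flat sub-block: treated as containing no text
            t = otsu_threshold(vals)
            for y, x in cells:
                out[y][x] = 1 if img[y][x] <= t else 0
    return out
```

    Because every sub-block gets its own threshold, a shadowed background (for example intensity 90) is not mistaken for text that, in the well-lit part of the image, has a similar intensity (for example 100). A single global threshold could not separate both cases.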


    Method of perspective correction for devanagari text
    2.
    Granted invention patent
    Method of perspective correction for devanagari text (status: active)

    Publication number: US09171204B2

    Publication date: 2015-10-27

    Application number: US13842985

    Filing date: 2013-03-15

    CPC classification number: G06K9/00469 G06K9/3283 G06K2009/363 G06K2209/01

    Abstract: An electronic device and method identify regions that are likely to be text in a natural image or video frame, followed by processing as follows: lines that are nearly vertical are automatically identified in a selected text region, oriented relative to the vertical axis within a predetermined range −max_theta to +max_theta, followed by determination of an angle θ of the identified lines, followed by use of the angle θ to perform perspective correction by warping the selected text region. After perspective correction in this manner, each text region is processed further, to recognize text therein, by performing OCR on each block among a sequence of blocks obtained by slicing the potential text region. Thereafter, the result of text recognition is used to display to the user, either the recognized text or any other information obtained by use of the recognized text.
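
    The angle-estimation step in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented procedure: line segments are assumed to be already detected and given as `(x1, y1, x2, y2)` tuples, the angle θ is taken as the median of the accepted angles, and the "warping" is approximated by a simple horizontal shear. The names `angle_from_vertical`, `estimate_theta`, and `shear_point` are hypothetical; `max_theta` follows the abstract.

```python
import math
from statistics import median


def angle_from_vertical(seg):
    """Signed angle in degrees between a segment and the vertical axis."""
    x1, y1, x2, y2 = seg
    dx, dy = x2 - x1, y2 - y1
    if dy < 0:                      # orient every segment so that it points downward
        dx, dy = -dx, -dy
    return math.degrees(math.atan2(dx, dy))


def estimate_theta(segments, max_theta=30.0):
    """Median angle of the segments lying within [-max_theta, +max_theta]."""
    angles = [a for a in map(angle_from_vertical, segments)
              if abs(a) <= max_theta]
    return median(angles) if angles else 0.0


def shear_point(x, y, theta_deg, y_ref=0.0):
    """Undo a lean of theta_deg by shifting x in proportion to the height y."""
    return x - (y - y_ref) * math.tan(math.radians(theta_deg)), y
```

    Applying `shear_point` to every pixel coordinate of the selected region makes strokes that leaned by θ vertical again. A full perspective correction would need a homography rather than a shear, so this sketch only covers the case where one angle describes the distortion.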


    Method Of Improving Contrast For Text Extraction And Recognition Applications
    3.
    Invention application
    Method Of Improving Contrast For Text Extraction And Recognition Applications (status: active)

    Publication number: US20150010233A1

    Publication date: 2015-01-08

    Application number: US14023306

    Filing date: 2013-09-10

    CPC classification number: G06K9/36

    Abstract: An electronic device and method receive (for example, from a memory) a grayscale image of a real-world scene captured by a camera of a mobile device. The electronic device and method also receive a color image from which the grayscale image is generated, wherein each color pixel is stored as a tuple of multiple components. The electronic device and method determine a new intensity for at least one grayscale pixel in the grayscale image, based on at least one component of a tuple of a color pixel located in correspondence to the at least one grayscale pixel. The determination may be done conditionally, by checking whether a local variance of intensities is below a predetermined threshold in a subset of grayscale pixels located adjacent to the at least one grayscale pixel, and selecting the component that provides the most local variance of intensities.
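
    The conditional component selection described in this abstract can be sketched for a single pixel. This is a minimal illustration, not the patented implementation: the square window, the population variance, and the names `window`, `new_intensity`, `var_threshold`, and `radius` are assumptions. The grayscale image is a list of rows of intensities and the color image is a list of rows of component tuples.

```python
from statistics import pvariance


def window(plane, y, x, radius=1):
    """Values of a 2-D plane inside a square window centred on (y, x)."""
    h, w = len(plane), len(plane[0])
    return [plane[j][i]
            for j in range(max(0, y - radius), min(h, y + radius + 1))
            for i in range(max(0, x - radius), min(w, x + radius + 1))]


def new_intensity(gray, color, y, x, var_threshold=25.0, radius=1):
    """Return a replacement intensity for the grayscale pixel at (y, x).

    If the grayscale neighbourhood already has enough variance, the pixel is
    kept. Otherwise the colour component with the largest local variance
    supplies the new intensity.
    """
    if pvariance(window(gray, y, x, radius)) >= var_threshold:
        return gray[y][x]
    n_components = len(color[y][x])
    # Split the colour image into one 2-D plane per component.
    planes = [[[px[c] for px in row] for row in color]
              for c in range(n_components)]
    best = max(range(n_components),
               key=lambda c: pvariance(window(planes[c], y, x, radius)))
    return color[y][x][best]
```

    This covers the case where text and background differ in color but map to nearly the same gray level: the grayscale window is flat, while one color component still separates the two.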


    Method of improving contrast for text extraction and recognition applications
    4.
    Granted invention patent
    Method of improving contrast for text extraction and recognition applications (status: active)

    Publication number: US09171224B2

    Publication date: 2015-10-27

    Application number: US14023306

    Filing date: 2013-09-10

    CPC classification number: G06K9/36

    Abstract: An electronic device and method receive (for example, from a memory) a grayscale image of a real-world scene captured by a camera of a mobile device. The electronic device and method also receive a color image from which the grayscale image is generated, wherein each color pixel is stored as a tuple of multiple components. The electronic device and method determine a new intensity for at least one grayscale pixel in the grayscale image, based on at least one component of a tuple of a color pixel located in correspondence to the at least one grayscale pixel. The determination may be done conditionally, by checking whether a local variance of intensities is below a predetermined threshold in a subset of grayscale pixels located adjacent to the at least one grayscale pixel, and selecting the component that provides the most local variance of intensities.


    AUTOMATIC CORRECTION OF SKEW IN NATURAL IMAGES AND VIDEO
    5.
    Invention application
    AUTOMATIC CORRECTION OF SKEW IN NATURAL IMAGES AND VIDEO (status: active)

    Publication number: US20140022406A1

    Publication date: 2014-01-23

    Application number: US13831237

    Filing date: 2013-03-14

    Abstract: An electronic device and method use a camera to capture an image of the outside environment, followed by identification of regions therein. A subset of the regions is selected, based on attributes of the regions, such as aspect ratio, height, and variance in stroke width. Next, a number of angles that are candidates for use as skew of the image are determined (e.g., one angle is selected for each region, based on peakiness of a histogram of the region evaluated at different angles). Then, an angle that is most common among these candidates is identified as the angle of skew of the image. The just-described identification of skew angle is performed prior to classification of any region as text or non-text. After skew identification, at least all regions in the subset are rotated by the negative of the skew angle, to obtain skew-corrected regions for use in optical character recognition.
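
    The voting scheme in this abstract (one candidate angle per region, then the most common candidate wins) can be sketched as follows. This is an illustration under assumptions, not the patented method: each region is a list of `(x, y)` pixel coordinates, the histogram is a count of pixels per row after rotating by the negative of the candidate angle, and "peakiness" is taken to be the largest row count. The names `peakiness`, `region_angle`, `estimate_skew`, and `candidates` are hypothetical.

```python
import math
from collections import Counter


def peakiness(points, angle_deg):
    """Largest row count after rotating the region by -angle_deg."""
    a = math.radians(angle_deg)
    rows = Counter(round(-x * math.sin(a) + y * math.cos(a))
                   for x, y in points)
    return max(rows.values())


def region_angle(points, candidates):
    """Candidate angle at which the row histogram of one region is peakiest."""
    return max(candidates, key=lambda a: peakiness(points, a))


def estimate_skew(regions, candidates=range(-15, 16, 5)):
    """Most common per-region candidate angle, used as the skew of the image."""
    votes = Counter(region_angle(r, candidates) for r in regions)
    return votes.most_common(1)[0][0]
```

    Once `estimate_skew` returns an angle, each selected region would be rotated by its negative before OCR. Note that no region has to be classified as text or non-text for the vote to work, which matches the ordering the abstract describes.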


    PROCESSING TEXT IMAGES WITH SHADOWS
    6.
    Invention application
    PROCESSING TEXT IMAGES WITH SHADOWS (status: active)

    Publication number: US20150193667A1

    Publication date: 2015-07-09

    Application number: US14150682

    Filing date: 2014-01-08

    Abstract: Embodiments disclosed facilitate robust, accurate, and reliable recovery of words and/or characters in the presence of non-uniform lighting and/or shadows. In some embodiments, a method to recover text from an image may comprise: expanding a Maximally Stable Extremal Region (MSER) in an image to obtain a neighborhood associated with the MSER, the neighborhood comprising a plurality of sub-blocks; thresholding a subset of the plurality of sub-blocks in the neighborhood, the subset comprising sub-blocks with text, wherein each sub-block in the subset is thresholded using a corresponding threshold associated with the sub-block; and obtaining a thresholded neighborhood.


    Automatic correction of skew in natural images and video
    7.
    Granted invention patent
    Automatic correction of skew in natural images and video (status: active)

    Publication number: US09076242B2

    Publication date: 2015-07-07

    Application number: US13831237

    Filing date: 2013-03-14

    Abstract: An electronic device and method use a camera to capture an image of the outside environment, followed by identification of regions therein. A subset of the regions is selected, based on attributes of the regions, such as aspect ratio, height, and variance in stroke width. Next, a number of angles that are candidates for use as skew of the image are determined (e.g., one angle is selected for each region, based on peakiness of a histogram of the region evaluated at different angles). Then, an angle that is most common among these candidates is identified as the angle of skew of the image. The just-described identification of skew angle is performed prior to classification of any region as text or non-text. After skew identification, at least all regions in the subset are rotated by the negative of the skew angle, to obtain skew-corrected regions for use in optical character recognition.


    Method of Perspective Correction For Devanagari Text
    8.
    Invention application
    Method of Perspective Correction For Devanagari Text (status: active)

    Publication number: US20140161365A1

    Publication date: 2014-06-12

    Application number: US13842985

    Filing date: 2013-03-15

    CPC classification number: G06K9/00469 G06K9/3283 G06K2009/363 G06K2209/01

    Abstract: An electronic device and method identify regions that are likely to be text in a natural image or video frame, followed by processing as follows: lines that are nearly vertical are automatically identified in a selected text region, oriented relative to the vertical axis within a predetermined range −max_theta to +max_theta, followed by determination of an angle θ of the identified lines, followed by use of the angle θ to perform perspective correction by warping the selected text region. After perspective correction in this manner, each text region is processed further, to recognize text therein, by performing OCR on each block among a sequence of blocks obtained by slicing the potential text region. Thereafter, the result of text recognition is used to display to the user, either the recognized text or any other information obtained by use of the recognized text.

