Processing text images with shadows
    1.
    发明授权
    Processing text images with shadows 有权
    用阴影处理文本图像

    公开(公告)号:US09460357B2

    公开(公告)日:2016-10-04

    申请号:US14150682

    申请日:2014-01-08

    Abstract: Embodiments disclosed facilitate robust, accurate, and reliable recovery of words and/or characters in the presence of non-uniform lighting and/or shadows. In some embodiments, a method to recover text from image may comprise: expanding a Maximally Stable Extremal Region (MSER) in an image, the neighborhood comprising a plurality of sub-blocks; thresholding a subset of the plurality of sub-blocks in the neighborhood, the subset comprising sub-blocks with text, wherein each sub-block in the subset is thresholded using a corresponding threshold associated with the sub-block; and obtaining a thresholded neighborhood.

    Abstract translation: 所公开的实施例有助于在存在非均匀照明和/或阴影的情况下对字和/或字符的鲁棒,准确和可靠的恢复。 在一些实施例中,从图像中恢复文本的方法可以包括:在图像中扩展最大稳定的极大区域(MSER),所述邻域包括多个子块; 阈值化所述邻域中的所述多个子块的子集,所述子集包括具有文本的子块,其中使用与所述子块相关联的对应阈值对所述子集中的每个子块进行阈值化; 并获得阈值邻域。

    TRELLIS BASED WORD DECODER WITH REVERSE PASS
    2.
    发明申请
    TRELLIS BASED WORD DECODER WITH REVERSE PASS 有权
    基于TRELLIS的文字解码器与反向通过

    公开(公告)号:US20150242710A1

    公开(公告)日:2015-08-27

    申请号:US14698528

    申请日:2015-04-28

    CPC classification number: G06K9/72 G06K2209/01

    Abstract: Systems, apparatuses, and methods to relate images of words to a list of words are provided. A trellis based word decoder analyses a set of OCR characters and probabilities using a forward pass across a forward trellis and a reverse pass across a reverse trellis. Multiple paths may result, however, the most likely path from the trellises has the highest probability with valid links. A valid link is determined from the trellis by some dictionary word traversing the link. The most likely path is compared with a list of words to find the word closest to the most.

    Abstract translation: 提供了将词的图像与词列表相关联的系统,装置和方法。 基于网格的字解码器使用前向网格的正向传递和跨反向网格的反向传递来分析一组OCR字符和概率。 可能会导致多个路径,但是,来自网格的最可能的路径具有有效链接的最高概率。 通过一些通过链接的字典字词从网格确定有效的链接。 最可能的路径与单词列表进行比较,以找到最接近的单词。

    Identifying A Maximally Stable Extremal Region (MSER) In An Image By Skipping Comparison Of Pixels In The Region
    3.
    发明申请
    Identifying A Maximally Stable Extremal Region (MSER) In An Image By Skipping Comparison Of Pixels In The Region 有权
    通过跳过区域中像素的比较,在图像中识别最大稳定的极地区(MSER)

    公开(公告)号:US20140023271A1

    公开(公告)日:2014-01-23

    申请号:US13797433

    申请日:2013-03-12

    Abstract: A difference in intensities of a pair of pixels in an image is repeatedly compared to a threshold, with the pair of pixels being separated by at least one pixel (“skipped pixel”). When the threshold is found to be exceeded, a selected position of a selected pixel in the pair, and at least one additional position adjacent to the selected position are added to a set of positions. The comparing and adding are performed multiple times to generate multiple such sets, each set identifying a region in the image, e.g. an MSER. Sets of positions, identifying regions whose attributes satisfy a test, are merged to obtain a merged set. Intensities of pixels identified in the merged set are used to generate binary values for the region, followed by classification of the region as text/non-text. Regions classified as text are supplied to an optical character recognition (OCR) system.

    Abstract translation: 图像中的一对像素的强度的差异与阈值重复比较,其中该对像素被至少一个像素(“跳过的像素”)分开。 当发现阈值被超过时,将一对所选择的像素的选定位置和与选定位置相邻的至少一个附加位置添加到一组位置。 执行比较和添加多次以产生多个这样的集合,每个集合标识图像中的区域,例如。 一个MSER。 集合的位置,识别属性满足测试的区域被合并以获得合并集。 在合并集中识别的像素的强度用于生成该区域的二进制值,随后将该区域分类为文本/非文本。 分类为文本的区域被提供给光学字符识别(OCR)系统。

    Identifying a maximally stable extremal region (MSER) in an image by skipping comparison of pixels in the region
    4.
    发明授权
    Identifying a maximally stable extremal region (MSER) in an image by skipping comparison of pixels in the region 有权
    通过跳过该区域中的像素的比较来识别图像中最大稳定的极值区域(MSER)

    公开(公告)号:US09014480B2

    公开(公告)日:2015-04-21

    申请号:US13797433

    申请日:2013-03-12

    Abstract: A difference in intensities of a pair of pixels in an image is repeatedly compared to a threshold, with the pair of pixels being separated by at least one pixel (“skipped pixel”). When the threshold is found to be exceeded, a selected position of a selected pixel in the pair, and at least one additional position adjacent to the selected position are added to a set of positions. The comparing and adding are performed multiple times to generate multiple such sets, each set identifying a region in the image, e.g. an MSER. Sets of positions, identifying regions whose attributes satisfy a test, are merged to obtain a merged set. Intensities of pixels identified in the merged set are used to generate binary values for the region, followed by classification of the region as text/non-text. Regions classified as text are supplied to an optical character recognition (OCR) system.

    Abstract translation: 图像中的一对像素的强度的差异与阈值重复比较,其中该对像素被至少一个像素(“跳过的像素”)分开。 当发现阈值被超过时,将一对所选择的像素的选定位置和与选定位置相邻的至少一个附加位置添加到一组位置。 执行比较和添加多次以产生多个这样的集合,每个集合标识图像中的区域,例如。 一个MSER。 集合的位置,识别属性满足测试的区域被合并以获得合并集。 在合并集中识别的像素的强度用于生成该区域的二进制值,随后将该区域分类为文本/非文本。 分类为文本的区域被提供给光学字符识别(OCR)系统。

    Method Of Improving Contrast For Text Extraction And Recognition Applications
    5.
    发明申请
    Method Of Improving Contrast For Text Extraction And Recognition Applications 有权
    改进对比文本提取和识别应用的方法

    公开(公告)号:US20150010233A1

    公开(公告)日:2015-01-08

    申请号:US14023306

    申请日:2013-09-10

    CPC classification number: G06K9/36

    Abstract: An electronic device and method receive (for example, from a memory), a grayscale image of a scene of real world captured by a camera of a mobile device. The electronic device and method also receive a color image from which the grayscale image is generated, wherein each color pixel is stored as a tuple of multiple components. The electronic device and method determine a new intensity for at least one grayscale pixel in the grayscale image, based on at least one component of a tuple of a color pixel located in correspondence to the at least one grayscale pixel. The determination may be done conditionally, by checking whether a local variance of intensities is below a predetermined threshold in a subset of grayscale pixels located adjacent to the at least one grayscale pixel, and selecting the component to provide most local variance of intensities.

    Abstract translation: 接收(例如,从存储器)的电子设备和方法,由移动设备的照相机捕获的真实世界场景的灰度图像。 电子设备和方法还接收生成灰度图像的彩色图像,其中每个颜色像素被存储为多个分量的元组。 电子设备和方法基于与至少一个灰度像素相对应的彩色像素的元组的至少一个分量,确定灰度图像中的至少一个灰度像素的新强度。 可以通过在位于与至少一个灰度像素相邻的灰度级像素的子集中检查强度的局部方差是否低于预定阈值,并且选择该组件以提供最大局部的强度方差来有条件地进行确定。

    Method Of Handling Complex Variants Of Words Through Prefix-Tree Based Decoding For Devanagiri OCR
    6.
    发明申请
    Method Of Handling Complex Variants Of Words Through Prefix-Tree Based Decoding For Devanagiri OCR 有权
    通过基于前缀树的解码处理复杂变体的方法用于Devanagiri OCR

    公开(公告)号:US20140023274A1

    公开(公告)日:2014-01-23

    申请号:US13828060

    申请日:2013-03-14

    Abstract: An electronic device and method identify a block of text in a portion of an image of real world captured by a camera of a mobile device, slice sub-blocks from the block and identify characters in the sub-blocks that form a first sequence to a predetermined set of sequences to identify a second sequence therein. The second sequence may be identified as recognized (as a modifier-absent word) when not associated with additional information. When the second sequence is associated with additional information, a check is made on pixels in the image, based on a test specified in the additional information. When the test is satisfied, a copy of the second sequence in combination with the modifier is identified as recognized (as a modifier-present word). Storage and use of modifier information in addition to a set of sequences of characters enables recognition of words with or without modifiers.

    Abstract translation: 电子设备和方法识别由移动设备的照相机捕获的真实世界的图像的一部分中的文本块,从块中切片子块,并将形成第一序列的子块中的字符识别为 用于识别其中的第二序列的预定序列集合。 当不与附加信息相关联时,第二序列可以被识别为被识别(作为不包含修饰词的词)。 当第二个序列与附加信息相关联时,将根据附加信息中指定的测试对图像中的像素进行检查。 当测试被满足时,与修饰符组合的第二个序列的拷贝被识别为识别(作为修饰语现在的词)。 修改器信息的存储和使用除了一组字符序列之外,还可以识别具有或不具有修饰符的单词。

    Trellis based word decoder with reverse pass
    7.
    发明授权
    Trellis based word decoder with reverse pass 有权
    基于网格的字解码器,带反向通道

    公开(公告)号:US09047540B2

    公开(公告)日:2015-06-02

    申请号:US13829960

    申请日:2013-03-14

    CPC classification number: G06K9/72 G06K2209/01

    Abstract: Systems, apparatuses, and methods to relate images of words to a list of words are provided. A trellis based word decoder analyses a set of OCR characters and probabilities using a forward pass across a forward trellis and a reverse pass across a reverse trellis. Multiple paths may result, however, the most likely path from the trellises has the highest probability with valid links. A valid link is determined from the trellis by some dictionary word traversing the link. The most likely path is compared with a list of words to find the word closest to the most.

    Abstract translation: 提供了将词的图像与词列表相关联的系统,装置和方法。 基于网格的字解码器使用前向网格的正向传递和跨反向网格的反向传递来分析一组OCR字符和概率。 可能会导致多个路径,但是,来自网格的最可能的路径具有有效链接的最高概率。 通过一些通过链接的字典字词从网格确定有效的链接。 最可能的路径与单词列表进行比较,以找到最接近的单词。

    TRELLIS BASED WORD DECODER WITH REVERSE PASS
    8.
    发明申请
    TRELLIS BASED WORD DECODER WITH REVERSE PASS 有权
    基于TRELLIS的文字解码器与反向通过

    公开(公告)号:US20140023273A1

    公开(公告)日:2014-01-23

    申请号:US13829960

    申请日:2013-03-14

    CPC classification number: G06K9/72 G06K2209/01

    Abstract: Systems, apparatuses, and methods to relate images of words to a list of words are provided. A trellis based word decoder analyses a set of OCR characters and probabilities using a forward pass across a forward trellis and a reverse pass across a reverse trellis. Multiple paths may result, however, the most likely path from the trellises has the highest probability with valid links. A valid link is determined from the trellis by some dictionary word traversing the link. The most likely path is compared with a list of words to find the word closest to the most.

    Abstract translation: 提供了将词的图像与词列表相关联的系统,装置和方法。 基于网格的字解码器使用前向网格的正向传递和跨反向网格的反向传递来分析一组OCR字符和概率。 可能会导致多个路径,但是,来自网格的最可能的路径具有有效链接的最高概率。 通过一些通过链接的字典字词从网格确定有效的链接。 最可能的路径与单词列表进行比较,以找到最接近的单词。

    Parameter Selection and Coarse Localization of Interest Regions for MSER Processing
    9.
    发明申请
    Parameter Selection and Coarse Localization of Interest Regions for MSER Processing 有权
    MSER处理的兴趣区域的参数选择和粗定位

    公开(公告)号:US20140023270A1

    公开(公告)日:2014-01-23

    申请号:US13796729

    申请日:2013-03-12

    Abstract: An attribute is computed based on pixel intensities in an image of the real world, and thereafter used to identify at least one input for processing the image to identify at least a first maximally stable extremal region (MSER) therein. The at least one input is one of (A) a parameter used in MSER processing or (B) a portion of the image to be subject to MSER processing. The attribute may be a variance of pixel intensities, or computed from a histogram of pixel intensities. The attribute may be used with a look-up table, to identify parameter(s) used in MSER processing. The attribute may be a stroke width of a second MSER of a subsampled version of the image. The attribute may be used in checking whether a portion of the image satisfies a predetermined test, and if so including the portion in a region to be subject to MSER processing.

    Abstract translation: 基于现实世界的图像中的像素强度来计算属性,然后用于识别用于处理图像的至少一个输入以识别其中的至少第一最大稳定的极值区域(MSER)。 至少一个输入是(A)在MSER处理中使用的参数或(B)要进行MSER处理的图像的一部分中的一个。 该属性可以是像素强度的方差,或者从像素强度的直方图计算。 该属性可以与查找表一起使用,以识别在MSER处理中使用的参数。 属性可以是图像的子采样版本的第二MSER的笔画宽度。 该属性可以用于检查图像的一部分是否满足预定的测试,并且如果包括要进行MSER处理的区域中的部分。

    Method of handling complex variants of words through prefix-tree based decoding for Devanagiri OCR
    10.
    发明授权
    Method of handling complex variants of words through prefix-tree based decoding for Devanagiri OCR 有权
    通过基于前缀树的解码处理复杂变体词的方法,用于Devanagiri OCR

    公开(公告)号:US09262699B2

    公开(公告)日:2016-02-16

    申请号:US13828060

    申请日:2013-03-14

    Abstract: An electronic device and method identify a block of text in a portion of an image of real world captured by a camera of a mobile device, slice sub-blocks from the block and identify characters in the sub-blocks that form a first sequence to a predetermined set of sequences to identify a second sequence therein. The second sequence may be identified as recognized (as a modifier-absent word) when not associated with additional information. When the second sequence is associated with additional information, a check is made on pixels in the image, based on a test specified in the additional information. When the test is satisfied, a copy of the second sequence in combination with the modifier is identified as recognized (as a modifier-present word). Storage and use of modifier information in addition to a set of sequences of characters enables recognition of words with or without modifiers.

    Abstract translation: 电子设备和方法识别由移动设备的相机捕获的真实世界的图像的一部分中的文本块,从块中切片子块,并将形成第一序列的子块中的字符识别为 用于识别其中的第二序列的预定序列集合。 当不与附加信息相关联时,第二序列可以被识别为被识别(作为不包含修饰词的词)。 当第二个序列与附加信息相关联时,将根据附加信息中指定的测试对图像中的像素进行检查。 当测试被满足时,与修饰符组合的第二个序列的拷贝被识别为识别(作为修饰语现在的词)。 修改器信息的存储和使用除了一组字符序列之外,还可以识别具有或不具有修饰符的单词。

Patent Agency Ranking