Neural network-based diacritical marker recognition system and method
    1.
    发明授权
    Neural network-based diacritical marker recognition system and method 失效
    基于神经网络的变异标记识别系统及方法

    公开(公告)号:US5373566A

    公开(公告)日:1994-12-13

    申请号:US996440

    申请日:1992-12-24

    CPC分类号: G06K9/685

    摘要: A diacritical marker recognition system and method recognizes diacritical markers in a character image based upon an analysis by a neural network of the portion of the character image most likely to contain a diacritical marker. Once the neural network determines that a diacritical marker most likely exists in the character image, the system determines by using heuristics whether a diacritical marker exists or whether the character image appears to contain a diacritical marker which is actually a regular character.

    摘要翻译: 变异性标记识别系统和方法基于神经网络对最可能包含变音标记的角色图像的部分的分析来识别字符图像中的变音符号。 一旦神经网络确定字符图像中最可能存在变音符号,则系统通过使用启发式来确定是否存在变音符号,或者字符图像是否包含实际上是常规字符的变音标记。

    Method for identifying and resolving erroneous characters output by an
optical character recognition system
    3.
    发明授权
    Method for identifying and resolving erroneous characters output by an optical character recognition system 失效
    用于识别和解决由光学字符识别系统输出的错误字符的方法

    公开(公告)号:US5418864A

    公开(公告)日:1995-05-23

    申请号:US272451

    申请日:1994-07-11

    CPC分类号: G06K9/6292

    摘要: A post-processing method for an optical character recognition (OCR) method for combining different OCR engines to identify and resolve characters and attributes of the characters that are erroneously recognized by multiple optical character recognition engines. The characters can originate from many different types of character environments. OCR engine outputs are synchronized in order to detect matches and mismatches between said OCR engine outputs by using synchronization heuristics. The mismatches are resolved using resolution heuristics and neural networks. The resolution heuristics and neural networks are based on observing many different conventional OCR engines in different character environments to find what specific OCR engine correctly identifies a certain character having particular attributes. The results are encoded into the resolution heuristics and neural networks to create an optimal OCR post-processing solution.

    摘要翻译: 一种用于组合不同OCR引擎以识别和解析由多个光学字符识别引擎错误识别的字符的字符和属性的光学字符识别(OCR)方法。 角色可以源于许多不同类型的角色环境。 OCR引擎输出同步,以通过使用同步启发式来检测所述OCR引擎输出之间的匹配和不匹配。 使用分辨率启发式和神经网络来解决不匹配问题。 分辨率启发式和神经网络基于在不同字符环境中观察许多不同的常规OCR引擎,以找出具体的OCR引擎是否正确地识别具有特定属性的特定字符。 将结果编码为分辨率启发式和神经网络,以创建最佳的OCR后处理解决方案。

    Method and system for lexical processing
    4.
    发明授权
    Method and system for lexical processing 失效
    词法处理方法与系统

    公开(公告)号:US5802205A

    公开(公告)日:1998-09-01

    申请号:US573711

    申请日:1995-12-18

    摘要: A lexical processor and its method of use is provided. The lexical processor includes an input interface (300) and a word generator (302) for producing an output as a function of an input word and a confusion matrix. The confusion matrix is a handwriting error model that is based on the recognition capabilities of classifiers used in preprocessing inputs to the lexical processor. The lexical processor output comprises any of the following: the input word, a rejection indicator, a candidate replacement word, or a suggestion list of related words.

    摘要翻译: 提供了一种词法处理器及其使用方法。 词汇处理器包括用于产生作为输入字和混淆矩阵的函数的输出的输入接口(300)和字生成器(302)。 混淆矩阵是一种手写错误模型,其基于用于预处理输入到词法处理器的分类器的识别能力。 词汇处理器输出包括以下任一项:输入字,拒绝指示符,候选替换字或相关词的建议列表。