发明授权
- 专利标题: Detecting errors in recognized text
- 专利标题(中): 检测识别文本中的错误
-
申请号: US13612273申请日: 2012-09-12
-
公开(公告)号: US09384389B1公开(公告)日: 2016-07-05
- 发明人: Viswanath Sankaranarayanan , Sridhar Jayaraman
- 申请人: Viswanath Sankaranarayanan , Sridhar Jayaraman
- 申请人地址: US NV Reno
- 专利权人: Amazon Technologies, Inc.
- 当前专利权人: Amazon Technologies, Inc.
- 当前专利权人地址: US NV Reno
- 代理机构: Lee & Hayes, PLLC
- 主分类号: G06K9/00
- IPC分类号: G06K9/00
摘要:
Some examples include detecting errors in text that has been recognized using automated text recognition technology. For instance, errors in the recognized text may be detected based on glyph image similarity and the use of a language model, dictionary information, or the like. Some implementations may group together glyphs based on association of the glyphs with the same glyph identifier and a similarity of the appearance of the glyphs. Furthermore, the words associated with each glyph may be checked against a language model, such as to check a spelling or other validity of the words, and a score may be assigned to each group of glyphs based on the validity of the words corresponding to the glyphs in that group. Groups that have a score that fails to meet a threshold may be reviewed by a person or may undergo automated correction techniques.
信息查询