- Patent title: Image reading apparatus and information processing apparatus that reads documents and generates image data
- Application number: US17105594
- Filing date: 2020-11-26
- Publication number: US11223727B2
- Publication date: 2022-01-11
- Inventors: Aida Yagon, Ronald Reyes, Charles Allera
- Applicant: KYOCERA Document Solutions Inc.
- Applicant address: JP Osaka
- Patentee: KYOCERA Document Solutions Inc.
- Current patentee: KYOCERA Document Solutions Inc.
- Current patentee address: JP Osaka
- Agency: Hawaii Patent Services
- Agent: Nathaniel K. Fedde; Kenton N. Fedde
- Priority: JP2019-213576 (2019-11-26)
- Main classification: H04N1/00
- IPC classification: H04N1/00
Abstract:
Provided is an image reading apparatus that eliminates the need for a user to correct character portions that OCR failed to recognize, thereby reducing the operational burden on the user. A non-word detecting unit detects a non-word, that is, a token not considered to be a word, among the plurality of words constituting the text in a document. A determining unit determines whether a compound word, obtained by combining the non-word with at least one of the word immediately before it and the word immediately after it in their order of arrangement, is a word. A character correcting unit identifies the text portion corresponding to the compound word as a failed character recognition portion and corrects the text of that portion to the text of the compound word.
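
The three units described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names `is_word` and `correct_text`, the whitespace tokenization, the plain set used as a dictionary, and the order in which candidate compounds are tried are all assumptions made for illustration. Punctuation handling is omitted.

```python
def is_word(token, dictionary):
    """Non-word detection (assumed): a token is a word if the dictionary contains it."""
    return token.lower() in dictionary


def correct_text(text, dictionary):
    """Merge a non-word with its neighbours when the combination is a word.

    Candidates are built in arrangement order and tried as:
    before + non-word + after, then before + non-word, then non-word + after.
    This trial order is an assumption; the abstract does not specify one.
    """
    tokens = text.split()  # assumed tokenization: split on whitespace
    out = []
    i = 0
    while i < len(tokens):
        tok = tokens[i]
        if is_word(tok, dictionary):
            out.append(tok)
            i += 1
            continue
        prev = out[-1] if out else None
        nxt = tokens[i + 1] if i + 1 < len(tokens) else None
        if prev is not None and nxt is not None and is_word(prev + tok + nxt, dictionary):
            out[-1] = prev + tok + nxt  # replace the failed portion with the compound
            i += 2
        elif prev is not None and is_word(prev + tok, dictionary):
            out[-1] = prev + tok
            i += 1
        elif nxt is not None and is_word(tok + nxt, dictionary):
            out.append(tok + nxt)
            i += 2
        else:
            out.append(tok)  # no compound is a word: leave the token unchanged
            i += 1
    return " ".join(out)


# Hypothetical sample dictionary, used only for this illustration.
SAMPLE_DICTIONARY = {
    "the", "document", "is", "ready", "read", "reading",
    "apparatus", "no", "standing", "notwithstanding",
}
```

For example, with the sample dictionary, `correct_text("the docu ment is ready", SAMPLE_DICTIONARY)` joins the non-word `docu` with the following token to recover `document`. A token that forms no dictionary word with either neighbour is left as it is.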