发明授权
- 专利标题: Document type identifying method and document type identifying apparatus
- 专利标题(中): 文件类型识别方法和文件类型识别装置
-
申请号: US12585155申请日: 2009-09-04
-
公开(公告)号: US08275792B2公开(公告)日: 2012-09-25
- 发明人: Akihiro Minagawa , Hiroaki Takebe , Katsuhito Fujimoto
- 申请人: Akihiro Minagawa , Hiroaki Takebe , Katsuhito Fujimoto
- 申请人地址: JP Kawasaki
- 专利权人: Fujitsu Limited
- 当前专利权人: Fujitsu Limited
- 当前专利权人地址: JP Kawasaki
- 代理机构: Staas & Halsey LLP
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
A document type identifying apparatus includes in advance a database storing therein keywords used as keys that identify document types in association with each document type. The document type identifying apparatus aligns word strings written on a document and generates partial keyword strings for each keyword by using the keywords stored in the database. The partial keyword strings are to be checked for matching with the word strings written on the document. Then, the document type identifying apparatus checks matching of the grouped and aligned word strings with the partial keyword strings and obtains, for each keyword, each number of matched words with the highest matching rates between the grouped word strings that are successfully matched and the partial keyword strings. Then, each number of matched words is used to calculate each evaluation value to determine the document type.
公开/授权文献
信息查询