发明授权
- 专利标题: Method and system for automatically classifying page images
- 专利标题(中): 自动分类页面图像的方法和系统
-
申请号: US11513444申请日: 2006-08-30
-
公开(公告)号: US08306326B2公开(公告)日: 2012-11-06
- 发明人: Bradley Jeffery Behm , Brent Eric Wood
- 申请人: Bradley Jeffery Behm , Brent Eric Wood
- 申请人地址: US NV Reno
- 专利权人: Amazon Technologies, Inc.
- 当前专利权人: Amazon Technologies, Inc.
- 当前专利权人地址: US NV Reno
- 代理机构: Knobbe Martens Olson & Bear LLP
- 主分类号: G06K9/34
- IPC分类号: G06K9/34
摘要:
A system and method are disclosed for automatically classifying images of pages of a source, such as a book, into classifications such as front cover, copyright page, table of contents, text, index, etc. In one embodiment, three phases are provided in the classification process. During a first phase of the classification process, a first classifier may be used to determine a preliminary classification of a page image based on single-page criteria. During a second phase of the classification process, a second classifier may be used to determine a final classification for the page image based on multiple-page and/or global criteria. During an optional third phase of classification, a verifier may be used to verify the final classification of the page image based on verification criteria. If automatic classification fails, the page image may be passed on to a human operator for manual classification.
公开/授权文献
信息查询