发明授权
- 专利标题: Multiple channels of rasterized content for page decomposition using machine learning
-
申请号: US16655363申请日: 2019-10-17
-
公开(公告)号: US11386685B2公开(公告)日: 2022-07-12
- 发明人: Verena Sabine Kaynig-Fittkau , Smitha Bangalore Naresh , Shawn Alan Gaither , Richard Cohn , Paul John Asente , Eylon Stroh , Emily Seminerio
- 申请人: Adobe Inc.
- 申请人地址: US CA San Jose
- 专利权人: Adobe Inc.
- 当前专利权人: Adobe Inc.
- 当前专利权人地址: US CA San Jose
- 代理机构: Finch & Maloney PLLC
- 主分类号: G06V30/413
- IPC分类号: G06V30/413 ; G06N20/00 ; G06V30/412 ; G06V30/414
摘要:
Techniques are provided for identifying structural elements of a document. One Methodology includes generating a first channel of rasterized content by rasterizing a full page of the document and generating one or more additional channels of rasterized content from the page of the document by rasterizing one or more corresponding content types from the page of the document. Each of the one or more additional channels includes a specific type of content that is different from each of the other one or more additional channels. The methodology further includes inputting the first channel of rasterized content and the one or more additional channels of rasterized content into a machine learning (ML) model. The methodology continues with determining location and classification for each of a plurality of structural elements on the page of the document using the ML model.
公开/授权文献
信息查询