Multiple channels of rasterized content for page decomposition using machine learning

发明授权

US11386685B2 Multiple channels of rasterized content for page decomposition using machine learning 有权

请登陆查看更多内容

专利标题： Multiple channels of rasterized content for page decomposition using machine learning
申请号： US16655363

申请日： 2019-10-17
公开(公告)号： US11386685B2

公开(公告)日： 2022-07-12
发明人: Verena Sabine Kaynig-Fittkau , Smitha Bangalore Naresh , Shawn Alan Gaither , Richard Cohn , Paul John Asente , Eylon Stroh , Emily Seminerio
申请人： Adobe Inc.
申请人地址： US CA San Jose
专利权人： Adobe Inc.
当前专利权人： Adobe Inc.
当前专利权人地址： US CA San Jose
代理机构： Finch & Maloney PLLC
主分类号： G06V30/413
IPC分类号： G06V30/413 ; G06N20/00 ; G06V30/412 ; G06V30/414

Multiple channels of rasterized content for page decomposition using machine learning

摘要：

Techniques are provided for identifying structural elements of a document. One Methodology includes generating a first channel of rasterized content by rasterizing a full page of the document and generating one or more additional channels of rasterized content from the page of the document by rasterizing one or more corresponding content types from the page of the document. Each of the one or more additional channels includes a specific type of content that is different from each of the other one or more additional channels. The methodology further includes inputting the first channel of rasterized content and the one or more additional channels of rasterized content into a machine learning (ML) model. The methodology continues with determining location and classification for each of a plurality of structural elements on the page of the document using the ML model.

公开/授权文献

US20210117666A1 MULTIPLE CHANNELS OF RASTERIZED CONTENT FOR PAGE DECOMPOSITION USING MACHINE LEARNING 公开/授权日：2021-04-22

信息查询

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06V	图像或视频识别或理解
G06V30/00	字符识别；数字墨迹识别；面向文档的基于图像的模式识别（文档等的扫描、传输或复制 H04N1/00）
G06V30/40	.面向文档的基于图像的模式识别
G06V30/41	..文件内容分析（基于代码标记的印刷字符识别G06V30/224）
G06V30/413	...内容分类，例如文字、照片或表格