- 专利标题: Holistic document search
-
申请号: US16536968申请日: 2019-08-09
-
公开(公告)号: US11093469B2公开(公告)日: 2021-08-17
- 发明人: Ying Chen , Ioana Roxana Stanoi , Su Yan , Mo Yu
- 申请人: International Business Machines Corporation
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 代理商 Jeffrey S LaBaw; William Stock; Ingrid M Foerster
- 主分类号: G06F16/22
- IPC分类号: G06F16/22 ; G06F16/93 ; G06F16/36
摘要:
A set of documents is parsed. Members of the set of documents include a set of text elements and a set of visual elements. A text content stream based on the set of text elements and a visual content stream based on the set of visual elements are produced. For respective documents, a set of respective visual element summarizations is built from the visual content stream. Each visual summarization includes a textual description of a respective visual element in the respective document. A holistic index is created by indexing the text content from the text content stream and the text descriptions of the visual elements for each document in a single search index. The indexing uses a set of semantic relationships between the text content from the text content stream and the textual descriptions of the visual elements. A user interface allows a user to selectively search text content and visual content.
信息查询