- 专利标题: SYSTEM AND METHOD TO EFFICIENTLY LABEL DOCUMENTS ALTERNATING MACHINE AND HUMAN LABELLING STEPS
-
申请号: US15181714申请日: 2016-06-14
-
公开(公告)号: US20170357909A1公开(公告)日: 2017-12-14
- 发明人: Jutta Katharina Willamowski , Yves Hoppenot , Jerome Pouyadou , Michel Langlais , Juan-Pablo Suarez
- 申请人: Xerox Corporation
- 申请人地址: US CT Norwalk
- 专利权人: Xerox Corporation
- 当前专利权人: Xerox Corporation
- 当前专利权人地址: US CT Norwalk
- 主分类号: G06N99/00
- IPC分类号: G06N99/00 ; G06F17/30
摘要:
A system and method that supports the efficient interactive identification of the most paper intensive document categories such that a maximum number of the documents belonging to those categories can be correctly categorized with a minimum effort and within a minimum amount of time is disclosed. Further, an iterative method combining automatic grouping mechanisms with human labelling. The system and method are configured to allow the automatic machine labelling to run iteratively to generate improved document clustering and categorization.
信息查询