发明申请
- 专利标题: AUTO-MAINTAINED DOCUMENT CLASSIFICATION
- 专利标题(中): 自动维护的文档分类
-
申请号: US14492914申请日: 2014-09-22
-
公开(公告)号: US20150012470A1公开(公告)日: 2015-01-08
- 发明人: Yigal S. Dayan , Gil Fuchs , Josemina M. Magdalen , Irit Maharian , Yariv Tzaban
- 申请人: International Business Machines Corporation
- 主分类号: G06N5/04
- IPC分类号: G06N5/04 ; G06N99/00
摘要:
Machines, systems and methods for maintaining a representative data set in a document classification system, the method comprising: including an initial set of seed representative data in a representative data set (RDS) implemented for a knowledge base (KB), wherein the KB is trained to classify documents provided to a document classification system based on analysis of the representative documents included in the RDS and a set of rules, wherein the seed representative data includes a balanced number of representative data across a plurality of classes; updating the RDS by adding or removing representative data from the RDS based on feedback received about accuracy of classification of one or more documents by the classification system; and retraining the KB, wherein the retraining is performed based on occurrence of one or more events.
公开/授权文献
- US09195947B2 Auto-maintained document classification 公开/授权日:2015-11-24
信息查询