Invention Grant
US08015188B2 System and method for thematically grouping documents into clusters
有权
将文档主题分组为集群的系统和方法
- Patent Title: System and method for thematically grouping documents into clusters
- Patent Title (中): 将文档主题分组为集群的系统和方法
-
Application No.: US12897710Application Date: 2010-10-04
-
Publication No.: US08015188B2Publication Date: 2011-09-06
- Inventor: Dan Gallivan , Kenji Kawai
- Applicant: Dan Gallivan , Kenji Kawai
- Applicant Address: US MD Baltimore
- Assignee: FTI Technology LLC
- Current Assignee: FTI Technology LLC
- Current Assignee Address: US MD Baltimore
- Agent Patrick J.S. Inouye; Krista A. Wittman
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
A system and method for thematically grouping documents into clusters is provided. Concepts are extracted from a plurality of documents. The concepts include nouns or noun phrases. A number of occurrences for each concept are determined within each document. A bounded range is applied to the concepts and a subset of the concepts is selected by removing the concepts that fall outside the bounded range. The bounded range includes upper edge conditions and lower edge conditions. Themes are generated from the subset of concepts by identifying two or more concepts with common semantic meaning. Clusters of the documents are generated based on the themes.
Public/Granted literature
- US20110022597A1 System And Method For Thematically Grouping Documents Into Clusters Public/Granted day:2011-01-27
Information query