Invention Grant
US08015188B2 System and method for thematically grouping documents into clusters (Status: in force)

  • Patent Title: System and method for thematically grouping documents into clusters
  • Application No.: US12897710
    Application Date: 2010-10-04
  • Publication No.: US08015188B2
    Publication Date: 2011-09-06
  • Inventor: Dan Gallivan; Kenji Kawai
  • Applicant: Dan Gallivan; Kenji Kawai
  • Applicant Address: US MD Baltimore
  • Assignee: FTI Technology LLC
  • Current Assignee: FTI Technology LLC
  • Current Assignee Address: US MD Baltimore
  • Agent: Patrick J.S. Inouye; Krista A. Wittman
  • Main IPC: G06F17/30
  • IPC: G06F17/30
Abstract:
A system and method for thematically grouping documents into clusters is provided. Concepts are extracted from a plurality of documents. The concepts include nouns or noun phrases. A number of occurrences for each concept are determined within each document. A bounded range is applied to the concepts and a subset of the concepts is selected by removing the concepts that fall outside the bounded range. The bounded range includes upper edge conditions and lower edge conditions. Themes are generated from the subset of concepts by identifying two or more concepts with common semantic meaning. Clusters of the documents are generated based on the themes.
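The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and several pieces are assumptions made only for illustration: concept extraction is replaced by a simple lowercase word tokenizer with a small stopword list (the abstract specifies nouns or noun phrases), the bounded range is interpreted as lower and upper limits on the number of documents a concept appears in, and "common semantic meaning" is supplied by a hypothetical caller-provided mapping `theme_of`. All function names are hypothetical.

```python
import re
from collections import Counter, defaultdict

# Assumption: a tiny stopword list stands in for real noun/noun-phrase extraction.
STOPWORDS = {"the", "a", "an", "of", "and", "in", "to", "is", "for", "on"}


def extract_concepts(text):
    """Stand-in concept extractor: lowercase words minus stopwords."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def count_concepts(docs):
    """Count occurrences of each concept within each document."""
    return {doc_id: Counter(extract_concepts(text)) for doc_id, text in docs.items()}


def select_in_range(counts, lower, upper):
    """Keep concepts whose document frequency lies within [lower, upper].

    Assumption: the bounded range is applied to document frequency.
    """
    doc_freq = Counter()
    for concept_counts in counts.values():
        doc_freq.update(concept_counts.keys())
    return {c for c, df in doc_freq.items() if lower <= df <= upper}


def build_themes(concepts, theme_of):
    """Group concepts sharing a label in `theme_of`; keep groups of two or more."""
    themes = defaultdict(set)
    for concept in concepts:
        if concept in theme_of:
            themes[theme_of[concept]].add(concept)
    return {t: cs for t, cs in themes.items() if len(cs) >= 2}


def cluster_documents(counts, themes):
    """Assign each document to the theme whose concepts occur most often in it."""
    clusters = defaultdict(list)
    for doc_id, concept_counts in counts.items():
        scores = {t: sum(concept_counts[c] for c in cs) for t, cs in themes.items()}
        best = max(scores, key=scores.get, default=None)
        if best is not None and scores[best] > 0:
            clusters[best].append(doc_id)
    return dict(clusters)


# Illustrative data only; not taken from the patent.
docs = {
    "d1": "The car engine and the automobile engine",
    "d2": "An automobile and a car on the road",
    "d3": "The bank approved a loan and a mortgage",
    "d4": "A mortgage is a loan for the house",
}
theme_of = {"car": "vehicles", "automobile": "vehicles",
            "loan": "finance", "mortgage": "finance"}

counts = count_concepts(docs)
kept = select_in_range(counts, lower=2, upper=3)
themes = build_themes(kept, theme_of)
clusters = cluster_documents(counts, themes)
print(clusters)
```

In this sketch, concepts that appear in only one document (such as "engine" or "road") fall below the lower bound and are dropped before themes are formed, which mirrors the abstract's step of removing concepts outside the bounded range.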