发明申请
- 专利标题: METHOD AND SYSTEM FOR IDENTIFYING CLUSTERS WITHIN A COLLECTION OF DATA ENTITIES
- 专利标题(中): 在收集数据实体中识别群集的方法和系统
-
申请号: US13686995申请日: 2012-11-28
-
公开(公告)号: US20140149410A1公开(公告)日: 2014-05-29
- 发明人: Richard J. Kenefic , John G. Watts
- 申请人: Richard J. Kenefic , John G. Watts
- 申请人地址: US MA Waltham
- 专利权人: Raytheon Company
- 当前专利权人: Raytheon Company
- 当前专利权人地址: US MA Waltham
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
Embodiments of a method and system for identifying clusters in collections of data entities are generally described herein. In some embodiments, the method includes defining a metric space over the data entities. A distance function of the metric space may satisfy the triangle inequality. The method may include determining, based on the distance function of the metric space, a value for a number of clusters that minimizes a number of data bits used to define a model of the collection of the data entities. The model may thereby describe the collection of data entities using a minimum description length (MDL). The method may include assigning data entities of the collection of data entities to the clusters. The number of clusters to which the data entities are assigned may correspond to the determined value.
公开/授权文献
信息查询