DATA CLUSTERING SYSTEM AND METHOD
    Type: Invention application
    Status: Under examination (published)

    Publication number: US20150134660A1

    Publication date: 2015-05-14

    Application number: US14080096

    Filing date: 2013-11-14

    CPC classification number: G06F16/285

    Abstract: A system includes identification of a first dataset comprising n data samples; identification of b data samples of the n data samples of the first dataset, wherein b is less than n; creation of a first plurality of datasets, each of the first plurality of datasets comprising m data samples, where m is greater than b, and wherein each of the m data samples of each of the first plurality of datasets is selected from the b data samples; identification of c data samples of the n data samples of the first dataset, wherein c is less than n, and wherein the c data samples are not identical to the b data samples; creation of a second plurality of datasets, each of the second plurality of datasets comprising p data samples, where p is greater than c, and wherein each of the p data samples of each of the second plurality of datasets is selected from the c data samples; identification, for each of the b data samples, of a cluster based on the first plurality of datasets; and identification, for each of the c data samples, of a cluster based on the second plurality of datasets.
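
    The steps in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation. The abstract does not name a clustering algorithm or say how the resampled datasets are combined, so the 1-D k-means in `kmeans_1d` and the majority vote in `cluster_subset` are assumptions, as are all function and parameter names. Drawing with replacement follows from m being greater than b (and p greater than c), since each resampled dataset must then repeat samples.

```python
import random
from collections import Counter


def kmeans_1d(values, k, iters=25):
    """Stand-in clustering step (assumption): plain 1-D k-means, k >= 2.

    Returns the k centers sorted ascending, so label 0 is the lowest center.
    """
    lo, hi = min(values), max(values)
    centers = [lo + (hi - lo) * i / (k - 1) for i in range(k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda j: abs(v - centers[j]))
            groups[nearest].append(v)
        # An empty group keeps its previous center.
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return sorted(centers)


def resample(subset, size, count, rng):
    """Build `count` datasets of `size` indices, drawn with replacement from `subset`."""
    return [rng.choices(subset, k=size) for _ in range(count)]


def cluster_subset(data, subset, size, count, k, rng):
    """Label each index in `subset` by majority vote over the resampled datasets.

    The vote is an assumed way of basing the cluster on the plurality of datasets.
    """
    votes = {i: Counter() for i in subset}
    for dataset in resample(subset, size, count, rng):
        centers = kmeans_1d([data[i] for i in dataset], k)
        for i in subset:
            label = min(range(k), key=lambda j: abs(data[i] - centers[j]))
            votes[i][label] += 1
    return {i: votes[i].most_common(1)[0][0] for i in subset}


def cluster_in_two_subsets(data, b, c, m, p, count, k, seed=0):
    """Run the two-subset procedure on a list of n one-dimensional samples."""
    rng = random.Random(seed)
    n = len(data)
    assert b < n and c < n and m > b and p > c
    first = rng.sample(range(n), b)    # the b data samples
    second = rng.sample(range(n), c)   # the c data samples
    while set(second) == set(first):   # the two subsets must not be identical
        second = rng.sample(range(n), c)
    return (cluster_subset(data, first, m, count, k, rng),
            cluster_subset(data, second, p, count, k, rng))
```

    Calling `cluster_in_two_subsets(data, b=4, c=4, m=12, p=12, count=3, k=2)` on a list of eight numbers returns two dictionaries, one per subset, each mapping a sample index to its voted cluster label.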

