DATA CLUSTERING SYSTEM AND METHOD
    1.
    Invention application
    Status: Under examination (published)

    Publication number: US20150134660A1

    Publication date: 2015-05-14

    Application number: US14080096

    Filing date: 2013-11-14

    CPC classification number: G06F16/285

    Abstract: A system includes identification of a first dataset comprising n data samples, identification of b data samples of the n data samples of the first dataset, wherein b is less than n, creation of a first plurality of datasets, each of the first plurality of datasets comprising m data samples, where m is greater than b, and wherein each of the m data samples of each of the first plurality of datasets is selected from the b data samples, identification of c data samples of the n data samples of the first dataset, wherein c is less than n, and wherein the c data samples are not identical to the b data samples, creation of a second plurality of datasets, each of the second plurality of datasets comprising p data samples, where p is greater than c, and wherein each of the p data samples of each of the second plurality of datasets is selected from the c data samples, identification, for each of the b data samples, of a cluster based on the first plurality of datasets, and identification, for each of the c data samples, of a cluster based on the second plurality of datasets.
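The resampling scheme in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how samples are drawn, which clustering algorithm is used, or how results from the several datasets are combined, so everything beyond the n/b/m/c/p size relationships is an assumption made here for illustration: sampling with replacement, a tiny one-dimensional k-means, and a majority vote. All function names are hypothetical.

```python
import random
from collections import Counter


def resample(subset, size, count, rng):
    """Build `count` datasets of `size` samples, each drawn with replacement from `subset`."""
    if size <= len(subset):
        raise ValueError("each resampled dataset must be larger than its source subset")
    return [[rng.choice(subset) for _ in range(size)] for _ in range(count)]


def kmeans_1d(values, k, iterations=25):
    """Tiny 1-D k-means (an assumed stand-in); returns centroids sorted ascending
    so that cluster labels are comparable across runs."""
    ordered = sorted(values)
    # Deterministic initialisation: seed centroids at evenly spaced quantiles.
    centroids = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda j: abs(v - centroids[j]))
            groups[nearest].append(v)
        # An empty group keeps its previous centroid.
        centroids = [sum(g) / len(g) if g else centroids[j] for j, g in enumerate(groups)]
    return sorted(centroids)


def cluster_subset(subset, size, count, k, rng):
    """Label each subset sample by majority vote over the resampled datasets (assumed rule)."""
    votes = {x: Counter() for x in subset}
    for dataset in resample(subset, size, count, rng):
        centroids = kmeans_1d(dataset, k)
        for x in subset:
            votes[x][min(range(k), key=lambda j: abs(x - centroids[j]))] += 1
    return {x: c.most_common(1)[0][0] for x, c in votes.items()}


rng = random.Random(0)
data = [0.1, 10.1, 0.3, 9.8, 0.2, 9.9, 0.4, 10.3, 0.25, 10.0]  # first dataset, n = 10
first = data[:6]    # b = 6 samples, b < n
second = data[4:]   # c = 6 samples, c < n, not identical to the b samples
labels_b = cluster_subset(first, size=12, count=5, k=2, rng=rng)   # m = 12 > b
labels_c = cluster_subset(second, size=12, count=5, k=2, rng=rng)  # p = 12 > c
```

Because each resampled dataset is larger than the subset it is drawn from, some samples necessarily repeat. The two subsets are processed independently, so in principle they could be handled in parallel.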


SYSTEM AND METHOD FOR DISTRIBUTED COMPUTING USING AUTOMATED PROVISIONING OF HETEROGENEOUS COMPUTING RESOURCES
    2.
    Invention application
    Status: Under examination (published)

    Publication number: US20140189703A1

    Publication date: 2014-07-03

    Application number: US13730450

    Filing date: 2012-12-28

    CPC classification number: G06F9/50 G06F9/5027 G06F2209/5011

    Abstract: A system for distributed computing includes a job scheduler module configured to identify a job request including request requirements and comprising one or more individual jobs. The system also includes a resource module configured to determine an execution set of computing resources from a pool of computing resources based on the request requirements. Each computing resource of the pool of computing resources has an application programming interface. The pool of computing resources comprises public cloud computing resources and internal computing resources. The system further includes a plurality of interface modules, where each interface module is configured to facilitate communication with the computing resources using the associated application programming interface. The system also includes an executor module configured to identify the appropriate interface module based on facilitating communication with the execution computing resource and transmit jobs for execution to the execution computing resource using the interface modules.
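The module layout in the abstract resembles an adapter arrangement: one interface module per resource API, with an executor that routes each job through the matching one. The following Python sketch is only an illustration of that reading. The class names, API labels, the `min_cores` requirement, and the round-robin placement are all hypothetical and do not come from the application.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Resource:
    name: str
    api: str            # which programming interface this resource exposes
    cores: int
    public_cloud: bool  # True for public cloud, False for internal resources


class InterfaceModule:
    """Base adapter; one subclass per resource API."""
    api = ""

    def submit(self, resource, job):
        raise NotImplementedError


class CloudInterface(InterfaceModule):
    api = "cloud-rest"  # hypothetical API label

    def submit(self, resource, job):
        return f"POST {job} -> {resource.name}"


class InternalInterface(InterfaceModule):
    api = "internal-rpc"  # hypothetical API label

    def submit(self, resource, job):
        return f"RPC {job} -> {resource.name}"


def select_execution_set(pool, min_cores):
    """Resource module: keep the resources that meet the request requirements."""
    return [r for r in pool if r.cores >= min_cores]


def execute(jobs, execution_set, interfaces):
    """Executor module: send each job through the interface matching the resource's API."""
    by_api = {i.api: i for i in interfaces}
    results = []
    for index, job in enumerate(jobs):
        resource = execution_set[index % len(execution_set)]  # assumed round-robin placement
        results.append(by_api[resource.api].submit(resource, job))
    return results


pool = [
    Resource("cloud-vm-1", "cloud-rest", cores=8, public_cloud=True),
    Resource("rack-node-7", "internal-rpc", cores=16, public_cloud=False),
    Resource("old-desktop", "internal-rpc", cores=2, public_cloud=False),
]
request = {"jobs": ["job-a", "job-b", "job-c"], "min_cores": 4}
chosen = select_execution_set(pool, request["min_cores"])
log = execute(request["jobs"], chosen, [CloudInterface(), InternalInterface()])
```

Keeping the API-specific logic inside the interface modules means the executor never needs to know whether a resource is a public cloud instance or an internal machine.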

