SYSTEM AND METHOD FOR ANALYZING AND CORRECTING RETAIL DATA

    公开(公告)号:US20080162464A1

    公开(公告)日:2008-07-03

    申请号:US11926360

    申请日:2007-10-29

    IPC分类号: G06F7/06 G06F17/30

    摘要: A computer system and method is disclosed that analyzes and corrects retail data. The system and method includes several client workstations and one or more servers coupled together over a network. A database stores various data used by the system. A business logic server uses competitive and complementary fusion to analyze and correct some of the data sources stored in database server. The data fusion process itself is an iterative one—utilizing both competitive and complementary fusion methods. In competitive fusion, two or more data sources that provide overlapping attributes are compared against each other. More accurate/reliable sources are used to correct less accurate/reliable sources. In complementary fusion, relationships modeled where data sources overlap are projected to areas of the data framework in which fewer sources exist—enhancing the accuracy/reliability of those fewer sources even in the absence of the other sources upon which the models were based.

    CLUSTER PROCESSING OF AN AGGREGATED DATASET
    8.
    发明申请
    CLUSTER PROCESSING OF AN AGGREGATED DATASET 审中-公开
    聚集数据集的集群处理

    公开(公告)号:US20090006309A1

    公开(公告)日:2009-01-01

    申请号:US12023267

    申请日:2008-01-31

    IPC分类号: G06F17/30

    摘要: Systems and methods are presented that may involve receiving a aggregated dataset, wherein the aggregated dataset includes data from a panel data source, a fact data source, and a dimension data source that have been associated with a standard population database. The process may also involve storing the aggregated data in a partition within a partitioned database, wherein the partition is associated with a data characteristic. The process may also involve associating a master processing node with a plurality of slave nodes, wherein each of the plurality of slave nodes is associated with a partition of the partitioned database. The process may also involve submitting an analytic query to the master processing node. The process may also involve assigning analytic processing to at least one of the plurality of slave nodes by the master processing node, wherein the assignment is based at least in part on the association of the partition with the data characteristic. The process may also involve reading the aggregated data from the partitioned database by the assigned slave node. The process may also involve analyzing the aggregated data by the assigned slave node, wherein the analysis produces a result at each slave node. The process may also involve combining the results from each of the plurality of slave nodes by the master processing node into a master result and reporting the master result to a user interface.

    摘要翻译: 呈现可能涉及接收聚合数据集的系统和方法,其中聚合数据集包括已经与标准人口数据库相关联的面板数据源,事实数据源和维度数据源的数据。 该过程还可以涉及将聚合数据存储在分区数据库中的分区中,其中分区与数据特征相关联。 该过程还可以涉及将主处理节点与多个从节点相关联,其中多个从节点中的每一个与分区数据库的分区相关联。 该过程还可以涉及向主处理节点提交分析查询。 该过程还可以包括由主处理节点向多个从节点中的至少一个分配处理分配,其中分配至少部分地基于分区与数据特征的关联。 该过程还可以涉及由所分配的从节点读取来自分区数据库的聚合数据。 该过程还可以涉及通过分配的从节点分析聚合数据,其中分析在每个从节点处产生结果。 该过程还可以包括将来自主处理节点的多个从节点中的每一个的结果组合成主结果并将主结果报告给用户界面。