Systems and methods for extracting signal differences from sparse data sets
Abstract:
The present disclosure provides systems and methods for extracting signal differences from sparse data sets. Data sets for comparison, including a control data set and one or more test data sets, may be normalized and separated into subsets or groupings via a MapReduce function. Normalization may account for large values, present in both the control and test data sets, that would otherwise reduce the apparent significance of smaller correlated values and create false negatives. The MapReduce function may identify and analyze correlations between the sets via related entities. Accordingly, via the systems and methods discussed herein, a computing device may extract statistically significant differences between data sets without requiring an extensive entity-by-entity comparison (or a comparison of each entity to every other entity, which, for a data set of millions of entities, may be too computationally expensive or too slow), thereby reducing memory footprint and processor requirements.
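By way of illustration only, the normalize-then-group flow described above may be sketched as follows. The sketch is not taken from the disclosure: the record layout (entity, group, value), the use of log scaling as the normalization, the per-group mean as the reduce step, and all function names are assumptions chosen for brevity, and no statistical significance test is shown.

```python
import math
from collections import defaultdict


def normalize(value):
    # Assumption: log scaling dampens large values that appear in both
    # the control and test sets so they do not swamp smaller signals.
    return math.log1p(value)


def map_phase(records):
    # Map step: emit (group, normalized value) pairs. `group` is a
    # hypothetical key linking related entities.
    for _entity, group, value in records:
        yield group, normalize(value)


def reduce_phase(pairs):
    # Reduce step: aggregate each group to the mean of its normalized values.
    totals = defaultdict(lambda: [0, 0.0])
    for group, value in pairs:
        totals[group][0] += 1
        totals[group][1] += value
    return {group: total / count for group, (count, total) in totals.items()}


def group_differences(control, test):
    # Compare per-group aggregates rather than every entity against every
    # other entity; a group missing from one set contributes 0.0.
    control_means = reduce_phase(map_phase(control))
    test_means = reduce_phase(map_phase(test))
    groups = set(control_means) | set(test_means)
    return {g: test_means.get(g, 0.0) - control_means.get(g, 0.0) for g in groups}


# Hypothetical data: group "g1" carries large values in both sets,
# while group "g2" carries a small value that changes in the test set.
control = [("a", "g1", 1000), ("b", "g1", 1000), ("c", "g2", 1)]
test = [("a", "g1", 1000), ("b", "g1", 1000), ("c", "g2", 3)]
diffs = group_differences(control, test)
```

In this sketch the work scales with the number of records and groups rather than with the number of entity pairs, which is the cost saving the abstract describes; an actual embodiment may use a different normalization, grouping key, or aggregate.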