Systems and methods for dataset recommendation in a zero-trust computing environment
摘要:
Systems and methods for recommendation of cohort sample sets is provided. In some embodiments, a set of dataset requirements is received as a required vector set. The historical vector sets are queried. Each vector set corresponds to a known dataset. The difference between the required vector set and each of the historical vector sets is calculated by framing the distance as a p-value in a hypothesis test, compared against a threshold. The historical vector set with the least difference to the required vector set is identified. The least difference is calculated as a count of differing classes or as a numerically weighted summation of differing classes.
信息查询
0/0