REDUCING INSTANCES OF INCLUSION OF DATA ASSOCIATED WITH HINDSIGHT BIAS IN A TRAINING SET OF DATA FOR A MACHINE LEARNING SYSTEM

    公开(公告)号:US20200057959A1

    公开(公告)日:2020-02-20

    申请号:US16264659

    申请日:2019-01-31

    Abstract: Instances of data associated with hindsight bias in a training set of data for a machine learning system can be reduced. A first set of data, having a first set of fields, can be received. Data in a first field can be analyzed with respect to data in a second field corresponding to an event to be predicted. A result can be that the data in the first field is associated with hindsight bias. A second set of data, having a second set of fields, can be produced. The second set of fields can exclude the first field. One or more features associated with the second set of data can be generated. A third set of data, having the second set of fields and fields that correspond to the one or more features, can be produced. The training set of data can be produced using the third set of data.

Patent Agency Ranking