METHODS AND SYSTEMS FOR PREDICTIVE ANALYSIS OF TRANSACTION DATA USING MACHINE LEARNING

    公开(公告)号:US20240095549A1

    公开(公告)日:2024-03-21

    申请号:US18367563

    申请日:2023-09-13

    申请人: FMR LLC

    IPC分类号: G06N5/022

    CPC分类号: G06N5/022

    摘要: Methods and apparatuses are described for predictive analysis of transaction data using machine learning. A server computing device trains a plurality of machine learning models using historical transaction data for a set of entities as input to predict a likelihood of future transaction activity for each of the entities, each machine learning model trained on a different target transaction variable. The server computing device executes each of the plurality of machine learning models to generate, for each entity, a predicted likelihood value for a future transaction associated with the entity and each of the target transaction variables. The server computing device transmits the predicted likelihood values for each entity to a remote computing device for display.

    Intelligent generation of code for imputation of missing data in a machine learning dataset

    公开(公告)号:US12014157B2

    公开(公告)日:2024-06-18

    申请号:US17898815

    申请日:2022-08-30

    申请人: FMR LLC

    CPC分类号: G06F8/35 G06N5/01 G06N5/022

    摘要: Methods and apparatuses are described for intelligent imputation of missing data in a machine learning (ML) dataset comprised of a plurality of features. Each feature includes a plurality of values, where at least a portion of the values for one or more features are missing. A server analyzes the ML dataset to generate characteristics of the missing values in the ML dataset. The server selects an imputation algorithm for filling in the missing values based upon the identified characteristics. The server determines a computing environment in which the imputation algorithm is executed based upon one or more of a size of the ML dataset or the selected algorithm. The server generates code that comprises instructions for executing the imputation algorithm on the ML dataset in the computing environment. The server integrates the code into an ML platform that executes the code to assign replacement values to the missing values.

    INTELLIGENT GENERATION OF CODE FOR IMPUTATION OF MISSING DATA IN A MACHINE LEARNING DATASET

    公开(公告)号:US20240069874A1

    公开(公告)日:2024-02-29

    申请号:US17898815

    申请日:2022-08-30

    申请人: FMR LLC

    IPC分类号: G06F8/35 G06N5/00 G06N5/02

    CPC分类号: G06F8/35 G06N5/003 G06N5/022

    摘要: Methods and apparatuses are described for intelligent imputation of missing data in a machine learning (ML) dataset comprised of a plurality of features. Each feature includes a plurality of values, where at least a portion of the values for one or more features are missing. A server analyzes the ML dataset to generate characteristics of the missing values in the ML dataset. The server selects an imputation algorithm for filling in the missing values based upon the identified characteristics. The server determines a computing environment in which the imputation algorithm is executed based upon one or more of a size of the ML dataset or the selected algorithm. The server generates code that comprises instructions for executing the imputation algorithm on the ML dataset in the computing environment. The server integrates the code into an ML platform that executes the code to assign replacement values to the missing values.