SYSTEM AND METHOD FOR TRAINING A MACHINE LEARNING MODEL

    Publication number: US20240185135A1

    Publication date: 2024-06-06

    Application number: US18513849

    Filing date: 2023-11-20

    CPC classification number: G06N20/00

    Abstract: A system for generating a training dataset for a machine learning process, and training a machine learning model, the system comprising a data obtaining unit configured to obtain training data comprising a plurality of events of interest and the behaviour of an agent corresponding to those events, an event identifying unit configured to identify, based upon one or more corresponding indicators, the occurrence of an event of interest in the training data, a list generating unit configured to generate a list of identified events in the training data, wherein identified events are added to the list with a probability that is inversely proportional to the frequency of the occurrence of that event within the training data, a dataset generating unit configured to generate a dataset comprising information about the events contained in the generated list, and a training unit configured to train a machine learning model using the generated dataset, wherein the machine learning model is trained to generate behaviour for an agent corresponding to events within the generated dataset.
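The list-generating step in the abstract above can be illustrated with a minimal sketch. It assumes each identified event is reduced to a type label, and it picks the proportionality constant so that the rarest event type is always kept. Both choices, and the names `build_event_list`, `events`, and `rng`, are hypothetical and are not taken from the application.

```python
import random
from collections import Counter


def build_event_list(events, rng=None):
    """Return (index, label) pairs for the events admitted to the list.

    Each event is admitted with probability min_count / count(label),
    i.e. inversely proportional to how often its label occurs.
    The scaling by min_count is an assumption made for this sketch.
    """
    if not events:
        return []
    rng = rng or random.Random()
    counts = Counter(events)
    min_count = min(counts.values())
    selected = []
    for index, label in enumerate(events):
        keep_probability = min_count / counts[label]
        # random() is in [0, 1), so the rarest label (probability 1.0) is always kept.
        if rng.random() < keep_probability:
            selected.append((index, label))
    return selected


# Hypothetical usage: a common event and a rare one.
events = ["jump"] * 1000 + ["crash"] * 10
event_list = build_event_list(events, random.Random(0))
```

With this scaling, every "crash" event is retained, while each "jump" event is retained with probability 0.01, so both types end up with a similar expected number of entries in the list.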

    APPARATUS FOR GENERATING DATASETS FOR TRAINING MACHINE LEARNING MODELS, AND A METHOD THEREOF

    Publication number: US20240265679A1

    Publication date: 2024-08-08

    Application number: US18430733

    Filing date: 2024-02-02

    CPC classification number: G06V10/774

    Abstract: An apparatus for generating datasets for training machine learning models includes: a receiving unit configured to receive video data comprising sequential image frames; a storage unit configured to store a plurality of the sequential image frames; and a selecting unit configured to select, for a target image frame, a subset of stored image frames, the subset providing contextual data relating to the target image frame for the machine learning model. The selecting unit is configured to successively generate sampling values, wherein a difference between successive sampling values increases with each successively generated sampling value; and the selecting unit is configured to select a given image frame from the stored image frames in dependence upon whether a number of sequential image frames between the given image frame and the target image frame coincides with one of the successively generated sampling values.
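The frame-selection rule in the abstract above can be sketched as follows. The abstract only requires that the difference between successive sampling values keeps increasing; the particular schedule used here (1, 3, 6, 10, ...), the use of list indices as frame positions, and the names `sampling_values` and `select_context_frames` are assumptions made for illustration.

```python
def sampling_values(max_gap):
    """Generate sampling values up to max_gap with steadily growing differences.

    Assumed schedule: steps of 1, 2, 3, ... giving 1, 3, 6, 10, ...
    """
    values = []
    value, step = 0, 1
    while value + step <= max_gap:
        value += step
        values.append(value)
        step += 1  # each successive difference is larger than the last
    return values


def select_context_frames(stored_frames, target_index):
    """Pick earlier frames whose gap to the target matches a sampling value.

    The gap is the number of frames lying strictly between a stored frame
    and the target frame, so a frame at index i has gap target_index - i - 1.
    """
    max_gap = target_index - 1  # gap between frame 0 and the target
    return [
        stored_frames[target_index - 1 - gap]
        for gap in sampling_values(max_gap)
    ]


# Hypothetical usage: frames are stand-in integers rather than images.
frames = list(range(30))
context = select_context_frames(frames, 20)
```

Under this schedule, the selected context is dense near the target frame and increasingly sparse further back in time.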
