SYSTEMS AND METHODS FOR VALIDATING DATA

    Publication No.: US20220138034A1

    Publication date: 2022-05-05

    Application No.: US17573580

    Filing date: 2022-01-11

    Abstract: Systems and methods are provided for validating data in a data set. A data set including data to validate and a validator to use in validating the data are selected based on user input generated through a user's interactions with a graphical user interface. The validator is applied to the data to determine whether one or more statistics generated through application of the validator to the data are valid or invalid based on a validation routine associated with the validator. A data quality report indicating whether the data set is valid or invalid, based on a determination of whether the one or more statistics are valid or invalid, is generated and selectively presented to the user through the graphical user interface.
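
    The flow in this abstract (apply a validator, derive statistics, check each statistic with a validation routine, and roll the results up into a data quality report) can be illustrated with a minimal sketch. It is not the patented implementation: the `Validator` class, the `build_report` function, the thresholds, and the sample values are all hypothetical, and the graphical user interface is omitted.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Callable, Dict, List


@dataclass
class Validator:
    """Hypothetical validator: a statistic plus a routine that judges it."""
    name: str
    compute: Callable[[List[float]], float]  # generates a statistic from the data
    is_valid: Callable[[float], bool]        # validation routine for that statistic


def build_report(data: List[float], validators: List[Validator]) -> Dict:
    """Apply every validator and summarize the outcome as a report."""
    statistics = {}
    for validator in validators:
        value = validator.compute(data)
        statistics[validator.name] = {
            "statistic": value,
            "valid": validator.is_valid(value),
        }
    # Assumption: the data set is valid only if every statistic is valid.
    overall = all(entry["valid"] for entry in statistics.values())
    return {"valid": overall, "statistics": statistics}


# Illustrative validators; the thresholds are assumptions for this example.
validators = [
    Validator("row_count", lambda d: float(len(d)), lambda s: s >= 3),
    Validator("mean", mean, lambda s: 0.0 <= s <= 100.0),
]

report = build_report([10.0, 20.0, 30.0], validators)
print(report["valid"])
```

    Keeping the statistic and its validation routine as separate callables means a report can show both the computed value and the verdict, which is what a user would need to see why a data set was flagged.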

    Classification system with methodology for efficient verification
    Granted invention patent (in force)

    Publication No.: US09390086B2

    Publication date: 2016-07-12

    Application No.: US14483527

    Filing date: 2014-09-11

    Abstract: Techniques for a classification system with methodology for enhanced verification are described. In one approach, a classification computer trains a classifier based on a set of training documents. After training is complete, the classification computer iterates over a collection of unlabeled documents and uses the trained classifier to predict a label for each unlabeled document. A verification computer retrieves one of the documents assigned a label by the classification computer. The verification computer then generates a user interface that displays select information from the document and provides an option to verify the label predicted by the classification computer or provide an alternative label. The document and the verified label are then fed back into the set of training documents and are used to retrain the classifier to improve subsequent classifications. In addition, the document is indexed by a query computer based on the verified label and made available for search and display.
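
    The train, predict, verify, retrain, and index loop described in this abstract can be sketched in a few lines. This is a simplified illustration, not the patented system: a word-count classifier stands in for whatever classifier the patent uses, a `reviewer` callback stands in for the verification user interface, and all function names and sample documents are hypothetical. The sketch also retrains after every verified document, which is an assumption made for brevity.

```python
from collections import Counter, defaultdict


def train(labeled_docs):
    """Count word occurrences per label (a stand-in for a real classifier)."""
    model = defaultdict(Counter)
    for text, label in labeled_docs:
        model[label].update(text.lower().split())
    return model


def predict(model, text):
    """Pick the label whose training words overlap the document the most."""
    words = text.lower().split()
    # sorted() makes tie-breaking deterministic (first label alphabetically).
    return max(sorted(model), key=lambda label: sum(model[label][w] for w in words))


def verify_and_retrain(training, unlabeled, reviewer):
    """Predict a label, let a reviewer confirm or replace it, then retrain."""
    model = train(training)
    index = defaultdict(list)  # verified label -> documents, for search
    for text in unlabeled:
        predicted = predict(model, text)
        verified = reviewer(text, predicted)  # reviewer may return another label
        training.append((text, verified))     # feed back into the training set
        model = train(training)               # retrain on the enlarged set
        index[verified].append(text)
    return model, index


training = [("invoice payment due", "finance"), ("goal match score", "sports")]
unlabeled = ["payment overdue notice", "final score of the match"]

# This reviewer accepts every predicted label unchanged.
model, index = verify_and_retrain(training, unlabeled, lambda text, label: label)
print(dict(index))
```

    A reviewer that returns a different label exercises the feedback path: the corrected pair joins the training set, so later predictions for similar documents can change.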


    FRAMEWORK FOR EVALUATION OF COMPUTER-BASED MODELS

    Publication No.: US20240420258A1

    Publication date: 2024-12-19

    Application No.: US18349738

    Filing date: 2023-07-10

    Abstract: Computer-implemented systems and methods are disclosed, including for evaluation of computer-based models in a management framework. A computer-implemented method may include, for example, receiving one or more user inputs including requesting to add an evaluation configuration to a defined modeling objective, specifying at least a first evaluation data set for the evaluation configuration, specifying at least a first evaluation library for the evaluation configuration, and specifying at least a first subset definition for the evaluation configuration. A computer-implemented method may include, in response to the one or more user inputs, creating, storing, and/or updating the evaluation configuration. A computer-implemented method may include evaluating, based on the evaluation configuration, one or more models associated with the defined modeling objective.
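
    One way to picture the evaluation configuration in this abstract is as a record that ties an evaluation data set, an evaluation library of metrics, and subset definitions to a modeling objective. The sketch below is an assumption-laden illustration rather than the disclosed framework: `EvaluationConfig`, `ModelingObjective`, `evaluate`, the `accuracy` metric, and the sample model and data are all hypothetical names and values.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

Row = Tuple[float, int]  # (feature value, expected label)


@dataclass
class EvaluationConfig:
    dataset: List[Row]                                           # evaluation data set
    library: Dict[str, Callable[[List[int], List[int]], float]]  # metric name -> metric
    subsets: Dict[str, Callable[[Row], bool]]                    # subset name -> row filter


@dataclass
class ModelingObjective:
    name: str
    models: Dict[str, Callable[[float], int]] = field(default_factory=dict)
    configs: List[EvaluationConfig] = field(default_factory=list)


def accuracy(expected: List[int], predicted: List[int]) -> float:
    """Fraction of rows where the prediction matches the expected label."""
    return sum(e == p for e, p in zip(expected, predicted)) / len(expected)


def evaluate(objective: ModelingObjective) -> Dict[Tuple[str, str, str], float]:
    """Score every model on every subset with every metric in the library."""
    results = {}
    for config in objective.configs:
        for subset_name, keep in config.subsets.items():
            rows = [row for row in config.dataset if keep(row)]
            if not rows:
                continue  # skip empty subsets rather than divide by zero
            expected = [label for _, label in rows]
            for model_name, model in objective.models.items():
                predicted = [model(x) for x, _ in rows]
                for metric_name, metric in config.library.items():
                    results[(model_name, subset_name, metric_name)] = metric(expected, predicted)
    return results


objective = ModelingObjective("churn", models={"threshold": lambda x: int(x > 0.5)})
# Adding a configuration corresponds to the "create/store" step in the abstract.
objective.configs.append(EvaluationConfig(
    dataset=[(0.2, 0), (0.7, 1), (0.9, 0), (0.4, 0)],
    library={"accuracy": accuracy},
    subsets={"all": lambda row: True, "high": lambda row: row[0] > 0.5},
))
results = evaluate(objective)
print(results)
```

    Storing the configuration on the objective, rather than on each model, means a model added to the objective later is scored against the same data set, metrics, and subsets without further setup.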
