SYSTEMS AND METHODS FOR ITERATIVE NATURAL LANGUAGE-BASED DATA PIPELINE GENERATIONS AND PROTOTYPING

    Publication No.: US20250094171A1

    Publication Date: 2025-03-20

    Application No.: US18598333

    Application Date: 2024-03-07

    Abstract: At least some embodiments of the present disclosure relate to methods and systems for evaluating, generating, and/or prototyping data pipelines. In certain embodiments, a system is configured to perform operations including: receiving an input dataset, the input dataset including a data schema; generating a first prompt based on the input dataset and a first prompt structure having one or more text strings and one or more blanks; providing the first prompt to a language model; receiving a use case generated by the language model for the input dataset, the use case including a description of how to use the input dataset; generating a data pipeline based on the use case; and applying the data pipeline to the input dataset to generate an output dataset.
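
    The sequence of operations recited in the abstract can be illustrated with a minimal Python sketch. It is not the disclosed implementation. Every name in it (PROMPT_STRUCTURE, build_prompt, stub_language_model, generate_pipeline) is hypothetical, the language model is replaced by a stub that returns a fixed use case, and the generated pipeline is assumed to be a single group-and-sum step over the assumed columns "region" and "amount".

```python
from string import Template
from typing import Callable

# Hypothetical prompt structure: fixed text strings plus blanks ($name, $schema).
PROMPT_STRUCTURE = Template(
    "Dataset '$name' has columns: $schema. "
    "Describe one use case for this dataset."
)


def build_prompt(name: str, schema: dict[str, str]) -> str:
    """Fill the blanks of the prompt structure from the dataset's schema."""
    cols = ", ".join(f"{col} ({typ})" for col, typ in schema.items())
    return PROMPT_STRUCTURE.substitute(name=name, schema=cols)


def stub_language_model(prompt: str) -> str:
    """Stand-in for a real language model call (assumption: fixed reply)."""
    return "Total amount per region"


def generate_pipeline(use_case: str) -> Callable[[list[dict]], list[dict]]:
    """Assumed mapping from the one known use case to a group-and-sum step."""
    if use_case != "Total amount per region":
        raise ValueError(f"no pipeline known for use case: {use_case!r}")

    def pipeline(rows: list[dict]) -> list[dict]:
        totals: dict[str, int] = {}
        for row in rows:
            totals[row["region"]] = totals.get(row["region"], 0) + row["amount"]
        return [{"region": r, "total_amount": t} for r, t in totals.items()]

    return pipeline


# Input dataset: a schema plus rows (illustrative values only).
schema = {"region": "string", "amount": "int"}
rows = [
    {"region": "east", "amount": 10},
    {"region": "west", "amount": 5},
    {"region": "east", "amount": 7},
]

prompt = build_prompt("sales", schema)      # generate the first prompt
use_case = stub_language_model(prompt)      # receive a use case
pipeline = generate_pipeline(use_case)      # generate a data pipeline
output = pipeline(rows)                     # apply it to the input dataset
print(output)
```

    In a real system the stub would be replaced by a call to an actual language model, and the pipeline would be derived from the returned use case rather than looked up from a fixed string.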
