Column lineage and metadata propagation

    公开(公告)号:US11599539B2

    公开(公告)日:2023-03-07

    申请号:US16287631

    申请日:2019-02-27

    Abstract: A logical query plan to derive a target dataset from one or more source datasets is identified. The logical query plan defines source columns of the one or more source datasets and respective target columns of the target dataset. The logical query plan is parsed to derive relationships between the source columns of the one or more source datasets and the respective target columns of the target dataset. Target column metadata is generated for a target column of the target dataset. The target column metadata reflects a derived relationship between one or more source columns and the target column and existing source column metadata of each of the one or more source columns. The target column metadata is stored for the target column of the target dataset.

    COLUMN LINEAGE AND METADATA PROPAGATION
    4.
    发明申请

    公开(公告)号:US20200210427A1

    公开(公告)日:2020-07-02

    申请号:US16287631

    申请日:2019-02-27

    Abstract: A logical query plan to derive a target dataset from one or more source datasets is identified. The logical query plan defines source columns of the one or more source datasets and respective target columns of the target dataset. The logical query plan is parsed to derive relationships between the source columns of the one or more source datasets and the respective target columns of the target dataset. Target column metadata is generated for a target column of the target dataset. The target column metadata reflects a derived relationship between one or more source columns and the target column and existing source column metadata of each of the one or more source columns. The target column metadata is stored for the target column of the target dataset.

    Column lineage and metadata propagation

    公开(公告)号:US12141144B2

    公开(公告)日:2024-11-12

    申请号:US18110283

    申请日:2023-02-15

    Abstract: Whether a logical query plan to derive a target dataset from one or more source datasets is available is determined. Responsive to determining that the logical query plan is not available for the transformation code in the first programming language, relationships between the source columns of the one or more source datasets and the respective target columns of the target dataset are inferred. Target column metadata for a target column of the target dataset is generated. The target column metadata reflects the inferred relationships between the source columns of the one or more source datasets and the respective target columns. The target column metadata for the target column of the target dataset is stored.

Patent Agency Ranking