-
公开(公告)号:US12079352B2
公开(公告)日:2024-09-03
申请号:US17226014
申请日:2021-04-08
Applicant: Palantir Technologies Inc.
Inventor: Anton Apostolatos , Adam Lieskovský , Florian Diegruber , Francisco Ferreira , Joseph Kane , Joanna Peller , Kelvin Lau , Maciej Laska , Mikael Ibrahim Mofarrej , Max-Philipp Schrader , Philipp Hoefer , Spencer McCollester , Viktor Nordling
CPC classification number: G06F21/604 , G06F16/258
Abstract: A computer-implemented method enforces data security constraints in a data pipeline. The data pipeline takes one or more source datasets as input and performs one or more data transformations on them. The method includes using data defining one or more data security constraints to configure the data pipeline to perform a data transformation on a restricted subset of entries of the source datasets. The restriction is defined by the data defining one or more data security constraints. The method further includes performing the data transformation according to the configuration to produce one or more transformed datasets. The method further includes using the data defining one or more data security constraints to perform a verification on one or more of the transformed datasets to ensure that entries in the one or more of the transformed datasets are restricted as defined by the one or more data security constraints.
-
公开(公告)号:US20220198032A1
公开(公告)日:2022-06-23
申请号:US17226014
申请日:2021-04-08
Applicant: Palantir Technologies Inc.
Inventor: Anton Apostolatos , Adam Lieskovský , Florian Diegruber , Francisco Ferreira , Joseph Kane , Joanna Peller , Kelvin Lau , Maciej Laska , Mikael Ibrahim Mofarrej , Max-Philipp Schrader , Philipp Hoefer , Spencer McCollester , Viktor Nordling
Abstract: A computer-implemented method enforces data security constraints in a data pipeline. The data pipeline takes one or more source datasets as input and performs one or more data transformations on them. The method includes using data defining one or more data security constraints to configure the data pipeline to perform a data transformation on a restricted subset of entries of the source datasets. The restriction is defined by the data defining one or more data security constraints. The method further includes performing the data transformation according to the configuration to produce one or more transformed datasets. The method further includes using the data defining one or more data security constraints to perform a verification on one or more of the transformed datasets to ensure that entries in the one or more of the transformed datasets are restricted as defined by the one or more data security constraints.
-
公开(公告)号:US11599539B2
公开(公告)日:2023-03-07
申请号:US16287631
申请日:2019-02-27
Applicant: Palantir Technologies Inc.
Inventor: Christopher Dugan , Aditya Chaturvedi , David Vogelbacher , Jiawei Wu , Natacha Gabbamonte , Joseph Kane
IPC: G06F16/2453 , G06F16/907 , G06F16/28 , G06F21/62
Abstract: A logical query plan to derive a target dataset from one or more source datasets is identified. The logical query plan defines source columns of the one or more source datasets and respective target columns of the target dataset. The logical query plan is parsed to derive relationships between the source columns of the one or more source datasets and the respective target columns of the target dataset. Target column metadata is generated for a target column of the target dataset. The target column metadata reflects a derived relationship between one or more source columns and the target column and existing source column metadata of each of the one or more source columns. The target column metadata is stored for the target column of the target dataset.
-
公开(公告)号:US20200210427A1
公开(公告)日:2020-07-02
申请号:US16287631
申请日:2019-02-27
Applicant: Palantir Technologies Inc.
Inventor: Christopher Dugan , Aditya Chaturvedi , David Vogelbacher , Jiawei Wu , Natacha Gabbamonte , Joseph Kane
IPC: G06F16/2453 , G06F16/28 , G06F16/907 , G06F21/62
Abstract: A logical query plan to derive a target dataset from one or more source datasets is identified. The logical query plan defines source columns of the one or more source datasets and respective target columns of the target dataset. The logical query plan is parsed to derive relationships between the source columns of the one or more source datasets and the respective target columns of the target dataset. Target column metadata is generated for a target column of the target dataset. The target column metadata reflects a derived relationship between one or more source columns and the target column and existing source column metadata of each of the one or more source columns. The target column metadata is stored for the target column of the target dataset.
-
公开(公告)号:US12141144B2
公开(公告)日:2024-11-12
申请号:US18110283
申请日:2023-02-15
Applicant: Palantir Technologies Inc.
Inventor: Christopher Dugan , Aditya Chaturvedi , David Vogelbacher , Jiawei Wu , Natacha Gabbamonte , Joseph Kane
IPC: G06F16/2453 , G06F16/28 , G06F16/907 , G06F21/62
Abstract: Whether a logical query plan to derive a target dataset from one or more source datasets is available is determined. Responsive to determining that the logical query plan is not available for the transformation code in the first programming language, relationships between the source columns of the one or more source datasets and the respective target columns of the target dataset are inferred. Target column metadata for a target column of the target dataset is generated. The target column metadata reflects the inferred relationships between the source columns of the one or more source datasets and the respective target columns. The target column metadata for the target column of the target dataset is stored.
-
公开(公告)号:US20230195728A1
公开(公告)日:2023-06-22
申请号:US18110283
申请日:2023-02-15
Applicant: Palantir Technologies Inc.
Inventor: Christopher Dugan , Aditya Chaturvedi , David Vogelbacher , Jiawei Wu , Natacha Gabbamonte , Joseph Kane
IPC: G06F16/2453 , G06F16/907 , G06F16/28 , G06F21/62
CPC classification number: G06F16/24542 , G06F16/907 , G06F16/288 , G06F21/6218
Abstract: Whether a logical query plan to derive a target dataset from one or more source datasets is available is determined. Responsive to determining that the logical query plan is not available for the transformation code in the first programming language, relationships between the source columns of the one or more source datasets and the respective target columns of the target dataset are inferred. Target column metadata for a target column of the target dataset is generated. The target column metadata reflects the inferred relationships between the source columns of the one or more source datasets and the respective target columns. The target column metadata for the target column of the target dataset is stored.
-
-
-
-
-