-
公开(公告)号:US20210064645A1
公开(公告)日:2021-03-04
申请号:US17010187
申请日:2020-09-02
Applicant: Palantir Technologies Inc.
Inventor: Francisco Ferreira , Ryan Norris , Viktor Nordling , Kelvin Lau
IPC: G06F16/36 , G06F16/182 , G06F16/176
Abstract: A method, performed by one or more processors, is disclosed, comprising providing, to a plurality of parties permitted to communicate data via a shared database, an ontology application associated with a common core ontology, the core ontology defining constraints required to be met for producing, from one or more received datasets, one or more data objects for storing in the shared database. The ontology application may be configured to receive one or more datasets from one or more parties and to use the core database ontology to determine if the received one or more datasets conform to the constraints of the core ontology, and store the received one or more datasets as data objects in the shared database, conditional on the constraints being met.
-
公开(公告)号:US12079352B2
公开(公告)日:2024-09-03
申请号:US17226014
申请日:2021-04-08
Applicant: Palantir Technologies Inc.
Inventor: Anton Apostolatos , Adam Lieskovský , Florian Diegruber , Francisco Ferreira , Joseph Kane , Joanna Peller , Kelvin Lau , Maciej Laska , Mikael Ibrahim Mofarrej , Max-Philipp Schrader , Philipp Hoefer , Spencer McCollester , Viktor Nordling
CPC classification number: G06F21/604 , G06F16/258
Abstract: A computer-implemented method enforces data security constraints in a data pipeline. The data pipeline takes one or more source datasets as input and performs one or more data transformations on them. The method includes using data defining one or more data security constraints to configure the data pipeline to perform a data transformation on a restricted subset of entries of the source datasets. The restriction is defined by the data defining one or more data security constraints. The method further includes performing the data transformation according to the configuration to produce one or more transformed datasets. The method further includes using the data defining one or more data security constraints to perform a verification on one or more of the transformed datasets to ensure that entries in the one or more of the transformed datasets are restricted as defined by the one or more data security constraints.
-
公开(公告)号:US20220198032A1
公开(公告)日:2022-06-23
申请号:US17226014
申请日:2021-04-08
Applicant: Palantir Technologies Inc.
Inventor: Anton Apostolatos , Adam Lieskovský , Florian Diegruber , Francisco Ferreira , Joseph Kane , Joanna Peller , Kelvin Lau , Maciej Laska , Mikael Ibrahim Mofarrej , Max-Philipp Schrader , Philipp Hoefer , Spencer McCollester , Viktor Nordling
Abstract: A computer-implemented method enforces data security constraints in a data pipeline. The data pipeline takes one or more source datasets as input and performs one or more data transformations on them. The method includes using data defining one or more data security constraints to configure the data pipeline to perform a data transformation on a restricted subset of entries of the source datasets. The restriction is defined by the data defining one or more data security constraints. The method further includes performing the data transformation according to the configuration to produce one or more transformed datasets. The method further includes using the data defining one or more data security constraints to perform a verification on one or more of the transformed datasets to ensure that entries in the one or more of the transformed datasets are restricted as defined by the one or more data security constraints.
-
公开(公告)号:US20200089601A1
公开(公告)日:2020-03-19
申请号:US16693063
申请日:2019-11-22
Applicant: Palantir Technologies Inc.
Inventor: Francisco Ferreira , Edgar Gomes de Araujo , Jose Angel Riarola
Abstract: An improved unit test framework that validates large datasets generated by a data management system is described herein. Typical unit test frameworks validate functions. However, the improved unit test framework validates the underlying data. For example, after each step of a data transformation process implemented by the data management system, the data management system can execute a data unit test that loads data sets into memory, checks a set of preconditions, and applies unit test logic to the loaded data sets. In some embodiments, the data management system executes the data unit tests asynchronously with the data transformation processes and therefore do not interfere with the data transformation processes. Rather, the data management system generates and transmits a notification when any step of the data transformation process fails a particular data unit test.
-
公开(公告)号:US12032476B2
公开(公告)日:2024-07-09
申请号:US17681639
申请日:2022-02-25
Applicant: Palantir Technologies Inc.
Inventor: Francisco Ferreira , Edgar Gomes de Araujo , Jose Angel Riarola
CPC classification number: G06F11/3688 , G06F8/30 , G06F8/436 , G06F11/3696
Abstract: An improved unit test framework that validates large datasets generated by a data management system is described herein. Typical unit test frameworks validate functions. However, the improved unit test framework validates the underlying data. For example, after each step of a data transformation process implemented by the data management system, the data management system can execute a data unit test that loads data sets into memory, checks a set of preconditions, and applies unit test logic to the loaded data sets. In some embodiments, the data management system executes the data unit tests asynchronously with the data transformation processes and therefore do not interfere with the data transformation processes. Rather, the data management system generates and transmits a notification when any step of the data transformation process fails a particular data unit test.
-
公开(公告)号:US20220179779A1
公开(公告)日:2022-06-09
申请号:US17681639
申请日:2022-02-25
Applicant: Palantir Technologies Inc.
Inventor: Francisco Ferreira , Edgar Gomes de Araujo , Jose Angel Riarola
Abstract: An improved unit test framework that validates large datasets generated by a data management system is described herein. Typical unit test frameworks validate functions. However, the improved unit test framework validates the underlying data. For example, after each step of a data transformation process implemented by the data management system, the data management system can execute a data unit test that loads data sets into memory, checks a set of preconditions, and applies unit test logic to the loaded data sets. In some embodiments, the data management system executes the data unit tests asynchronously with the data transformation processes and therefore do not interfere with the data transformation processes. Rather, the data management system generates and transmits a notification when any step of the data transformation process fails a particular data unit test.
-
公开(公告)号:US11294801B2
公开(公告)日:2022-04-05
申请号:US16693063
申请日:2019-11-22
Applicant: Palantir Technologies Inc.
Inventor: Francisco Ferreira , Edgar Gomes de Araujo , Jose Angel Riarola
Abstract: An improved unit test framework that validates large datasets generated by a data management system is described herein. Typical unit test frameworks validate functions. However, the improved unit test framework validates the underlying data. For example, after each step of a data transformation process implemented by the data management system, the data management system can execute a data unit test that loads data sets into memory, checks a set of preconditions, and applies unit test logic to the loaded data sets. In some embodiments, the data management system executes the data unit tests asynchronously with the data transformation processes and therefore do not interfere with the data transformation processes. Rather, the data management system generates and transmits a notification when any step of the data transformation process fails a particular data unit test.
-
-
-
-
-
-