-
Publication No.: US11314698B2
Publication date: 2022-04-26
Application No.: US16208435
Filing date: 2018-12-03
Applicant: Palantir Technologies, Inc.
Inventor: Hao Dang, Gustav Brodman, Yi Xue, Stacey Milspaw, Yifei Huang, Yanran Lu
IPC: G06F16/182, G06F16/2455, G06F16/25, G06F16/23, G06F9/455
Abstract: Techniques for automatically scheduling builds of derived datasets in a distributed database system that supports pipelined data transformations are described herein. In an embodiment, a data processing method comprises, in association with a distributed database system that implements one or more data transformation pipelines, each of the data transformation pipelines comprising at least a first dataset, a first transformation, a second derived dataset and dataset dependency and timing metadata, detecting an arrival of a new raw dataset or new derived dataset; in response to the detecting, obtaining from the dataset dependency and timing metadata a dataset subset comprising those datasets that depend on at least the new raw dataset or new derived dataset; for each member dataset in the dataset subset, determining if the member dataset has a dependency on any other dataset that is not yet arrived, and in response to determining that the member dataset does not have a dependency on any other dataset that is not yet arrived: initiating a build of a portion of the data transformation pipeline comprising the member dataset and all other datasets on which the member dataset is dependent, without waiting for arrival of other datasets.
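The readiness check described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the names `on_arrival` and `datasets_ready_to_build`, the dictionary-based dependency metadata, and the restriction to direct dependencies are all assumptions made for illustration, and timing metadata is omitted.

```python
from typing import Dict, List, Set


def datasets_ready_to_build(
    dependencies: Dict[str, Set[str]],  # hypothetical metadata: dataset -> direct inputs
    arrived: Set[str],
    new_dataset: str,
) -> List[str]:
    """Return datasets that depend on new_dataset and have no missing inputs."""
    ready = []
    for dataset, inputs in dependencies.items():
        # Only consider datasets that depend on the newly arrived one.
        if new_dataset not in inputs:
            continue
        # Build only when every input has arrived; otherwise keep waiting.
        if inputs <= arrived:
            ready.append(dataset)
    return sorted(ready)


def on_arrival(
    dependencies: Dict[str, Set[str]],
    arrived: Set[str],
    new_dataset: str,
) -> List[str]:
    """Record an arrival and report which builds could be initiated now."""
    arrived.add(new_dataset)
    return datasets_ready_to_build(dependencies, arrived, new_dataset)


# Example: "summary" needs only raw_a, "joined" needs both raw inputs.
deps = {"joined": {"raw_a", "raw_b"}, "summary": {"raw_a"}}
arrived: Set[str] = set()
first = on_arrival(deps, arrived, "raw_a")
second = on_arrival(deps, arrived, "raw_b")
```

In this example, `summary` becomes buildable as soon as `raw_a` arrives, without waiting for `raw_b`, while `joined` is held back until both inputs are present.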
-
Publication No.: US10191926B2
Publication date: 2019-01-29
Application No.: US15914215
Filing date: 2018-03-07
Applicant: Palantir Technologies, Inc.
Inventor: Jacob Meacham, Michael Harris, Gustav Brodman, Lynn Cuthriell, Hannah Korus, Brian Toth, Jonathan Hsiao, Mark Elliot, Brian Schimpf, Michael Garland, Evelyn Nguyen
Abstract: A history preserving data pipeline computer system and method. In one aspect, the history preserving data pipeline system provides immutable and versioned datasets. Because datasets are immutable and versioned, the system makes it possible to determine the data in a dataset at a point in time in the past, even if that data is no longer in the current version of the dataset.
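The idea of immutable, versioned datasets that support point-in-time lookup can be sketched as follows. This is an illustrative toy, not the system the patent describes: the class `VersionedDataset`, its `commit` and `as_of` methods, and the use of integer timestamps are all hypothetical.

```python
import bisect
from typing import Iterable, List, Optional, Tuple


class VersionedDataset:
    """Append-only list of immutable versions, each stamped with a commit time."""

    def __init__(self) -> None:
        self._timestamps: List[int] = []
        self._versions: List[Tuple[str, ...]] = []

    def commit(self, timestamp: int, rows: Iterable[str]) -> None:
        """Add a new version; earlier versions are never modified."""
        if self._timestamps and timestamp <= self._timestamps[-1]:
            raise ValueError("timestamps must be strictly increasing")
        self._timestamps.append(timestamp)
        self._versions.append(tuple(rows))  # a tuple keeps the version immutable

    def as_of(self, timestamp: int) -> Optional[Tuple[str, ...]]:
        """Return the version that was current at the given time, if any."""
        idx = bisect.bisect_right(self._timestamps, timestamp)
        return self._versions[idx - 1] if idx else None


ds = VersionedDataset()
ds.commit(1, ["a", "b"])
ds.commit(5, ["b"])  # row "a" is absent from the current version
past = ds.as_of(3)   # but it is still recoverable from the earlier version
```

Because each commit appends a new version rather than overwriting the previous one, `as_of(3)` still returns the row `"a"` even though it is missing from the latest version.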
-
Publication No.: US20190258520A1
Publication date: 2019-08-22
Application No.: US16401079
Filing date: 2019-05-01
Applicant: Palantir Technologies Inc.
Inventor: Daniel Paquette, Huw Pryce, Alexander Feldman, Ryan Zheng, Daniel Walker, Cody Moore, Patricio Velez, Gustav Brodman, Jakub Kozlowski, Eric Wong, Steven Capetta, Charles Post, Rick White
Abstract: Techniques for secure and efficient interfacing with a cloud computing service are described. In an embodiment, a cloud computing management service is programmed or configured to communicate with a cloud computing service. The cloud computing management service can be accessed by software engineers that are looking to deploy a software instance to a computing device of the cloud computing service. Thus, the cloud computing management service acts as an intermediary layer in front of the cloud computing service. In an embodiment, the cloud computing management service may store one or more frequently-used system parameters for deployment of software instances. The parameters conform to company's security protocols, compliance protocols, and/or other standards.
-
Publication No.: US10176217B1
Publication date: 2019-01-08
Application No.: US15698574
Filing date: 2017-09-07
Applicant: Palantir Technologies, Inc.
Inventor: Hao Dang, Gustav Brodman, Yi Xue, Stacey Milspaw, Yifei Huang, Yanran Lu
Abstract: Techniques for automatically scheduling builds of derived datasets in a distributed database system that supports pipelined data transformations are described herein. In an embodiment, a data processing method comprises, in association with a distributed database system that implements one or more data transformation pipelines, each of the data transformation pipelines comprising at least a first dataset, a first transformation, a second derived dataset and dataset dependency and timing metadata, detecting an arrival of a new raw dataset or new derived dataset; in response to the detecting, obtaining from the dataset dependency and timing metadata a dataset subset comprising those datasets that depend on at least the new raw dataset or new derived dataset; for each member dataset in the dataset subset, determining if the member dataset has a dependency on any other dataset that is not yet arrived, and in response to determining that the member dataset does not have a dependency on any other dataset that is not yet arrived: initiating a build of a portion of the data transformation pipeline comprising the member dataset and all other datasets on which the member dataset is dependent, without waiting for arrival of other datasets.
-
Publication No.: US11977919B2
Publication date: 2024-05-07
Application No.: US17339622
Filing date: 2021-06-04
Applicant: Palantir Technologies Inc.
Inventor: Daniel Paquette, Huw Pryce, Alexander Feldman, Ryan Zheng, Daniel Walker, Cody Moore, Patricio Velez, Gustav Brodman, Jakub Kozlowski, Eric Wong, Steven Capetta, Charles Post, Rick White
CPC classification number: G06F9/5011, G06F8/65, G06F9/44505, H04L63/20
Abstract: Techniques for secure and efficient interfacing with a cloud computing service are described. In an embodiment, a cloud computing management service is programmed or configured to communicate with a cloud computing service. The cloud computing management service can be accessed by software engineers that are looking to deploy a software instance to a computing device of the cloud computing service. Thus, the cloud computing management service acts as an intermediary layer in front of the cloud computing service. In an embodiment, the cloud computing management service may store one or more frequently-used system parameters for deployment of software instances. The parameters conform to company's security protocols, compliance protocols, and/or other standards.
-
Publication No.: US20210294653A1
Publication date: 2021-09-23
Application No.: US17339622
Filing date: 2021-06-04
Applicant: Palantir Technologies Inc.
Inventor: Daniel Paquette, Huw Pryce, Alexander Feldman, Ryan Zheng, Daniel Walker, Cody Moore, Patricio Velez, Gustav Brodman, Jakub Kozlowski, Eric Wong, Steven Capetta, Charles Post, Rick White
Abstract: Techniques for secure and efficient interfacing with a cloud computing service are described. In an embodiment, a cloud computing management service is programmed or configured to communicate with a cloud computing service. The cloud computing management service can be accessed by software engineers that are looking to deploy a software instance to a computing device of the cloud computing service. Thus, the cloud computing management service acts as an intermediary layer in front of the cloud computing service. In an embodiment, the cloud computing management service may store one or more frequently-used system parameters for deployment of software instances. The parameters conform to company's security protocols, compliance protocols, and/or other standards.
-
Publication No.: US20190114289A1
Publication date: 2019-04-18
Application No.: US16208435
Filing date: 2018-12-03
Applicant: Palantir Technologies, Inc.
Inventor: Hao Dang, Gustav Brodman, Yi Xue, Stacey Milspaw, Yifei Huang, Yanran Lu
IPC: G06F16/182, G06F9/455
Abstract: Techniques for automatically scheduling builds of derived datasets in a distributed database system that supports pipelined data transformations are described herein. In an embodiment, a data processing method comprises, in association with a distributed database system that implements one or more data transformation pipelines, each of the data transformation pipelines comprising at least a first dataset, a first transformation, a second derived dataset and dataset dependency and timing metadata, detecting an arrival of a new raw dataset or new derived dataset; in response to the detecting, obtaining from the dataset dependency and timing metadata a dataset subset comprising those datasets that depend on at least the new raw dataset or new derived dataset; for each member dataset in the dataset subset, determining if the member dataset has a dependency on any other dataset that is not yet arrived, and in response to determining that the member dataset does not have a dependency on any other dataset that is not yet arrived: initiating a build of a portion of the data transformation pipeline comprising the member dataset and all other datasets on which the member dataset is dependent, without waiting for arrival of other datasets.
-
Publication No.: US09946738B2
Publication date: 2018-04-17
Application No.: US15287715
Filing date: 2016-10-06
Applicant: Palantir Technologies, Inc.
Inventor: Jacob Meacham, Michael Harris, Gustav Brodman, Lynn Cuthriell, Hannah Korus, Brian Toth, Jonathan Hsiao, Mark Elliot, Brian Schimpf, Michael Garland, Evelyn Nguyen
IPC: G06F17/30
CPC classification number: G06F17/30309, G06F11/1451, G06F17/30227, G06F17/3023, G06F17/30292, G06F17/30371, G06F17/3038, G06F17/30563
Abstract: A history preserving data pipeline computer system and method. In one aspect, the history preserving data pipeline system provides immutable and versioned datasets. Because datasets are immutable and versioned, the system makes it possible to determine the data in a dataset at a point in time in the past, even if that data is no longer in the current version of the dataset.
-
Publication No.: US10853338B2
Publication date: 2020-12-01
Application No.: US16240507
Filing date: 2019-01-04
Applicant: Palantir Technologies, Inc.
Inventor: Jacob Meacham, Michael Harris, Gustav Brodman, Lynn Cuthriell, Hannah Korus, Brian Toth, Jonathan Hsiao, Mark Elliot, Brian Schimpf, Michael Garland, Evelyn Nguyen
Abstract: A history preserving data pipeline computer system and method. In one aspect, the history preserving data pipeline system provides immutable and versioned datasets. Because datasets are immutable and versioned, the system makes it possible to determine the data in a dataset at a point in time in the past, even if that data is no longer in the current version of the dataset.
-
Publication No.: US10324759B1
Publication date: 2019-06-18
Application No.: US15831014
Filing date: 2017-12-04
Applicant: Palantir Technologies Inc.
Inventor: Daniel Paquette, Huw Pryce, Alexander Feldman, Ryan Zheng, Daniel Walker, Cody Moore, Patricio Velez, Gustav Brodman, Jakub Kozlowski, Eric Wong, Steven Capetta, Charles Post, Rick White
Abstract: Techniques for secure and efficient interfacing with a cloud computing service are described. In an embodiment, a cloud computing management service is programmed or configured to communicate with a cloud computing service. The cloud computing management service can be accessed by software engineers that are looking to deploy a software instance to a computing device of the cloud computing service. Thus, the cloud computing management service acts as an intermediary layer in front of the cloud computing service. In an embodiment, the cloud computing management service may store one or more frequently-used system parameters for deployment of software instances. The parameters conform to company's security protocols, compliance protocols, and/or other standards.