-
公开(公告)号:US20190179936A1
公开(公告)日:2019-06-13
申请号:US16138759
申请日:2018-09-21
Applicant: Palantir Technologies Inc.
Inventor: Gabrielle Javitt , Samuel Szuflita , Satej Soman , Harsh Pandey , Siddharth Dhulipalla , Vipul Shekhawat
IPC: G06F17/30
Abstract: Disclosed herein are systems and methods for joining datasets. The system may include one or more processors and a memory storing instructions that, when executed by the one or more processors. The processor may cause the system to perform determining at least a first database table to be annotated, the first database table including a set of columns and rows of a dataset. In some embodiments, the system may include determining at least one typeclass that applies to one or more columns included in the first database table, wherein the typeclass describes values stored in the one or more columns and annotating the one or more columns, wherein the annotated columns are associated with the typeclass.
-
公开(公告)号:US20190018889A1
公开(公告)日:2019-01-17
申请号:US15900289
申请日:2018-02-20
Applicant: Palantir Technologies Inc.
Inventor: Caitlin Colgrove , Harsh Pandey , Gabrielle Javitt
IPC: G06F17/30
Abstract: A first dataset from one or more databases and a second dataset from the one or more databases may be identified. The first dataset may contain first data while the second dataset may contain second data. A first relationship measure may be computed for the first dataset, where the first relationship measure is configured to represent the first data in a first condensed format. A second relationship measure may be computed for the second dataset, where the second relationship measure is configured to represent the second data in a second condensed format. A join key may be computed using the first relationship measure and the second relationship measure. The join key may represent a correspondence area between the first dataset and the second dataset. An interactive user interface element may be configured to display a graphical depiction of the correspondence area between the first dataset and the second dataset.
-