-
公开(公告)号:US20250094439A1
公开(公告)日:2025-03-20
申请号:US18827034
申请日:2024-09-06
Applicant: Palantir Technologies Inc.
Inventor: Morten Telling , Alexander Bailey , Richard Burdish , Ankit Shankar , Matthew Hawes , Codrut Lemeni , Nanwei Cai , Tiffany Wang , Joseph Rafidi , Kamran Khan
IPC: G06F16/25 , G06F16/242
Abstract: A system may use a large language model (“LLM”) to generate a data pipeline. The system can receive a natural language query and a selection of a plurality of data sets for generating a data pipeline and generate a prompt comprising at least: the natural language query, indications of the plurality of data sets, an indication of a format of a first computer language, and an indication of available data transformations. The system can transmit the prompt to an LLM and receive, from the LLM, a response to the prompt in the format of the first computer language. The system can parse the response in the first computer language to identify at least an indication of one or more recommended data transformations. The system can generate, based on the indication of the one or more recommended data transformations, the data pipeline using a second computer language.