DATA PERMISSIONED LANGUAGE MODEL DOCUMENT SEARCH

    Publication number: US20240354436A1

    Publication date: 2024-10-24

    Application number: US18505912

    Application date: 2023-11-09

    CPC classification number: G06F21/6227 G06F16/3344 G06F16/3347

    Abstract: Computer-implemented systems and methods are disclosed, including systems and methods utilizing language models for searching a large corpus of data. A computer-implemented method may include: receiving a first user input comprising a natural language query; vectorizing the first user input into a query vector; executing, using the query vector, a similarity search in a document search model to identify one or more similar document portions, where the document search model includes a plurality of vectors corresponding to a plurality of portions of a set of documents; generating a first prompt for a large language model (“LLM”), the first prompt including at least the first user input and the one or more similar document portions; transmitting the first prompt to the LLM; receiving a first output from the LLM in response to the first prompt; and providing, via a user interface, the first output from the LLM.
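
    Illustrative sketch (not part of the patent text): the steps recited in the abstract can be pictured as a small retrieval pipeline. The Python below is a minimal sketch under stated assumptions. The bag-of-words `vectorize` function stands in for whatever embedding model a real system would use, the in-memory list stands in for the "document search model", and the `llm` argument is a hypothetical callable in place of a real LLM service. All function names are invented for illustration and do not come from the application.

```python
import math
import re
from collections import Counter
from typing import Callable, List, Tuple

Index = List[Tuple[str, Counter]]


def vectorize(text: str) -> Counter:
    """Toy stand-in for an embedding model: lowercase term counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_index(portions: List[str]) -> Index:
    """Assumed 'document search model': one vector per document portion."""
    return [(portion, vectorize(portion)) for portion in portions]


def similarity_search(query_vec: Counter, index: Index, k: int = 2) -> List[str]:
    """Return the k portions whose vectors are most similar to the query vector."""
    ranked = sorted(index, key=lambda item: cosine(query_vec, item[1]), reverse=True)
    return [portion for portion, _ in ranked[:k]]


def build_prompt(user_input: str, portions: List[str]) -> str:
    """Combine the user input and the retrieved portions into one prompt."""
    context = "\n".join(f"- {p}" for p in portions)
    return f"Context:\n{context}\n\nQuestion: {user_input}\nAnswer:"


def answer(user_input: str, index: Index, llm: Callable[[str], str], k: int = 2) -> str:
    """Vectorize the query, search, build the prompt, and return the LLM output."""
    query_vec = vectorize(user_input)
    similar = similarity_search(query_vec, index, k)
    prompt = build_prompt(user_input, similar)
    return llm(prompt)  # hypothetical LLM call; supply any str -> str function
```

    To try the sketch, build an index from a few short strings with `build_index` and pass any function that maps a prompt string to a response string as `llm`; the returned string is what a user interface would display. A production system would typically replace the term-count vectors with dense embeddings and the linear scan with a vector index.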
