Partitioning documents for contextual search

    Publication No.: US11983209B1

    Publication Date: 2024-05-14

    Application No.: US18303943

    Filing Date: 2023-04-20

    Abstract: Operations of a search management system are disclosed. The operations may include: identifying a data corpus containing a plurality of documents, generating sets of feature vectors representing the plurality of documents, receiving a query to search the data corpus, generating a query vector for the query, identifying a target feature vector that meets a similarity threshold by comparing the query vector to the feature vectors, and presenting a query result that includes at least part of the document. The feature vectors may be generated by executing a multi-step partitioning process for partitioning a respective document into a plurality of document partitions, such that the sets of feature vectors that are generated correspond to the plurality of document partitions for the respective document. The query result may include a target partition from among the plurality of document partitions represented by the target feature vector.

    Partitioning Documents For Contextual Search

    Publication No.: US20240354323A1

    Publication Date: 2024-10-24

    Application No.: US18634293

    Filing Date: 2024-04-12

    Abstract: Operations of a search management system are disclosed. The operations may include: identifying a data corpus containing a plurality of documents, generating sets of feature vectors representing the plurality of documents, receiving a query to search the data corpus, generating a query vector for the query, identifying a target feature vector that meets a similarity threshold by comparing the query vector to the feature vectors, and presenting a query result that includes at least part of the document. The feature vectors may be generated by executing a multi-step partitioning process for partitioning a respective document into a plurality of document partitions, such that the sets of feature vectors that are generated correspond to the plurality of document partitions for the respective document. The query result may include a target partition from among the plurality of document partitions represented by the target feature vector.
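
The operations listed in the abstracts above (partition each document, build one feature vector per partition, embed the query, and return the partition whose vector meets a similarity threshold) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent text: the two partitioning steps (paragraphs, then fixed-size word windows), the bag-of-words feature vectors, the cosine similarity measure, the default threshold of 0.3, and the hypothetical names `partition`, `featurize`, `build_index`, and `search`.

```python
import math
import re
from collections import Counter


def partition(document: str, max_words: int = 12) -> list[str]:
    """Assumed two-step partitioning: paragraphs first, then word windows."""
    partitions = []
    for paragraph in re.split(r"\n\s*\n", document):      # step 1: paragraphs
        words = paragraph.split()
        for start in range(0, len(words), max_words):     # step 2: windows
            partitions.append(" ".join(words[start:start + max_words]))
    return partitions


def featurize(text: str) -> Counter:
    """Assumed feature vector: lowercase token counts (bag of words)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def build_index(corpus: dict[str, str]) -> list[tuple[str, str, Counter]]:
    """One (document id, partition text, feature vector) entry per partition."""
    return [
        (doc_id, part, featurize(part))
        for doc_id, text in corpus.items()
        for part in partition(text)
    ]


def search(index, query: str, threshold: float = 0.3):
    """Return the best-scoring partition that meets the threshold, else None."""
    query_vector = featurize(query)
    best = None
    for doc_id, part, vector in index:
        score = cosine(query_vector, vector)
        if score >= threshold and (best is None or score > best[2]):
            best = (doc_id, part, score)
    return best


# Toy corpus, invented for this example.
corpus = {
    "manual": "Reset the router by holding the power button.\n\n"
              "Billing questions go to the accounts team.",
    "faq": "Shipping takes five business days.",
}
index = build_index(corpus)
hit = search(index, "reset router")
print(hit[0], "|", hit[1])
```

In this sketch the query result is the matching partition rather than the whole document, which mirrors the abstracts' statement that the result may include a target partition. A production system would likely replace the token counts with learned embeddings and a vector index, but that choice is outside what the abstracts specify.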