DATA ENRICHMENT USING PARALLEL SEARCH

    公开(公告)号:US20240394252A1

    公开(公告)日:2024-11-28

    申请号:US18673986

    申请日:2024-05-24

    Applicant: Plaid Inc.

    Abstract: In some implementations, an enrichment engine may receive a first entry. The enrichment engine may generate a normalized first entry by using subword tokenization of the first entry. The enrichment engine may execute a plurality of searches concurrently, including: a first search configured to map a portion of the normalized first entry to a first result using regular expressions and fuzzy matching, a second search configured to provide the normalized first entry to a machine learning model in order to receive a second result, and a third search configured to map a vectorized version of the normalized first entry to a third result in a vector database. The enrichment engine may determine a selected result from the first result based on the first search, the second result based on the second search, or the third result based on the third search. The enrichment engine may return the selected result.

Patent Agency Ranking