DATA ENRICHMENT USING PARALLEL SEARCH

    Publication No.: US20240394252A1

    Publication Date: 2024-11-28

    Application No.: US18673986

    Filing Date: 2024-05-24

    Applicant: Plaid Inc.

    Abstract: In some implementations, an enrichment engine may receive a first entry. The enrichment engine may generate a normalized first entry by using subword tokenization of the first entry. The enrichment engine may execute a plurality of searches concurrently, including: a first search configured to map a portion of the normalized first entry to a first result using regular expressions and fuzzy matching, a second search configured to provide the normalized first entry to a machine learning model in order to receive a second result, and a third search configured to map a vectorized version of the normalized first entry to a third result in a vector database. The enrichment engine may determine a selected result from the first result based on the first search, the second result based on the second search, or the third result based on the third search. The enrichment engine may return the selected result.
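
The concurrent-search flow described in this abstract can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration, not the patented implementation: the `KNOWN_MERCHANTS` list, the helper names, the fixed 0.9 score, and the rule of selecting the highest-scoring result are hypothetical. A simple lowercase-and-split step stands in for subword tokenization, a substring check stands in for the machine learning model, and an in-memory character-count comparison stands in for the vector database.

```python
import math
import re
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from difflib import SequenceMatcher

# Hypothetical reference data; not taken from the patent.
KNOWN_MERCHANTS = ["starbucks", "walmart", "amazon"]


def normalize(entry: str) -> str:
    """Stand-in for subword tokenization: lowercase and keep alphabetic tokens."""
    return " ".join(re.findall(r"[a-z]+", entry.lower()))


def rule_search(text: str):
    """First search: regex extraction of the leading token, then fuzzy matching."""
    match = re.match(r"[a-z]+", text)
    if not match:
        return None, 0.0
    token = match.group(0)
    scored = [(SequenceMatcher(None, token, m).ratio(), m) for m in KNOWN_MERCHANTS]
    score, best = max(scored)
    return best, score


def model_search(text: str):
    """Second search: placeholder for a call to a machine learning model."""
    for merchant in KNOWN_MERCHANTS:
        if merchant in text:
            return merchant, 0.9  # assumed fixed confidence for the stub
    return None, 0.0


def _cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[k] * b[k] for k in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def vector_search(text: str):
    """Third search: nearest neighbor over character-count vectors (toy vector store)."""
    query = Counter(text.replace(" ", ""))
    scored = [(_cosine(query, Counter(m)), m) for m in KNOWN_MERCHANTS]
    score, best = max(scored)
    return (best, score) if score > 0 else (None, 0.0)


def enrich(entry: str):
    """Normalize the entry, run the three searches concurrently, pick the top score."""
    text = normalize(entry)
    with ThreadPoolExecutor(max_workers=3) as pool:
        futures = [pool.submit(fn, text) for fn in (rule_search, model_search, vector_search)]
        results = [f.result() for f in futures]
    # Assumed selection rule: highest score wins.
    return max(results, key=lambda r: r[1])[0]
```

Calling `enrich("STARBUCKS #1234 SEATTLE WA")` submits all three searches to the thread pool at once and returns the candidate with the highest score. The abstract does not state how the selected result is chosen, so the scoring rule is only one possible design.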

    DATA ENRICHMENT USING NAME, LOCATION, AND IMAGE LOOKUP

    Publication No.: US20230367780A1

    Publication Date: 2023-11-16

    Application No.: US18318377

    Filing Date: 2023-05-16

    Applicant: Plaid Inc.

    CPC classification number: G06F16/2468 G06F16/2455 G06F16/90344

    Abstract: In some implementations, a server may receive, from a user device and at a secure endpoint of an application programming interface, a set of structured data including a plurality of entries. The server may extract, from each entry, a corresponding partial string from a corresponding description string included in the entry, and may determine, for each partial string, a corresponding data structure in a database. The server may generate, for each entry, a standardized name and a location indicator based on the corresponding data structure, and may extract, for each data structure, an image corresponding to the data structure. Accordingly, the server may return, to the user device, a modified set of structured data including, for each entry, the standardized name, the location indicator, and the corresponding image.
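
The extract, look up, and return steps in this abstract can be sketched in Python as follows. This is a rough illustration under stated assumptions rather than the claimed method: the `MerchantRecord` type, the `MERCHANT_DB` dictionary, the field names, the regular expression, and the `example.com` image URL are all hypothetical placeholders, and the secure API endpoint is omitted.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class MerchantRecord:
    """Hypothetical data structure stored in the database for one merchant."""
    standardized_name: str
    location: str
    image_url: str


# Hypothetical in-memory database keyed by a lowercase partial string.
MERCHANT_DB = {
    "starbucks": MerchantRecord(
        standardized_name="Starbucks",
        location="Seattle, WA",
        image_url="https://example.com/logos/starbucks.png",  # placeholder URL
    ),
}


def extract_partial(description: str) -> str:
    """Extract a partial string: here, the first alphabetic run, lowercased."""
    match = re.search(r"[A-Za-z]+", description)
    return match.group(0).lower() if match else ""


def enrich_entries(entries: list) -> list:
    """Return a modified copy of the entries with name, location, and image added."""
    enriched = []
    for entry in entries:
        record = MERCHANT_DB.get(extract_partial(entry["description"]))
        modified = dict(entry)  # leave the caller's data untouched
        if record is not None:
            modified.update(
                name=record.standardized_name,
                location=record.location,
                image=record.image_url,
            )
        enriched.append(modified)
    return enriched
```

For an input such as `[{"description": "STARBUCKS #1234", "amount": 4.5}]`, the returned entry keeps its original fields and gains `name`, `location`, and `image`. Entries with no matching record are returned unchanged.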
