Robust name matching with regularized embeddings

    公开(公告)号:US11822887B2

    公开(公告)日:2023-11-21

    申请号:US17199963

    申请日:2021-03-12

    Applicant: ADOBE INC.

    CPC classification number: G06F40/295 G06F40/126 G06N3/049

    Abstract: Systems and methods for natural language processing are described. One or more embodiments of the disclosure provide an entity matching apparatus trained using machine learning techniques to determine whether a query name corresponds to a candidate name based on a similarity score. In some examples, the query name and the candidate name are encoded using a character encoder to produce a regularized input sequence and a regularized candidate sequence, respectively. The regularized input sequence and the regularized candidate sequence are formed from a regularized character set having fewer characters than a natural language character set.

    LEARNING TO FUSE SENTENCES WITH TRANSFORMERS FOR SUMMARIZATION

    公开(公告)号:US20220261555A1

    公开(公告)日:2022-08-18

    申请号:US17177372

    申请日:2021-02-17

    Applicant: ADOBE INC.

    Abstract: Systems and methods for sentence fusion are described. Embodiments receive coreference information for a first sentence and a second sentence, wherein the coreference information identifies entities associated with both a term of the first sentence and a term of the second sentence, apply an entity constraint to an attention head of a sentence fusion network, wherein the entity constraint limits attention weights of the attention head to terms that correspond to a same entity of the coreference information, and predict a fused sentence using the sentence fusion network based on the entity constraint, wherein the fused sentence combines information from the first sentence and the second sentence.

Patent Agency Ranking