Domain Adaptation for Machine Learning Models

    公开(公告)号:US20220391768A1

    公开(公告)日:2022-12-08

    申请号:US17883811

    申请日:2022-08-09

    Applicant: Adobe Inc.

    Abstract: Adapting a machine learning model to process data that differs from training data used to configure the model for a specified objective is described. A domain adaptation system trains the model to process new domain data that differs from a training data domain by using the model to generate a feature representation for the new domain data, which describes different content types included in the new domain data. The domain adaptation system then generates a probability distribution for each discrete region of the new domain data, which describes a likelihood of the region including different content described by the feature representation. The probability distribution is compared to ground truth information for the new domain data to determine a loss function, which is used to refine model parameters. After determining that model outputs achieve a threshold similarity to the ground truth information, the model is output as a domain-agnostic model.

    Asides detection in documents
    4.
    发明授权

    公开(公告)号:US12136287B2

    公开(公告)日:2024-11-05

    申请号:US17651433

    申请日:2022-02-17

    Applicant: Adobe Inc.

    Abstract: Techniques are disclosed for identifying asides within a document, and detecting a display order of contents based of the identified asides. In a document, an “aside” represents a content region of the document that is distinct from the main content regions, and may be visually distinguishable from the main content region. In an example, a document is received, where the document lacks identification of asides. The document is analyzed to identify asides within the document. A display order of contents within the document is then determined, based on the identified asides. For example, in the display order, the asides are ordered between two segments of the main content and/or at a beginning or an end of the main content, but may not be ordered to be embedded in between a segment of the main content. The document is displayed in accordance with the display order.

    Privacy preserving document analysis

    公开(公告)号:US11689507B2

    公开(公告)日:2023-06-27

    申请号:US16695636

    申请日:2019-11-26

    Applicant: Adobe Inc.

    CPC classification number: H04L63/04 G06N5/04 G06N20/00 G06Q30/0202

    Abstract: Systems and techniques for privacy preserving document analysis are described that derive insights pertaining to a digital document without communication of the content of the digital document. To do so, the privacy preserving document analysis techniques described herein capture visual or contextual features of the digital document and creates a stamp representation that represents these features without included the content of the digital document. The stamp representation is projected into a stamp embedding space based on a stamp encoding model generated through machine learning techniques capturing feature patterns and interaction in the stamp representations. The stamp encoding model exploits these feature interactions to define similarity of source documents based on location within the stamp embedding space. Accordingly, the techniques described herein can determine a similarity of documents without having access to the documents themselves.

    Asides detection in documents
    9.
    发明授权

    公开(公告)号:US11256913B2

    公开(公告)日:2022-02-22

    申请号:US16598680

    申请日:2019-10-10

    Applicant: Adobe Inc.

    Abstract: Techniques are disclosed for identifying asides within a document, and detecting a display order of contents based of the identified asides. In a document, an “aside” represents a content region of the document that is distinct from the main content regions, and may be visually distinguishable from the main content region. In an example, a document is received, where the document lacks identification of asides. The document is analyzed to identify asides within the document. A display order of contents within the document is then determined, based on the identified asides. For example, in the display order, the asides are ordered between two segments of the main content and/or at a beginning or an end of the main content, but may not be ordered to be embedded in between a segment of the main content. The document is displayed in accordance with the display order.

Patent Agency Ranking