MERGING MISIDENTIFIED TEXT STRUCTURES IN A DOCUMENT

    公开(公告)号:US20250165703A1

    公开(公告)日:2025-05-22

    申请号:US18511111

    申请日:2023-11-16

    Applicant: Adobe Inc.

    Abstract: Embodiments are disclosed for merging misidentified text structures. The method may include receiving a document including a plurality of text elements. The method may further include determining, by a machine learning model, a likelihood of merging a first text element of the plurality of text elements with a second text element of the plurality of text elements based on structure data and context data associated with the first and second text elements. The method may further include determining whether the likelihood of merging the first text element with the second text element satisfies a threshold. The method further includes responsive to determining that the likelihood of merging the first text element with the second text element satisfies the threshold, merging the first text element with the second text element.

    HIERARCHICAL SEGMENTATION OF UNSTRUCTURED TEXT USING NEURAL NETWORKS

    公开(公告)号:US20250165517A1

    公开(公告)日:2025-05-22

    申请号:US18511186

    申请日:2023-11-16

    Applicant: Adobe Inc.

    Abstract: Embodiments are disclosed for a digital design system trained to segment unstructured text into topically coherent segments. The method may include receiving unstructured text, the unstructured text including a sequence of sentences. The disclosed systems and methods further comprise generating, by a neural network, a hierarchically segmented tree structure representing the unstructured text. The tree structure comprises a plurality of tree structure nodes, where a node of the tree structure nodes represents a sentence from the sequence of sentences. The segments and sub-segments of the unstructured text can then be determined based on node data for nodes of the hierarchically segmented tree structure. Using the determined segments and sub-segments of the unstructured text, a modified representation of the unstructured text can be displayed.

Patent Agency Ranking