Mathematical function defined natural language annotation

    Publication No.: US12197846B2

    Publication Date: 2025-01-14

    Application No.: US16688084

    Filing Date: 2019-11-19

    Abstract: Provided is a method, a computer program product, and a system for associating mathematical functions to numerical text in a natural language sample. The method includes inputting a natural language sample from a text dataset and identifying a numerical text within the natural language sample. The method further includes displaying a mathematical function corresponding to the numerical text to be selected. The mathematical function can be selected via a graphical user interface displayed on a computing device. The method also includes receiving and inserting the mathematical function as a feature into a feature vector of the natural language sample and selecting an output label for the natural language sample. The output label relates to the mathematical function selected for the numerical text. The method further includes exporting the natural language sample into a labeled dataset which can be used to train a machine learning model.
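
The annotation flow described in this abstract can be sketched roughly as follows. This is an illustrative sketch only, not the patented implementation: the function catalogue, the regular expression, the record layout, and every name in the code (`CANDIDATE_FUNCTIONS`, `find_numerical_text`, `annotate`, `export_labeled`) are assumptions, and the graphical selection step is replaced by a plain function argument.

```python
import json
import re

# Hypothetical catalogue of mathematical functions an annotator could choose from.
CANDIDATE_FUNCTIONS = ("equals", "greater_than", "less_than", "range")

# Assumed definition of "numerical text": integers or simple decimals.
NUMBER_PATTERN = re.compile(r"\d+(?:\.\d+)?")


def find_numerical_text(sample):
    """Return every numeric token in the sample, in order of appearance."""
    return NUMBER_PATTERN.findall(sample)


def annotate(sample, selected_function, label):
    """Build one labeled record; selected_function stands in for the GUI choice."""
    if selected_function not in CANDIDATE_FUNCTIONS:
        raise ValueError(f"unknown function: {selected_function}")
    return {
        "text": sample,
        "features": {
            "numerical_text": find_numerical_text(sample),
            "math_function": selected_function,
        },
        "label": label,
    }


def export_labeled(records):
    """Serialize records as JSON Lines, one labeled sample per line."""
    return "\n".join(json.dumps(record) for record in records)


record = annotate("Patients older than 65 years", "greater_than", "age_over_threshold")
dataset = export_labeled([record])
```

Each exported line holds the sample text, its feature dictionary, and the chosen output label, which is one plausible shape for a training file.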

    Using visual features to identify document sections

    Publication No.: US10565444B2

    Publication Date: 2020-02-18

    Application No.: US15698212

    Filing Date: 2017-09-07

    Abstract: A method, a computer system, and a computer program product for identifying sections in a document based on a plurality of visual features are provided. The present invention may include receiving a plurality of documents. The present invention may also include extracting a plurality of content blocks. The present invention may further include determining the plurality of visual features. The present invention may then include grouping the extracted plurality of content blocks into a plurality of categories. The present invention may also include generating a plurality of closeness scores for the plurality of categories by utilizing a Visual Similarity Measure. The present invention may further include generating a plurality of Association Matrices on the plurality of categories for each of the received plurality of documents based on the Visual Similarity Measure. The present invention may further include merging the plurality of categories into a plurality of clusters.
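
The grouping and merging steps in this abstract can be illustrated with a small sketch. The abstract does not define the Visual Similarity Measure, so the measure below (the fraction of matching features) is a stand-in assumption, as are the feature tuple `(font_size, bold, indent)`, the threshold, and all function names.

```python
from collections import defaultdict
from itertools import combinations


def group_blocks(blocks):
    """Group content blocks into categories keyed by their visual features."""
    categories = defaultdict(list)
    for text, features in blocks:
        categories[features].append(text)
    return dict(categories)


def visual_similarity(a, b):
    """Stand-in measure: fraction of visual features on which two categories agree."""
    return sum(x == y for x, y in zip(a, b)) / len(a)


def association_matrix(categories):
    """Closeness score for every unordered pair of categories."""
    return {(a, b): visual_similarity(a, b) for a, b in combinations(categories, 2)}


def merge_categories(categories, threshold=0.6):
    """Merge categories whose closeness score meets the (assumed) threshold."""
    parent = {key: key for key in categories}

    def find(key):
        # Follow parent links until reaching the cluster representative.
        while parent[key] != key:
            key = parent[key]
        return key

    for (a, b), score in association_matrix(categories).items():
        if score >= threshold:
            parent[find(a)] = find(b)

    clusters = defaultdict(list)
    for key, texts in categories.items():
        clusters[find(key)].extend(texts)
    return list(clusters.values())


# Hypothetical blocks: (text, (font_size, bold, indent)).
blocks = [
    ("Introduction", (14, True, 0)),
    ("Methods", (14, True, 0)),
    ("Body text one", (10, False, 0)),
    ("Body text two", (10, False, 2)),
]
clusters = merge_categories(group_blocks(blocks))
```

With these sample blocks, the two headings share one category, and the two body-text categories agree on two of three features, so they merge into a single cluster under the assumed 0.6 threshold.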
