ONTOLOGY INTEGRATION FOR DOCUMENT SUMMARIZATION

    Publication No.: US20210294829A1

    Publication date: 2021-09-23

    Application No.: US17210318

    Application date: 2021-03-23

    Applicant: Sorcero, Inc.

    Abstract: Provided is a method including obtaining parameters and a document, and determining a domain based on the parameters, where the domain maps to a first ontology, and where ontologies map n-grams onto a set of concepts. The method includes scoring a first set of n-grams of the document using a scoring model based on relations between members of the first set of n-grams, selecting sections of the text based on n-gram scores provided by the scoring model, and determining an initial n-gram set, where each respective n-gram of the initial n-gram set maps to a respective concept of the set of concepts, and where each respective n-gram is identified by an ontology other than the first ontology. The method includes determining related n-grams mapped to the set of concepts associated with the domain and generating a text summary for the document based on the sections and the related n-grams.
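The steps named in this abstract can be illustrated with a small sketch. It is not the patented implementation: the toy `ONTOLOGIES` table, the co-occurrence scoring rule, the substring matching, and all function names are assumptions made only to show how the pieces could fit together.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy ontologies: each maps n-grams onto concept identifiers.
ONTOLOGIES = {
    "cardiology": {
        "heart attack": "C_MI",
        "myocardial infarction": "C_MI",
        "blood thinner": "C_ANTICOAG",
        "anticoagulant": "C_ANTICOAG",
    },
    "general": {"heart attack": "C_MI", "blood thinner": "C_ANTICOAG"},
}


def find_ngrams(sentence, vocabulary):
    """Return the vocabulary n-grams that occur in the sentence (naive substring match)."""
    text = sentence.lower()
    return {gram for gram in vocabulary if gram in text}


def score_ngrams(sentences, vocabulary):
    """Assumed scoring model: an n-gram gains a point for each other n-gram it co-occurs with."""
    scores = Counter()
    for sentence in sentences:
        for a, b in combinations(sorted(find_ngrams(sentence, vocabulary)), 2):
            scores[a] += 1
            scores[b] += 1
    return scores


def summarize(document, params, top_k=1):
    domain = params["domain"]  # stand-in for determining the domain from the parameters
    first = ONTOLOGIES[domain]
    # Merge every ontology other than the first one.
    other = {g: c for name, onto in ONTOLOGIES.items() if name != domain
             for g, c in onto.items()}
    vocabulary = set(first) | set(other)

    sentences = [s.strip() for s in document.split(".") if s.strip()]
    scores = score_ngrams(sentences, vocabulary)

    # Select the top-scoring sections (here, sentences).
    ranked = sorted(sentences,
                    key=lambda s: -sum(scores[g] for g in find_ngrams(s, vocabulary)))
    sections = ranked[:top_k]

    # Initial n-grams: found in the selected sections via an ontology other than the first.
    initial = {g for s in sections for g in find_ngrams(s, other)}
    concepts = {other[g] for g in initial}

    # Related n-grams: first-ontology n-grams mapped to the same concepts.
    related = sorted(g for g, c in first.items() if c in concepts and g not in initial)

    text = ". ".join(sections) + "."
    if related:
        text += " Related terms: " + ", ".join(related) + "."
    return {"sections": sections, "related": related, "text": text}
```

A call such as `summarize(doc, {"domain": "cardiology"})` returns the selected sections, the related n-grams drawn from the domain ontology, and a combined summary string. A real system would replace the substring matching and co-occurrence counts with a proper tokenizer and a learned scoring model.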

FEATURE ENGINEERING WITH QUESTION GENERATION

    Publication No.: US20210294781A1

    Publication date: 2021-09-23

    Application No.: US17210320

    Application date: 2021-03-23

    Applicant: Sorcero, Inc.

    Abstract: Provided is a computer-implemented process including obtaining a corpus of natural-language text documents, automatically generating questions about information in corresponding portions of the documents, and associating the questions with the corresponding portions of the documents. The process further includes storing the questions and their associations with the corresponding portions of the documents in memory to form an index that maps automatically generated questions to the portions of documents that answer them.
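The indexing idea in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the template-based `generate_questions` function is a hypothetical stand-in for whatever question-generation model the process actually uses, and the corpus layout (document id mapped to a list of text portions) is assumed.

```python
from collections import defaultdict


def generate_questions(portion):
    """Hypothetical generator: turn each 'X is Y' sentence into 'What is X?'."""
    questions = []
    for sentence in portion.split(". "):
        subject, separator, _ = sentence.partition(" is ")
        if separator:  # the sentence contains " is "
            questions.append(f"What is {subject.strip().lower()}?")
    return questions


def build_index(corpus):
    """Map each generated question to the (doc_id, portion_number) pairs that answer it."""
    index = defaultdict(list)
    for doc_id, portions in corpus.items():
        for number, portion in enumerate(portions):
            for question in generate_questions(portion):
                index[question].append((doc_id, number))
    return dict(index)


def lookup(index, corpus, question):
    """Return the stored portions associated with a question, or an empty list."""
    return [corpus[doc_id][number] for doc_id, number in index.get(question, [])]
```

Given a corpus such as `{"doc1": ["An ontology is a mapping from n-grams to concepts."]}`, `build_index` stores the question "What is an ontology?" against portion 0 of `doc1`, and `lookup` retrieves that portion. Storing locations rather than copies of the text keeps the index small, which is a design choice of this sketch and not something the abstract specifies.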