Agent persona grounded chit-chat generation framework

    公开(公告)号:US11087092B2

    公开(公告)日:2021-08-10

    申请号:US16399871

    申请日:2019-04-30

    Abstract: Approaches for determining a response for an agent in an undirected dialogue are provided. The approaches include a dialogue generating framework comprising an encoder neural network, a decoder neural network, and a language model neural network. The dialogue generating framework generates a sketch sentence response with at least one slot. The sketch sentence response is generated word by word and takes into account the undirected dialogue and agent traits of the agent making the response. The dialogue generating framework generates sentence responses by filling the slot with words from the agent traits. The dialogue generating framework ranks the sentence responses according to perplexity by passing the sentence responses through a language model and selects a final response which is a sentence response that has a lowest perplexity.

    Evaluating the Factual Consistency of Abstractive Text Summarization

    公开(公告)号:US20210124876A1

    公开(公告)日:2021-04-29

    申请号:US16750598

    申请日:2020-01-23

    Abstract: A weakly-supervised, model-based approach is provided for verifying or checking factual consistency and identifying conflicts between source documents and a generated summary. In some embodiments, an artificially generated training dataset is created by applying rule-based transformations to sentences sampled from one or more unannotated source documents of a dataset. Each of the resulting transformed sentences can be either semantically variant or invariant from the respective original sampled sentence, and labeled accordingly. In some embodiments, the generated training dataset is used to train a factual consistency checking model. The factual consistency checking model can classify whether a corresponding text summary is factually consistent with a source text document, and if so, may identify a span in the source text document that supports the corresponding text summary.

    SYSTEMS AND METHODS FOR SCIENETIFIC CONTRIBUTION SUMMARIZATION

    公开(公告)号:US20220067302A1

    公开(公告)日:2022-03-03

    申请号:US17161327

    申请日:2021-01-28

    Abstract: Embodiments described herein provide natural language processing (NLP) systems and methods that provide a customized summarization of scientific or technical articles, which disentangles background information from new contributions, and summarizes the background information or the new information (or both) based on a user's preference. Specifically, the systems and methods utilize machine learning classifiers to classify portions of sentences within the article as containing background information or as containing a new contribution attributable to the article. The systems and methods then incorporate the background information in the summary or incorporate the new contribution in the summary and output the summary. In this way, the systems and methods can provide summaries of scientific literatures, which largely accelerates literature review in scientific fields.

    Systems and methods for scientific contribution summarization

    公开(公告)号:US11790184B2

    公开(公告)日:2023-10-17

    申请号:US17161327

    申请日:2021-01-28

    CPC classification number: G06F40/40 G06F40/284 G06N5/04 G06N20/00 G06F40/30

    Abstract: Embodiments described herein provide natural language processing (NLP) systems and methods that provide a customized summarization of scientific or technical articles, which disentangles background information from new contributions, and summarizes the background information or the new information (or both) based on a user's preference. Specifically, the systems and methods utilize machine learning classifiers to classify portions of sentences within the article as containing background information or as containing a new contribution attributable to the article. The systems and methods then incorporate the background information in the summary or incorporate the new contribution in the summary and output the summary. In this way, the systems and methods can provide summaries of scientific literatures, which largely accelerates literature review in scientific fields.

    SYSTEMS AND METHODS FOR EXPLAINABLE AND FACTUAL MULTI-DOCUMENT SUMMARIZATION

    公开(公告)号:US20230070497A1

    公开(公告)日:2023-03-09

    申请号:US17589675

    申请日:2022-01-31

    Abstract: Embodiments described herein provide methods and systems for summarizing multiple documents. A system receives a plurality of documents and generates embeddings of the sentences from the plurality of documents. The embedded sentences are clustered in a representation space. Sentences from a reference summary are embedded and aligned with the closest cluster. Sentences from each cluster are summarized with the aligned reference sentences as a target. A loss is computed based on the summarized sentences and the aligned references, and the natural language processing model is updated based on the loss. Sentences may be masked from being used in the summarization by identifying sentences that are contradicted by other sentences within the plurality of documents.

Patent Agency Ranking