OUT-OF-DOMAIN ENCODER TRAINING
    3.
    Patent application

    Publication No.: US20210034965A1

    Publication date: 2021-02-04

    Application No.: US16530457

    Filing date: 2019-08-02

    IPC classification: G06N3/08 G06N20/00 G06F17/16

    Abstract: A computer-implemented method includes using an embedding network to generate prototypical vectors. Each prototypical vector is based on a corresponding label associated with a first domain. The computer-implemented method also includes using the embedding network to generate an in-domain test vector based on at least one data sample from a particular label associated with the first domain and using the embedding network to generate an out-of-domain test vector based on at least one other data sample associated with a different domain. The computer-implemented method also includes comparing the prototypical vectors to the in-domain test vector to generate in-domain comparison values and comparing the prototypical vectors to the out-of-domain test vector to generate out-of-domain comparison values. The computer-implemented method also includes modifying, based on the in-domain comparison values and the out-of-domain comparison values, one or more parameters of the embedding network.
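
The training loop summarized in this abstract can be illustrated with a small sketch. Everything concrete here is an assumption made for illustration, not the patent's method: the linear `embed` function stands in for the embedding network, comparison values are negative squared Euclidean distances, the out-of-domain term is a hinge with a hypothetical `margin`, and the parameter update uses finite-difference gradients to stay dependency-free.

```python
import math

def embed(weights, x):
    """Hypothetical linear embedding network: one weight row per output dimension."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]

def prototype(weights, samples):
    """Prototypical vector for one label: mean of the embedded samples."""
    vecs = [embed(weights, s) for s in samples]
    return [sum(col) / len(vecs) for col in zip(*vecs)]

def comparison_values(prototypes, test_vec):
    """Negative squared Euclidean distance to each prototype (higher = closer)."""
    return [-sum((p - t) ** 2 for p, t in zip(proto, test_vec))
            for proto in prototypes]

def loss(weights, support, in_sample, in_label, ood_sample, margin=1.0):
    labels = sorted(support)
    protos = [prototype(weights, support[label]) for label in labels]
    in_vals = comparison_values(protos, embed(weights, in_sample))
    ood_vals = comparison_values(protos, embed(weights, ood_sample))
    # In-domain term: cross-entropy of a softmax over the comparison values.
    m = max(in_vals)
    log_z = m + math.log(sum(math.exp(v - m) for v in in_vals))
    in_loss = log_z - in_vals[labels.index(in_label)]
    # Out-of-domain term (assumed hinge): penalize an out-of-domain vector
    # whose squared distance to its nearest prototype is below the margin.
    ood_loss = max(0.0, margin + max(ood_vals))
    return in_loss + ood_loss

def train_step(weights, batch, lr=0.05, eps=1e-5):
    """One gradient-descent step using forward finite differences."""
    base = loss(weights, *batch)
    grads = []
    for i, row in enumerate(weights):
        grad_row = []
        for j in range(len(row)):
            bumped = [r[:] for r in weights]
            bumped[i][j] += eps
            grad_row.append((loss(bumped, *batch) - base) / eps)
        grads.append(grad_row)
    return [[w - lr * g for w, g in zip(row, grad_row)]
            for row, grad_row in zip(weights, grads)]
```

A batch is the tuple `(support, in_sample, in_label, ood_sample)`, where `support` maps each first-domain label to its samples. A real implementation would likely use an automatic-differentiation framework instead of finite differences.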

    Holistic document search
    4.
    Granted patent

    Publication No.: US10459900B2

    Publication date: 2019-10-29

    Application No.: US15183742

    Filing date: 2016-06-15

    IPC classification: G06F16/22 G06F16/93

    Abstract: A set of documents is parsed. Members of the set of documents include a set of text elements and a set of visual elements. A text content stream based on the set of text elements and a visual content stream based on the set of visual elements are produced. For respective documents, a set of respective visual element summarizations is built from the visual content stream. Each visual summarization includes a text description of a respective visual element in the respective document. A holistic index is created by indexing the text content from the text content stream and the text descriptions of the visual elements in a single search index. The indexing uses a set of semantic relationships between the text content from the text content stream and the textual descriptions of the visual elements. A user interface allows a user to selectively search text content and visual content.
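
As a rough illustration of the single search index described above, the sketch below stores tokens from both the text content and the textual descriptions of visual elements in one inverted index, and lets a query select which stream to search. The data layout and function names are hypothetical, and the semantic relationships mentioned in the abstract are not modeled.

```python
from collections import defaultdict

def build_holistic_index(documents):
    """Build one inverted index over both content streams.

    documents: {doc_id: {"text": [str, ...], "visual": [str, ...]}}
    where "visual" holds text descriptions of visual elements.
    """
    index = defaultdict(set)
    for doc_id, streams in documents.items():
        for stream in ("text", "visual"):
            for passage in streams.get(stream, []):
                for token in passage.lower().split():
                    # Each posting records which stream the token came from.
                    index[token].add((doc_id, stream))
    return index

def search(index, query, streams=("text", "visual")):
    """Return ids of documents in which every query term occurs
    in at least one of the selected streams."""
    result = None
    for token in query.lower().split():
        hits = {doc for doc, stream in index.get(token, ()) if stream in streams}
        result = hits if result is None else result & hits
    return sorted(result or [])
```

Passing `streams=("visual",)` restricts the search to visual content, which loosely corresponds to the selective search offered by the user interface in the abstract.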

    KNOWLEDGE GRAPH COMPRESSION
    6.
    Patent application

    Publication No.: US20220335270A1

    Publication date: 2022-10-20

    Application No.: US17231289

    Filing date: 2021-04-15

    IPC classification: G06N3/04

    Abstract: Aspects of the present disclosure relate to knowledge graph compression. An input knowledge graph (KG) can be received. The input KG can be encoded to receive a first set of node embeddings. The input KG can be compressed into an output KG. The output KG can be encoded to receive a second set of node embeddings. A model for KG compression can be trained using optimal transport based on a distance matrix between the first set of node embeddings and the second set of node embeddings.
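
To make the optimal transport step more concrete, here is a minimal sketch of a transport cost computed from a distance matrix between two sets of node embeddings. It assumes uniform weights on the nodes and uses entropy-regularized Sinkhorn iterations as a standard approximation; the abstract does not say which solver or marginals are used, and the encoder and compression model themselves are not shown.

```python
import math

def distance_matrix(first, second):
    """Euclidean distance between every pair of node embeddings."""
    return [[math.dist(x, y) for y in second] for x in first]

def sinkhorn_cost(cost, reg=0.1, iters=200):
    """Entropy-regularized transport cost with uniform marginals.

    Returns (transport cost, transport plan). `reg` and `iters` are
    illustrative defaults, not values from the source.
    """
    n, m = len(cost), len(cost[0])
    r, c = [1.0 / n] * n, [1.0 / m] * m
    kernel = [[math.exp(-cij / reg) for cij in row] for row in cost]
    u, v = [1.0] * n, [1.0] * m
    for _ in range(iters):
        # Alternately rescale rows and columns to match the marginals.
        u = [r[i] / sum(kernel[i][j] * v[j] for j in range(m)) for i in range(n)]
        v = [c[j] / sum(kernel[i][j] * u[i] for i in range(n)) for j in range(m)]
    plan = [[u[i] * kernel[i][j] * v[j] for j in range(m)] for i in range(n)]
    total = sum(plan[i][j] * cost[i][j] for i in range(n) for j in range(m))
    return total, plan
```

In a training setup along the lines of the abstract, a cost like this could serve as the loss that a compression model tries to reduce, so that the compressed graph's embeddings stay close to those of the input graph. Very large distances relative to `reg` can underflow the kernel, so inputs may need scaling.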

    Context-aware conversation thread detection for communication sessions

    Publication No.: US11288578B2

    Publication date: 2022-03-29

    Application No.: US16597937

    Filing date: 2019-10-10

    Abstract: A computer system identifies threads in a communication session. A feature vector is generated for a message in a communication session, wherein the feature vector includes elements for features and contextual information of the message. The message feature vector and feature vectors for a plurality of threads are processed using machine learning models each associated with a corresponding thread to determine a set of probability values for classifying the message into at least one thread, wherein the threads include one or more pre-existing threads and a new thread. A classification of the message into at least one of the threads is indicated based on the set of probability values. Classification of one or more prior messages is adjusted based on the message's classification. Embodiments of the present invention further include a method and program product for identifying threads in a communication session in substantially the same manner described above.
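
The classification step can be sketched as follows. This is only a stand-in: cosine similarity replaces the per-thread machine learning models, the "new thread" option gets a fixed hypothetical score (`new_thread_score`), and a softmax with an assumed `temperature` turns scores into probability values. Adjusting the classification of prior messages is not modeled.

```python
import math

def cosine(a, b):
    """Cosine similarity between two feature vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def thread_probabilities(message_vec, thread_vecs,
                         new_thread_score=0.5, temperature=0.1):
    """Probability for each pre-existing thread, plus a final entry
    for starting a new thread."""
    scores = [cosine(message_vec, t) for t in thread_vecs] + [new_thread_score]
    m = max(scores)
    exps = [math.exp((s - m) / temperature) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def classify(message_vec, thread_vecs, **kwargs):
    """Return (thread index or "new", probability values)."""
    probs = thread_probabilities(message_vec, thread_vecs, **kwargs)
    best = max(range(len(probs)), key=probs.__getitem__)
    return ("new" if best == len(thread_vecs) else best), probs
```

For example, `classify([0.9, 0.1], [[1.0, 0.0], [0.0, 1.0]])` assigns the message to the first thread, while a message pointing away from every thread vector falls through to the new-thread option.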

    Holistic document search
    8.
    Granted patent

    Publication No.: US11093469B2

    Publication date: 2021-08-17

    Application No.: US16536968

    Filing date: 2019-08-09

    IPC classification: G06F16/22 G06F16/93 G06F16/36

    Abstract: A set of documents is parsed. Members of the set of documents include a set of text elements and a set of visual elements. A text content stream based on the set of text elements and a visual content stream based on the set of visual elements are produced. For respective documents, a set of respective visual element summarizations is built from the visual content stream. Each visual summarization includes a textual description of a respective visual element in the respective document. A holistic index is created by indexing the text content from the text content stream and the text descriptions of the visual elements for each document in a single search index. The indexing uses a set of semantic relationships between the text content from the text content stream and the textual descriptions of the visual elements. A user interface allows a user to selectively search text content and visual content.