Unsupervised dialogue topic extraction

    公开(公告)号:US11507617B2

    公开(公告)日:2022-11-22

    申请号:US16685933

    申请日:2019-11-15

    Abstract: Disclosed are some implementations of systems, apparatus, methods and computer program products for extracting topics from a corpus of exchanges. The system generates vector representations of utterances of an entity common to the exchanges and uses the vector representations to cluster the utterances. The system labels the clusters and uses the labeled clusters to generate an exchange label sequence for each of the exchanges, where each exchange label sequence corresponds to a sequence of utterances generated by the entity. The system processes the exchange label sequences to generate one or more subsets of the utterances, where each of the subsets corresponds to a particular topic.

    UNSUPERVISED DIALOGUE TOPIC EXTRACTION

    公开(公告)号:US20210149949A1

    公开(公告)日:2021-05-20

    申请号:US16685933

    申请日:2019-11-15

    Abstract: Disclosed are some implementations of systems, apparatus, methods and computer program products for extracting topics from a corpus of exchanges. The system generates vector representations of utterances of an entity common to the exchanges and uses the vector representations to cluster the utterances. The system labels the clusters and uses the labeled clusters to generate an exchange label sequence for each of the exchanges, where each exchange label sequence corresponds to a sequence of utterances generated by the entity. The system processes the exchange label sequences to generate one or more subsets of the utterances, where each of the subsets corresponds to a particular topic.

    IDENTIFICATION OF RESPONSE LIST
    7.
    发明申请

    公开(公告)号:US20210150146A1

    公开(公告)日:2021-05-20

    申请号:US16687626

    申请日:2019-11-18

    Abstract: A system is configured to analyze a corpus of historical chat data to identify the list of “best” responses. As such, the user is not required to identify a list of canned responses for input into the system. The described system uses a context word embedding function and response word embedding function to generate context vectors and response vectors corresponding to the corpus of conversation data, and the vectors are represented by a respective context matrix and a response matrix. The system processes these matrices to generate scores for responses, clusters the responses, and identifies the responses corresponding to the best scores for each cluster.

    UNSUPERVISED DIALOGUE STRUCTURE EXTRACTION

    公开(公告)号:US20210149921A1

    公开(公告)日:2021-05-20

    申请号:US16685926

    申请日:2019-11-15

    Abstract: Disclosed are some implementations of systems, apparatus, methods and computer program products for extracting state flow structures from a corpus of exchanges. The system generates vector representations of utterances of an entity common to the exchanges and uses the vector representations to cluster the utterances.
    The system labels the clusters and uses the labeled clusters to generate an exchange label sequence for each of the exchanges, where the exchange label sequence corresponds to a sequence of utterances generated by the entity. The system processes the exchange label sequences to generate a state flow structure, where each of the states is represented by a corresponding set of utterances.

Patent Agency Ranking