MULTI-LINGUAL INTENT MODEL WITH OUT-OF-DOMAIN DETECTION

    公开(公告)号:US20230086302A1

    公开(公告)日:2023-03-23

    申请号:US17479748

    申请日:2021-09-20

    Abstract: A method that includes receiving an input at an interactive conversation service that uses an intent classification model. The method may further include generating, using an encoder model of the intent classification model, a set of output vectors corresponding to the input, where the encoder model is configured to determine a set of metrics corresponding to intent classifications. The method may further include determining, using an outlier detection model of the intent classification model, whether the input is in-domain or out-of-domain (OOD) based on a first vector of the set of output vectors satisfying a domain threshold relative to one or more of the intent classifications. The method may further include outputting, by the intent classification model, a second vector of the set of output vectors that indicates the set of metrics corresponding to the intent classifications or an indication that the input is OOD.

    SYNTHETIC CRAFTING OF TRAINING AND TEST DATA FOR NAMED ENTITY RECOGNITION

    公开(公告)号:US20220245346A1

    公开(公告)日:2022-08-04

    申请号:US17248583

    申请日:2021-01-29

    Abstract: A method and system for extracting and labeling Named-Entity Recognition (NER) data in a target language for use in a multi-lingual software module has been developed. First, a textual sentence is translated to the target language using a translation module. A named entity is identified and extracted within the translated sentence. The named entity is identified by either: exact mapping; a semantically similar translated named entity that meets a predetermined minimum threshold of similarity; or utilizing a rule-based library for the target language. Once identified, the named entity is labeled with a pre-determined category and stored in a retrievable electronic database.

Patent Agency Ranking