Intelligent training set augmentation for natural language processing tasks

    公开(公告)号:US11599721B2

    公开(公告)日:2023-03-07

    申请号:US17002562

    申请日:2020-08-25

    Abstract: A natural language processing system that trains task models for particular natural language tasks programmatically generates additional utterances for inclusion in the training set, based on the existing utterances in the training set and the existing state of a task model as generated from the original (non-augmented) training set. More specifically, the training augmentation module 220 identifies specific textual units of utterances and generates variants of the utterances based on those identified units. The identification is based on determined importances of the textual units to the output of the task model, as well as on task rules that correspond to the natural language task for which the task model is being generated. The generation of the additional utterances improves the quality of the task model without the expense of manual labeling of utterances for training set inclusion.

    Intelligent Training Set Augmentation for Natural Language Processing Tasks

    公开(公告)号:US20220067277A1

    公开(公告)日:2022-03-03

    申请号:US17002562

    申请日:2020-08-25

    Abstract: A natural language processing system that trains task models for particular natural language tasks programmatically generates additional utterances for inclusion in the training set, based on the existing utterances in the training set and the existing state of a task model as generated from the original (non-augmented) training set. More specifically, the training augmentation module 220 identifies specific textual units of utterances and generates variants of the utterances based on those identified units. The identification is based on determined importances of the textual units to the output of the task model, as well as on task rules that correspond to the natural language task for which the task model is being generated. The generation of the additional utterances improves the quality of the task model without the expense of manual labeling of utterances for training set inclusion.

    HIERARCHICAL NATURAL LANGUAGE UNDERSTANDING SYSTEMS

    公开(公告)号:US20220245349A1

    公开(公告)日:2022-08-04

    申请号:US17162318

    申请日:2021-01-29

    Abstract: Methods and systems for hierarchical natural language understanding are described. A representation of an utterance is inputted to a first machine learning model to obtain information on the first utterance. According to the information on the utterance a determination that the representation of the utterance is to be inputted to a second machine learning model that performs a dedicated natural language task is performed. In response to determining that the representation of the utterance is to be inputted to a second machine learning model, the utterance is inputted to the second machine learning model to obtain an output of the dedicated natural language task.

Patent Agency Ranking