Aspect prompting framework for language modeling

    公开(公告)号:US12175204B2

    公开(公告)日:2024-12-24

    申请号:US17584068

    申请日:2022-01-25

    Abstract: Techniques for dynamically developing a contextual set of prompts based on relevant aspects extracted from s set of training data. One technique includes obtaining training data comprising text examples and associated labels, extracting aspects from the training data, generating prompting templates based on the training data and the extracted aspects, concatenating each of the text examples with the respective generated prompting template to create prompting functions, training a machine learning language model on the prompting functions to predict a solution for a task, where the training is formulated as a masked language modeling problem with blanks of the prompting templates being set as text labels and expected output for the task being set as specified solution labels, and the training learns or updates model parameters of the machine learning language model for performing the task. The machine learning language model is provided with the learned or updated model parameters.

    FRACTIONAL INFERENCE ON GPU AND CPU FOR LARGE SCALE DEPLOYMENT OF CUSTOMIZED TRANSFORMERS BASED LANGUAGE MODELS

    公开(公告)号:US20230100303A1

    公开(公告)日:2023-03-30

    申请号:US17488289

    申请日:2021-09-28

    Abstract: Systems and methods for fractional inference on GPU and CPU for large scale deployment of customized transformers based language models are disclosed herein. The method can include, receiving data for use in generation of a machine learning model output, ingesting the data with a first machine learning model on a Graphic Processing Unit, receiving at least one intermediate output from the first machine learning model at a temporary store, receiving the at least one intermediate output from the temporary store at a Central Processing Unit, ingesting the at least one intermediate output with a second machine learning model on the Central Processing Unit, and outputting a prediction with the second machine learning model.

    ASPECT PROMPTING FRAMEWORK FOR LANGUAGE MODELING

    公开(公告)号:US20230237277A1

    公开(公告)日:2023-07-27

    申请号:US17584068

    申请日:2022-01-25

    CPC classification number: G06F40/40 G06V30/19147 G06F40/186 G06F40/284

    Abstract: Techniques for dynamically developing a contextual set of prompts based on relevant aspects extracted from s set of training data. One technique includes obtaining training data comprising text examples and associated labels, extracting aspects from the training data, generating prompting templates based on the training data and the extracted aspects, concatenating each of the text examples with the respective generated prompting template to create prompting functions, training a machine learning language model on the prompting functions to predict a solution for a task, where the training is formulated as a masked language modeling problem with blanks of the prompting templates being set as text labels and expected output for the task being set as specified solution labels, and the training learns or updates model parameters of the machine learning language model for performing the task. The machine learning language model is provided with the learned or updated model parameters.

    TECHNIQUES FOR TRAINED MODEL BIAS ASSESSMENT

    公开(公告)号:US20230131834A1

    公开(公告)日:2023-04-27

    申请号:US17508734

    申请日:2021-10-22

    Abstract: A system is disclosed that is configured to perform various bias checks on an machine learning (ML) model in order to identify one or more biases, if any, that may be inherent to the ML model. Bias evaluation results generated from performing the checks are then reported to a user, such as to a consumer of the ML model, a data scientist responsible for modeling and training the ML model, and others. The bias evaluation system performs one or more bias checks by generating synthetic datasets using attributes present in the ML model or a training dataset used to train the ML model. Prediction data is then generated by inputting the synthetically generated input data points of the synthetic datasets into the ML model. The prediction data is then processed and evaluated for biases. Results of the evaluation may be compiled into a bias evaluation report.

Patent Agency Ranking