Multi-Task Knowledge Distillation for Language Model

    Publication number: US20210142164A1

    Publication date: 2021-05-13

    Application number: US16716249

    Application date: 2019-12-16

Abstract: Systems and methods are provided that employ knowledge distillation under a multi-task learning setting. In some embodiments, the systems and methods are implemented with a larger teacher model and a smaller student model, each of which comprises one or more shared layers and a plurality of task layers for performing multiple tasks. During training of the teacher model, its shared layers are initialized, and then the teacher model is multi-task refined. The teacher model predicts teacher logits. During training of the student model, its shared layers are initialized. Knowledge distillation is employed to transfer knowledge from the teacher model to the student model by the student model updating its shared layers and task layers, for example, according to the teacher logits of the teacher model. Other features are also provided.
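The sketch below illustrates, in general terms, the kind of arrangement the abstract describes: a larger teacher and a smaller student, each with shared layers plus per-task layers, where the student is trained against the teacher's logits. It is a minimal illustration assuming PyTorch; the class names, layer sizes, temperature, and loss weighting are assumptions for demonstration and are not taken from the patent itself.

```python
# Minimal multi-task knowledge distillation sketch (assumes PyTorch).
# All names and hyperparameters here are illustrative assumptions,
# not the patent's actual implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F


class MultiTaskModel(nn.Module):
    """Shared layers followed by one task layer (head) per task."""

    def __init__(self, input_dim, hidden_dim, task_output_dims):
        super().__init__()
        # Shared layers: reused across all tasks.
        self.shared = nn.Sequential(
            nn.Linear(input_dim, hidden_dim),
            nn.ReLU(),
        )
        # One task layer per task, each producing that task's logits.
        self.task_heads = nn.ModuleList(
            [nn.Linear(hidden_dim, out_dim) for out_dim in task_output_dims]
        )

    def forward(self, x, task_id):
        return self.task_heads[task_id](self.shared(x))


def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5):
    """Blend soft-target KL loss against teacher logits with hard-label cross-entropy."""
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * (temperature ** 2)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard


if __name__ == "__main__":
    # Larger teacher, smaller student, same task structure (two tasks here).
    teacher = MultiTaskModel(input_dim=32, hidden_dim=256, task_output_dims=[3, 5])
    student = MultiTaskModel(input_dim=32, hidden_dim=64, task_output_dims=[3, 5])
    optimizer = torch.optim.Adam(student.parameters(), lr=1e-3)

    x = torch.randn(8, 32)               # a batch of inputs
    labels = torch.randint(0, 3, (8,))   # hard labels for the first task
    task_id = 0

    with torch.no_grad():                # the (already refined) teacher predicts logits
        teacher_logits = teacher(x, task_id)

    student_logits = student(x, task_id)
    loss = distillation_loss(student_logits, teacher_logits, labels)
    loss.backward()                      # gradients flow into the student's shared and task layers
    optimizer.step()
    print(f"distillation loss: {loss.item():.4f}")
```

In this sketch the student's shared layers and the active task head are both updated by the distillation loss, mirroring the abstract's description of the student updating its shared layers and task layers according to the teacher logits.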
