MODEL TRAINING METHOD AND APPARATUS, AND READABLE STORAGE MEDIUM

    公开(公告)号:US20240362486A1

    公开(公告)日:2024-10-31

    申请号:US18565920

    申请日:2022-05-09

    Inventor: Haien ZENG

    CPC classification number: G06N3/082

    Abstract: A model training method includes: acquiring a sample data set corresponding to a target task, a teacher model and an ith initial student model; performing an ith time of channel pruning on the ith initial student model, to acquire a student model subjected to the ith time of channel pruning; performing knowledge distillation according to the sample data set, the teacher model and the student model subjected to the ith time of channel pruning, to acquire an (i+1)th initial student model, wherein a compression ratio of the (i+1)th initial student model to the ith initial student model is equal to a preset ith compression ratio; and updating i to be i+1, and returning to the step of performing the ith time of channel pruning on the ith initial student model, until the updated i is greater than a threshold value N, to acquire a target student model.

Patent Agency Ranking