-
公开(公告)号:US20250077980A1
公开(公告)日:2025-03-06
申请号:US18952687
申请日:2024-11-19
Applicant: Lemon Inc. , Beijing Youzhuju Network Technology Co., Ltd.
Inventor: Xinbo ZHANG , Luong Quoc TRUNG , Zhanming JIE , Peng SUN , Xiaoran JIN, JR. , Hang LI
IPC: G06N20/00
Abstract: There are provided a method, an apparatus, a device, and a storage medium for model training. In a method, a target model is fine-tuned using a set of training data, each training data including a sample question and corresponding annotation information, the annotation information including policy information for solving the sample question and answer information of the sample question. At least one sample question in the set of training data is provided to the fine-tuned target model to determine a candidate answer to the at least one sample question. The fine-tuned target model is trained based at least on a comparison between the candidate answer and the answer information of the at least one sample question.