Publication No.: US20210397791A1
Publication Date: 2021-12-23
Application No.: US17203680
Filing Date: 2021-03-16
Inventor: Danxiang Zhu
IPC: G06F40/289, G06K9/62, G06F40/205, G06N5/02
Abstract: The present disclosure provides a method for training a language model, together with an associated apparatus, electronic device, and readable storage medium, and relates to the technical fields of deep learning and natural language processing. A specific implementation is as follows: sampling a paragraph of text from each of a plurality of articles to obtain multiple paragraphs of text; concatenating the multiple paragraphs to obtain a concatenated text; inputting the concatenated text into a language model, which outputs a predicted value of the number of articles; and training the language model based on the actual number of articles in the plurality of articles and the predicted number of articles, until a preset training completion condition is satisfied. Training the language model on texts sampled from the plurality of articles enables it to classify the content of an entire passage of text and improves its ability to recognize text content.
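The data-construction steps in the abstract (sample one paragraph per article, concatenate, label with the article count) can be sketched as below. This is a minimal illustration, not the patent's implementation: the function names `build_example` and `sample_example`, the separator, the representation of an article as a list of paragraph strings, and the choice of at least two articles per example are all assumptions. The model, loss, and training completion condition are not specified in the abstract and are omitted.

```python
import random


def build_example(articles, rng, sep=" "):
    """Build one training example from a group of articles.

    `articles` is a list of articles, each given as a non-empty list of
    paragraph strings (an assumed representation, not from the source).
    Returns (concatenated_text, actual_number_of_articles).
    """
    # Sample one paragraph of text from each article.
    paragraphs = [rng.choice(article) for article in articles]
    # Concatenate the sampled paragraphs into a single input text.
    text = sep.join(paragraphs)
    # The training target is the actual number of source articles.
    return text, len(articles)


def sample_example(corpus, rng, max_articles=4, sep=" "):
    """Pick a random group of 2..max_articles articles and build an example.

    The lower bound of 2 reflects "a plurality of articles"; the upper
    bound is a hypothetical hyperparameter.
    """
    k = rng.randint(2, min(max_articles, len(corpus)))
    chosen = rng.sample(corpus, k)
    return build_example(chosen, rng, sep=sep)
```

A training loop would feed `text` to the language model, compare its predicted article count with the returned label, and update the model until the chosen stopping condition is met.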