Method and device for training a model based on federated learning

    公开(公告)号:US12056582B2

    公开(公告)日:2024-08-06

    申请号:US16921207

    申请日:2020-07-06

    Abstract: A method and device for training a model based on federated learning are provided. The method includes: receiving a second original independent variable calculated value from a second data provider device; the second original independent variable calculated value being calculated by the second data provider device according to a second original independent variable and a second model parameter; calculating a dependent variable estimation value according to a first model parameter initial value of a first provider device, a first original independent variable of the first data provider device, and the second original independent variable calculated value; calculating a difference between a dependent variable of the first data provider device and the dependent variable estimation value; calculating a gradient of a loss function with respect to a first model parameter, according to the difference; and updating the first model parameter according to the gradient of the loss function with respect to the first model parameter.

    Multi-lingual model training method, apparatus, electronic device and readable storage medium

    公开(公告)号:US11995405B2

    公开(公告)日:2024-05-28

    申请号:US17348104

    申请日:2021-06-15

    CPC classification number: G06F40/30 G06F40/58 G06N20/00

    Abstract: The present disclosure provides a multi-lingual model training method, apparatus, electronic device and readable storage medium and relates to the technical field of deep learning and natural language processing. A technical solution of the present disclosure when training the multi-lingual model is: obtaining training corpuses comprising a plurality of bilingual corpuses and a plurality of monolingual corpuses; training a multi-lingual model with a first training task by using the plurality of bilingual corpuses; training the multi-lingual model with a second training task by using the plurality of monolingual corpuses; and completing the training of the multi-lingual model in a case of determining that loss functions of the first training task and second training task converge. In the present disclosure, the multi-lingual model can be enabled to achieve semantic interaction between different languages and improve the accuracy of the multi-lingual model in learning the semantic representations of the multi-lingual model.

Patent Agency Ranking