-
公开(公告)号:US20230229898A1
公开(公告)日:2023-07-20
申请号:US18186942
申请日:2023-03-20
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
IPC: G06N3/0499 , G06N3/08
CPC classification number: G06N3/0499 , G06N3/08
Abstract: A data processing method includes: obtaining to-be-processed data and a target neural network model, where the target neural network model includes a first transformer layer, the first transformer layer includes a first residual branch and a second residual branch, the first residual branch includes a first attention head, and the second residual branch includes a target feed-forward network (FFN) layer; and performing target task related processing on the to-be-processed data based on the target neural network model, to obtain a data processing result, where the target neural network model is for performing a target operation on an output of the first attention head and a first weight value to obtain an output of the first residual branch, and/or the target neural network model is for performing a target operation on an output of the target FFN and a second weight value to obtain an output of the second residual branch.