DATA PROCESSING METHOD AND RELATED DEVICE
    Invention publication

    Publication No.: US20230229898A1

    Publication date: 2023-07-20

    Application No.: US18186942

    Filing date: 2023-03-20

    CPC classification number: G06N3/0499 G06N3/08

    Abstract: A data processing method includes: obtaining to-be-processed data and a target neural network model, where the target neural network model includes a first transformer layer, the first transformer layer includes a first residual branch and a second residual branch, the first residual branch includes a first attention head, and the second residual branch includes a target feed-forward network (FFN) layer; and performing target task related processing on the to-be-processed data based on the target neural network model, to obtain a data processing result, where the target neural network model is for performing a target operation on an output of the first attention head and a first weight value to obtain an output of the first residual branch, and/or the target neural network model is for performing a target operation on an output of the target FFN and a second weight value to obtain an output of the second residual branch.
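
    The weighted residual branches described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the abstract does not say what the "target operation" is (scalar multiplication is assumed), the function and parameter names are hypothetical, the input is a single vector rather than a sequence, layer normalization is omitted, and the attention head and FFN are toy stand-ins rather than real sub-layers.

```python
from typing import Callable, List

Vector = List[float]


def scale(v: Vector, w: float) -> Vector:
    """Assumed 'target operation': multiply a branch output by a scalar weight."""
    return [w * x for x in v]


def add(a: Vector, b: Vector) -> Vector:
    """Element-wise sum used for the residual (skip) connection."""
    return [x + y for x, y in zip(a, b)]


def weighted_transformer_layer(
    x: Vector,
    attention_head: Callable[[Vector], Vector],  # hypothetical stand-in for the first attention head
    ffn: Callable[[Vector], Vector],             # hypothetical stand-in for the target FFN layer
    first_weight: float,
    second_weight: float,
) -> Vector:
    # First residual branch: head output combined with the first weight value.
    branch1 = scale(attention_head(x), first_weight)
    h = add(x, branch1)
    # Second residual branch: FFN output combined with the second weight value.
    branch2 = scale(ffn(h), second_weight)
    return add(h, branch2)


# Toy sub-layers, chosen only so the arithmetic is easy to follow.
def toy_head(v: Vector) -> Vector:
    return [2.0 * x for x in v]


def toy_ffn(v: Vector) -> Vector:
    return [x + 1.0 for x in v]


if __name__ == "__main__":
    x = [1.0, 2.0]
    # Both weights at 0.0 silence both branches, so the layer passes x through.
    print(weighted_transformer_layer(x, toy_head, toy_ffn, 0.0, 0.0))
    # Both weights at 1.0 give an ordinary unweighted residual layer.
    print(weighted_transformer_layer(x, toy_head, toy_ffn, 1.0, 1.0))
```

    Under this reading, a weight of zero removes a branch's contribution while the skip connection keeps the input flowing. That is one plausible use of such weights, not something the abstract itself states.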
