-
公开(公告)号:US20240273356A1
公开(公告)日:2024-08-15
申请号:US18433358
申请日:2024-02-05
Applicant: MONTAGE TECHNOLOGY CO., LTD.
IPC: G06N3/08
CPC classification number: G06N3/08
Abstract: A data processing method for a neural network implemented by a computing device and comprises a plurality of layers, the computing device comprises a first and a second memory, weight data of each of the plurality of layers is stored in the second memory, input data and output data have respective predetermined storage locations, the method comprises: performing batch processing to input data and weight data with a predetermined batch size; the predetermined batch size is determined using the following steps: determining actual storage locations of the input data and the output data of each layer when data is processed in batches using batch size candidates; determining memory access conditions of the second memory for each layer when data is processed in batches using batch size candidates; determining total memory access amount of the neural network corresponding to batch size candidates; selecting the predetermined batch size.