DATA PROCESSING METHOD FOR A NEURAL NETWORK
    1.
    发明公开

    公开(公告)号:US20240273356A1

    公开(公告)日:2024-08-15

    申请号:US18433358

    申请日:2024-02-05

    Inventor: Qian WU Ming LI Qi XU

    CPC classification number: G06N3/08

    Abstract: A data processing method for a neural network implemented by a computing device and comprises a plurality of layers, the computing device comprises a first and a second memory, weight data of each of the plurality of layers is stored in the second memory, input data and output data have respective predetermined storage locations, the method comprises: performing batch processing to input data and weight data with a predetermined batch size; the predetermined batch size is determined using the following steps: determining actual storage locations of the input data and the output data of each layer when data is processed in batches using batch size candidates; determining memory access conditions of the second memory for each layer when data is processed in batches using batch size candidates; determining total memory access amount of the neural network corresponding to batch size candidates; selecting the predetermined batch size.

Patent Agency Ranking