METHOD AND DEVICE FOR PRECISION ALLOCATION BASED ON NEURAL NETWORK PROCESSOR

    Publication No.: US20240411516A1

    Publication Date: 2024-12-12

    Application No.: US18734138

    Filing Date: 2024-06-05

    Abstract: A precision allocation method and device based on a neural network processor are provided. The precision allocation method includes: allocating a weight of a neural network to a multiplier column of the neural network processor; determining a lower tolerance for the multiplier column; selecting, based on the lower tolerance, a first data type for the multiplier column from a plurality of data types, each of which corresponds to a different precision level; and performing, by the neural network processor, a multiplication operation based on the weight and the first data type.
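
The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the abstract does not define how the lower tolerance is computed or which data types are available, so the sketch assumes the tolerance is a maximum acceptable absolute quantization error for the weights in one multiplier column, and it assumes three hypothetical signed-integer data types (INT4, INT8, INT16) with symmetric uniform quantization. The names `DataType`, `quantize`, `select_data_type`, and `column_multiply` are illustrative only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataType:
    name: str
    bits: int  # signed integer width (assumed precision level)


# Hypothetical candidate types, ordered from lowest to highest precision.
DATA_TYPES = (DataType("INT4", 4), DataType("INT8", 8), DataType("INT16", 16))


def quantize(weights, dtype):
    """Symmetric uniform quantization; returns the dequantized values."""
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0.0:
        return list(weights)
    levels = 2 ** (dtype.bits - 1) - 1
    scale = max_abs / levels
    return [round(w / scale) * scale for w in weights]


def select_data_type(weights, tolerance):
    """Pick the lowest-precision type whose worst-case error fits the tolerance.

    Assumption: 'tolerance' is a maximum absolute quantization error.
    Falls back to the highest-precision type if none fits.
    """
    for dtype in DATA_TYPES:
        approx = quantize(weights, dtype)
        error = max(abs(w - a) for w, a in zip(weights, approx))
        if error <= tolerance:
            return dtype
    return DATA_TYPES[-1]


def column_multiply(weights, activations, dtype):
    """Multiply each activation by the column weight at the chosen precision."""
    quantized = quantize(weights, dtype)
    return [w * x for w, x in zip(quantized, activations)]


# Weights allocated to one (hypothetical) multiplier column.
column_weights = [0.5, -0.25, 0.125]
chosen = select_data_type(column_weights, tolerance=0.05)
products = column_multiply(column_weights, [1.0, 2.0, 4.0], chosen)
```

Under these assumptions, a looser tolerance lets a column run at a lower-precision type, while a tighter tolerance pushes it to a higher-precision one.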
