Publication No.: US11928578B2
Publication date: 2024-03-12
Application No.: US17108927
Filing date: 2020-12-01
Inventors: Sungju Ryu, Jae-Joon Kim, Youngtaek Oh
Abstract: A processing method for a sparsity-aware neural processing unit includes: receiving a plurality of input activations (IA); obtaining, in each weight output channel, a weight having a non-zero value; storing the weight and the IA in a memory, and obtaining an input channel index comprising the memory address location at which the weight and the IA are stored; and arranging the non-zero weights of each weight output channel according to a row size of an index matching unit (IMU), and matching the IA to the weight in the IMU, the IMU comprising a buffer memory that stores the input channel index.
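
The matching step described in the abstract can be illustrated with a small software analogy. The following is a minimal sketch, not the patented hardware design: it assumes the input channel index is simply a position in a Python list (standing in for a memory address), that the IMU row size is a tile length, and that matched pairs are multiplied and accumulated per output channel. The names `compress_weights`, `match_and_accumulate`, and `imu_rows` are hypothetical and do not appear in the source.

```python
from typing import List, Tuple

Pair = Tuple[int, int]  # (input channel index, non-zero weight value)


def compress_weights(weights: List[List[int]]) -> List[List[Pair]]:
    """Keep only non-zero weights per output channel, tagged with their index.

    weights[oc][ic] is the weight for output channel oc and input channel ic.
    """
    return [
        [(ic, w) for ic, w in enumerate(row) if w != 0]
        for row in weights
    ]


def match_and_accumulate(
    ia: List[int], compressed: List[List[Pair]], imu_rows: int
) -> List[int]:
    """Match each non-zero weight to its input activation by index.

    Assumption: non-zero weights are processed in tiles of `imu_rows`
    entries, loosely mimicking arrangement by the IMU row size.
    """
    outputs = []
    for pairs in compressed:
        acc = 0
        for start in range(0, len(pairs), imu_rows):
            tile = pairs[start:start + imu_rows]
            for ic, w in tile:
                a = ia[ic]      # look up the activation by its channel index
                if a != 0:      # skip zero activations as well
                    acc += a * w
        outputs.append(acc)
    return outputs


# Hypothetical example data: 4 input channels, 3 output channels.
ia = [1, 0, 2, 3]
weights = [
    [0, 5, 0, 2],
    [1, 0, 0, 0],
    [0, 0, 4, 0],
]
compressed = compress_weights(weights)
result = match_and_accumulate(ia, compressed, imu_rows=2)
```

Here `result` equals the dense matrix-vector product of `weights` and `ia`, while only pairs in which both the weight and the activation are non-zero contribute a multiplication.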