Publication number: US11928578B2
Publication date: 2024-03-12
Application number: US17108927
Filing date: 2020-12-01
Inventors: Sungju Ryu, Jae-Joon Kim, Youngtaek Oh
CPC classification numbers: G06N3/063, G06F5/06, G06F7/76, G06F17/15, G06F17/16, G06N3/047, G06F5/065
Abstract: A processing method for a sparsity-aware neural processing unit includes receiving a plurality of input activations (IA); obtaining a weight having a non-zero value in each weight output channel; storing the weight and the IA in a memory, and obtaining an input channel index comprising a memory address location in which the weight and the IA are stored; and arranging the non-zero weight of each weight output channel according to a row size of an index matching unit (IMU) and matching the IA to the weight in the IMU, the IMU comprising a buffer memory storing the input channel index.
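
The index-matching step described in the abstract can be illustrated with a small software sketch. This is not the patented hardware design. It is a minimal model under stated assumptions: the input channel index is taken to be the position of a value in a dense list, the IMU is modeled as a plain index comparison, and the names `compress`, `chunk`, `imu_match`, `sparse_dot`, and `row_size` are hypothetical.

```python
from typing import List, Tuple

Pair = Tuple[int, float]  # (input channel index, value); assumed representation


def compress(values: List[float]) -> List[Pair]:
    """Keep only non-zero entries, each tagged with its input channel index."""
    return [(i, v) for i, v in enumerate(values) if v != 0]


def chunk(pairs: List[Pair], row_size: int) -> List[List[Pair]]:
    """Arrange non-zero weights into groups no larger than the assumed IMU row size."""
    return [pairs[i:i + row_size] for i in range(0, len(pairs), row_size)]


def imu_match(weight_rows: List[Pair], ia_cols: List[Pair]) -> List[Tuple[float, float]]:
    """Compare indices and return the (weight, activation) pairs that share a channel."""
    return [(w, a) for wi, w in weight_rows for ai, a in ia_cols if wi == ai]


def sparse_dot(weights: List[float], activations: List[float], row_size: int = 4) -> float:
    """Accumulate products only for matched non-zero weight/activation pairs."""
    w_pairs = compress(weights)
    a_pairs = compress(activations)
    total = 0.0
    for rows in chunk(w_pairs, row_size):
        for w, a in imu_match(rows, a_pairs):
            total += w * a
    return total


# One output channel with eight input channels (illustrative values only).
weights = [0, 2, 0, 3, 0, 0, 1, 0]
activations = [5, 0, 0, 4, 0, 7, 2, 0]
result = sparse_dot(weights, activations, row_size=2)
dense = sum(w * a for w, a in zip(weights, activations))
```

In this sketch the sparse result equals the dense dot product, because every skipped product has at least one zero operand. The `row_size` parameter only changes how the non-zero weights are grouped, not the value that is computed.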