- 专利标题: HYBRID MULTIPY-ACCUMULATION OPERATION WITH COMPRESSED WEIGHTS
-
申请号: US18184101申请日: 2023-03-15
-
公开(公告)号: US20230229917A1公开(公告)日: 2023-07-20
- 发明人: Michael Wu , Arnab Raha , Deepak Abraham Mathaikutty , Nihat Tunali , Martin Langhammer
- 申请人: Intel Corporation
- 申请人地址: US CA Santa Clara
- 专利权人: Intel Corporation
- 当前专利权人: Intel Corporation
- 当前专利权人地址: US CA Santa Clara
- 主分类号: G06N3/08
- IPC分类号: G06N3/08 ; G06F7/544
摘要:
A compute block can perform hybrid multiply-accumulate (MAC) operations. The compute block may include a weight compressing module and a processing element (PE) array. The weight compression module may select a first group of one or more weights and a second group of one or more weights from a weight tensor of a DNN (deep neural network) layer. A weight in the first group is quantized to a power of two value. A weight in the second group is quantized to an integer. The integer and the exponent of the power of two value may be stored in a memory in lieu of the original values of the weights. A PE in the PE array includes a shifter configured to shift an activation of the layer by the exponent of the power of two value and a multiplier configured to multiplying the integer with another activation of the layer.
信息查询