- 专利标题: Flexible, lightweight quantized deep neural networks
-
申请号: US16887988申请日: 2020-05-29
-
公开(公告)号: US11521074B2公开(公告)日: 2022-12-06
- 发明人: Ruizhou Ding , Zeye Liu , Ting-Wu Chin , Diana Marculescu , Ronald D. Blanton
- 申请人: Carnegie Mellon University
- 申请人地址: US PA Pittsburgh
- 专利权人: Carnegie Mellon University
- 当前专利权人: Carnegie Mellon University
- 当前专利权人地址: US PA Pittsburgh
- 代理机构: KDB Firm PLLC
- 主分类号: G06N3/08
- IPC分类号: G06N3/08 ; G06N3/04
摘要:
To improve the throughput and energy efficiency of Deep Neural Networks (DNNs) on customized hardware, lightweight neural networks constrain the weights of DNNs to be a limited combination of powers of 2. In such networks, the multiply-accumulate operation can be replaced with a single shift operation, or two shifts and an add operation. To provide even more design flexibility, the k for each convolutional filter can be optimally chosen instead of being fixed for every filter. The present invention formulates the selection of k to be differentiable and describes model training for determining k-based weights on a per-filter basis. The present invention can achieve higher speeds as compared to lightweight NNs with only minimal accuracy degradation, while also achieving higher computational energy efficiency for ASIC implementation.
公开/授权文献
- US20200380371A1 FLEXIBLE, LIGHTWEIGHT QUANTIZED DEEP NEURAL NETWORKS 公开/授权日:2020-12-03
信息查询