Flexible, lightweight quantized deep neural networks
Abstract:
To improve the throughput and energy efficiency of Deep Neural Networks (DNNs) on customized hardware, lightweight neural networks constrain each weight to a limited combination of powers of 2. In such networks, a multiply-accumulate operation can be replaced with a single shift, or with two shifts and an add. To provide even more design flexibility, the number k of power-of-2 terms can be chosen optimally for each convolutional filter instead of being fixed across all filters. The present invention formulates the selection of k to be differentiable and describes model training for determining k-based weights on a per-filter basis. The present invention achieves higher speeds than existing lightweight neural networks with only minimal accuracy degradation, while also achieving higher computational energy efficiency in ASIC implementations.
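The core idea above can be illustrated with a minimal sketch: greedily approximate a weight as a sum of k signed powers of two, then perform the multiplication with shifts and adds only. This is an assumption-laden illustration (the greedy rounding and the function names are mine), not the patent's differentiable training procedure for selecting k.

```python
import math

def quantize_pow2(w, k):
    """Greedily approximate weight w as a sum of k signed powers of two.

    Returns a list of (sign, exponent) terms such that
    w ~= sum(sign * 2**exponent). Illustrative only; the patent instead
    learns k per filter during training.
    """
    terms = []
    residual = w
    for _ in range(k):
        if residual == 0:
            break
        sign = 1 if residual > 0 else -1
        exp = round(math.log2(abs(residual)))  # nearest power of 2
        terms.append((sign, exp))
        residual -= sign * (2.0 ** exp)
    return terms

def shift_multiply(x, terms):
    """Multiply integer x by the quantized weight using only shifts and adds."""
    acc = 0
    for sign, exp in terms:
        if exp >= 0:
            acc += sign * (x << exp)
        else:
            acc += sign * (x >> -exp)  # negative exponent: right shift
    return acc

# Example: 0.75 = 2**0 - 2**-2, so k=2 terms suffice and
# x * 0.75 becomes (x << 0) - (x >> 2).
terms = quantize_pow2(0.75, k=2)
print(terms)                  # [(1, 0), (-1, -2)]
print(shift_multiply(8, terms))  # 6, i.e. 8 * 0.75
```

With k=1 the multiply collapses to a single shift; with k=2 it costs two shifts and one add, matching the operation counts described in the abstract.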