Deep learning inference efficiency technology with early exit and speculative execution

发明授权

US11562200B2 Deep learning inference efficiency technology with early exit and speculative execution 有权

请登陆查看更多内容

专利标题： Deep learning inference efficiency technology with early exit and speculative execution
申请号： US16266880

申请日： 2019-02-04
公开(公告)号： US11562200B2

公开(公告)日： 2023-01-24
发明人: Haim Barad , Barak Hurwitz , Uzi Sarel , Eran Geva , Eli Kfir , Moshe Island
申请人： Intel Corporation
申请人地址： US CA Santa Clara
专利权人： Intel Corporation
当前专利权人： Intel Corporation
当前专利权人地址： US CA Santa Clara
代理机构： Jordan IP Law, LLC
主分类号： G06N3/04
IPC分类号： G06N3/04 ; G06F30/33

Deep learning inference efficiency technology with early exit and speculative execution

摘要：

Systems, apparatuses and methods may provide for technology that processes an inference workload in a first subset of layers of a neural network that prevents or inhibits data dependent branch operations, conducts an exit determination as to whether an output of the first subset of layers satisfies one or more exit criteria, and selectively bypasses processing of the output in a second subset of layers of the neural network based on the exit determination. The technology may also speculatively initiate the processing of the output in the second subset of layers while the exit determination is pending. Additionally, when the inference workloads include a plurality of batches, the technology may mask one or more of the plurality of batches from processing in the second subset of layers.

公开/授权文献

US20190180168A1 DEEP LEARNING INFERENCE EFFICIENCY TECHNOLOGY WITH EARLY EXIT AND SPECULATIVE EXECUTION 公开/授权日：2019-06-13

信息查询

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/04	..体系结构，例如，互连拓扑