发明授权
- 专利标题: Deep learning inference efficiency technology with early exit and speculative execution
-
申请号: US16266880申请日: 2019-02-04
-
公开(公告)号: US11562200B2公开(公告)日: 2023-01-24
- 发明人: Haim Barad , Barak Hurwitz , Uzi Sarel , Eran Geva , Eli Kfir , Moshe Island
- 申请人: Intel Corporation
- 申请人地址: US CA Santa Clara
- 专利权人: Intel Corporation
- 当前专利权人: Intel Corporation
- 当前专利权人地址: US CA Santa Clara
- 代理机构: Jordan IP Law, LLC
- 主分类号: G06N3/04
- IPC分类号: G06N3/04 ; G06F30/33
摘要:
Systems, apparatuses and methods may provide for technology that processes an inference workload in a first subset of layers of a neural network that prevents or inhibits data dependent branch operations, conducts an exit determination as to whether an output of the first subset of layers satisfies one or more exit criteria, and selectively bypasses processing of the output in a second subset of layers of the neural network based on the exit determination. The technology may also speculatively initiate the processing of the output in the second subset of layers while the exit determination is pending. Additionally, when the inference workloads include a plurality of batches, the technology may mask one or more of the plurality of batches from processing in the second subset of layers.
公开/授权文献
信息查询