Patent search: ap:("Intel Corporation") AND inv:"Haim Barad", page 1

1.

Invention publication
DEEP LEARNING INFERENCE EFFICIENCY TECHNOLOGY WITH EARLY EXIT AND SPECULATIVE EXECUTION (status: under examination, published)

Publication (announcement) number: US20240104916A1

Publication (announcement) date: 2024-03-28

Application number: US18519674

Filing date: 2023-11-27

Applicant: Intel Corporation

Inventors: Haim Barad, Barak Hurwitz, Uzi Sarel, Eran Geva, Eli Kfir, Moshe Island

IPC classification: G06V10/82, G06F30/33, G06N3/04, G06V10/44, G06V10/94

CPC classification: G06V10/82, G06F30/33, G06N3/04, G06V10/454, G06V10/955

Abstract: Systems, apparatuses and methods may provide for technology that processes an inference workload in a first subset of layers of a neural network that prevents or inhibits data dependent branch operations, conducts an exit determination as to whether an output of the first subset of layers satisfies one or more exit criteria, and selectively bypasses processing of the output in a second subset of layers of the neural network based on the exit determination. The technology may also speculatively initiate the processing of the output in the second subset of layers while the exit determination is pending. Additionally, when the inference workloads include a plurality of batches, the technology may mask one or more of the plurality of batches from processing in the second subset of layers.
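The exit determination and selective bypass described in this abstract can be illustrated with a minimal sketch. It is not taken from the patent: the softmax confidence threshold used as the exit criterion, the function names, and the toy layer functions are all assumptions made for illustration.

```python
import math
from typing import Callable, List, Tuple

Vector = List[float]


def softmax(scores: Vector) -> Vector:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def infer_with_early_exit(
    x: Vector,
    first_subset: Callable[[Vector], Vector],
    second_subset: Callable[[Vector], Vector],
    threshold: float = 0.9,  # assumed exit criterion, not from the abstract
) -> Tuple[int, bool]:
    """Return (predicted class index, whether the early exit was taken)."""
    hidden = first_subset(x)
    probs = softmax(hidden)
    confidence = max(probs)
    if confidence >= threshold:
        # Exit criterion satisfied: bypass the second subset of layers.
        return probs.index(confidence), True
    final = second_subset(hidden)
    return final.index(max(final)), False


# Hypothetical stand-ins for the two subsets of layers.
def toy_first_subset(x: Vector) -> Vector:
    return [2.0 * v for v in x]


def toy_second_subset(hidden: Vector) -> Vector:
    return [v + 1.0 for v in hidden]


confident = infer_with_early_exit([5.0, 0.0], toy_first_subset, toy_second_subset)
uncertain = infer_with_early_exit([0.1, 0.0], toy_first_subset, toy_second_subset)
```

In this sketch the first input produces a confident early prediction and skips the second subset, while the second input falls through to full processing.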

2.

Granted invention patent
Deep learning inference efficiency technology with early exit and speculative execution (status: in force)

Publication (announcement) number: US11869232B2

Publication (announcement) date: 2024-01-09

Application number: US18151914

Filing date: 2023-01-09

Applicant: Intel Corporation

Inventors: Haim Barad, Barak Hurwitz, Uzi Sarel, Eran Geva, Eli Kfir, Moshe Island

IPC classification: G06V10/82, G06N3/04, G06F30/33, G06V10/44, G06V10/94

CPC classification: G06V10/82, G06F30/33, G06N3/04, G06V10/454, G06V10/955

Abstract: Systems, apparatuses and methods may provide for technology that processes an inference workload in a first subset of layers of a neural network that prevents or inhibits data dependent branch operations, conducts an exit determination as to whether an output of the first subset of layers satisfies one or more exit criteria, and selectively bypasses processing of the output in a second subset of layers of the neural network based on the exit determination. The technology may also speculatively initiate the processing of the output in the second subset of layers while the exit determination is pending. Additionally, when the inference workloads include a plurality of batches, the technology may mask one or more of the plurality of batches from processing in the second subset of layers.
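The speculative initiation mentioned in this abstract (starting the second subset of layers while the exit determination is still pending) might look roughly like the following sketch. The use of a thread pool, the function names, and the toy exit check are assumptions for illustration only; the abstract does not say how the speculation is scheduled.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Tuple

Vector = List[float]

# Shared worker pool for speculative work (an assumption of this sketch).
_POOL = ThreadPoolExecutor(max_workers=1)


def speculative_infer(
    hidden: Vector,
    exit_check: Callable[[Vector], bool],
    second_subset: Callable[[Vector], Vector],
    executor: ThreadPoolExecutor = _POOL,
) -> Tuple[Vector, bool]:
    """Return (output, whether the early exit was taken)."""
    # Start the second subset before the exit decision is known.
    speculative = executor.submit(second_subset, hidden)
    if exit_check(hidden):
        # Best-effort cancel; if the task already started, its result is ignored.
        speculative.cancel()
        return hidden, True
    # No early exit: the speculative work was useful, so wait for it.
    return speculative.result(), False


# Hypothetical stand-ins for the exit criterion and the second subset of layers.
def toy_exit_check(hidden: Vector) -> bool:
    return max(hidden) >= 0.9


def toy_second_subset(hidden: Vector) -> Vector:
    return [2.0 * v for v in hidden]


early = speculative_infer([0.95, 0.05], toy_exit_check, toy_second_subset)
full = speculative_infer([0.5, 0.5], toy_exit_check, toy_second_subset)
```

The trade-off in this sketch is that speculative work is wasted whenever the exit is taken, in exchange for lower latency when it is not.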

3.

Invention application
DEEP LEARNING INFERENCE EFFICIENCY TECHNOLOGY WITH EARLY EXIT AND SPECULATIVE EXECUTION (status: under examination, published)

Publication (announcement) number: US20190180168A1

Publication (announcement) date: 2019-06-13

Application number: US16266880

Filing date: 2019-02-04

Applicant: Intel Corporation

Inventors: Haim Barad, Barak Hurwitz, Uzi Sarel, Eran Geva, Eli Kfir, Moshe Island

IPC classification: G06N3/04, G06F9/30, G06F17/50

Abstract: Systems, apparatuses and methods may provide for technology that processes an inference workload in a first subset of layers of a neural network that prevents or inhibits data dependent branch operations, conducts an exit determination as to whether an output of the first subset of layers satisfies one or more exit criteria, and selectively bypasses processing of the output in a second subset of layers of the neural network based on the exit determination. The technology may also speculatively initiate the processing of the output in the second subset of layers while the exit determination is pending. Additionally, when the inference workloads include a plurality of batches, the technology may mask one or more of the plurality of batches from processing in the second subset of layers.
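The batch masking in the last sentence of this abstract can be sketched as follows, under the assumption that a mask marks which entries already satisfied the exit criteria so that only the remaining entries are sent to the second subset of layers. The names and the toy second-stage function are hypothetical.

```python
from typing import Callable, List

Vector = List[float]


def masked_second_stage(
    batch: List[Vector],
    exit_mask: List[bool],
    second_subset: Callable[[List[Vector]], List[Vector]],
) -> List[Vector]:
    """Run second_subset only on entries whose mask value is False."""
    active = [i for i, exited in enumerate(exit_mask) if not exited]
    outputs = list(batch)  # masked entries keep their first-stage output
    if active:
        processed = second_subset([batch[i] for i in active])
        for i, out in zip(active, processed):
            outputs[i] = out
    return outputs


# Hypothetical second subset of layers operating on a list of entries.
def toy_second_subset(items: List[Vector]) -> List[Vector]:
    return [[2.0 * v for v in item] for item in items]


batch = [[0.95, 0.05], [0.5, 0.5], [0.1, 0.9]]
# Assumed exit criterion: the largest score is at least 0.9.
exit_mask = [max(item) >= 0.9 for item in batch]
result = masked_second_stage(batch, exit_mask, toy_second_subset)
```

Here only the middle entry is processed by the second stage; the other two are masked out and pass through unchanged.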

4.

Invention publication
DEEP LEARNING INFERENCE EFFICIENCY TECHNOLOGY WITH EARLY EXIT AND SPECULATIVE EXECUTION (status: under examination, published)

Publication (announcement) number: US20230215158A1

Publication (announcement) date: 2023-07-06

Application number: US18151914

Filing date: 2023-01-09

Applicant: Intel Corporation

Inventors: Haim Barad, Barak Hurwitz, Uzi Sarel, Eran Geva, Eli Kfir, Moshe Island

IPC classification: G06V10/82, G06N3/04, G06F30/33, G06V10/44, G06V10/94

CPC classification: G06V10/82, G06N3/04, G06F30/33, G06V10/454, G06V10/955

Abstract: Systems, apparatuses and methods may provide for technology that processes an inference workload in a first subset of layers of a neural network that prevents or inhibits data dependent branch operations, conducts an exit determination as to whether an output of the first subset of layers satisfies one or more exit criteria, and selectively bypasses processing of the output in a second subset of layers of the neural network based on the exit determination. The technology may also speculatively initiate the processing of the output in the second subset of layers while the exit determination is pending. Additionally, when the inference workloads include a plurality of batches, the technology may mask one or more of the plurality of batches from processing in the second subset of layers.

5.

Granted invention patent
Early exit for natural language processing models (status: in force)

Publication (announcement) number: US11544461B2

Publication (announcement) date: 2023-01-03

Application number: US16411763

Filing date: 2019-05-14

Applicant: Intel Corporation

Inventors: Barak Battach, Amit Bleiweiss, Haim Barad

IPC classification: G06F40/30, G06N20/00, G06F40/284, G06F40/216

Abstract: The disclosure provides a natural language processing (NLP) model arranged to operate on two lexicons, where one lexicon is a sub-set of the other lexicon. The NLP model can be arranged to generate output based on the sub-set lexicon and exit processing of the NLP model, to potentially save computation cycles.
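The two-lexicon idea in this abstract can be sketched as scoring a small sub-set lexicon first and scoring the full lexicon only when the sub-set result is not confident enough. The dot-product scoring, the confidence threshold, and the toy word vectors are all assumptions of this sketch; the abstract does not specify how the exit decision is made.

```python
import math
from typing import Dict, List, Tuple

Vector = List[float]


def dot(a: Vector, b: Vector) -> float:
    return sum(x * y for x, y in zip(a, b))


def predict_word(
    hidden: Vector,
    full_lexicon: Dict[str, Vector],
    sub_lexicon: List[str],  # words that also appear in full_lexicon
    threshold: float = 0.8,  # assumed exit criterion
) -> Tuple[str, bool]:
    """Return (predicted word, whether the model exited on the sub-set lexicon)."""
    sub_scores = {w: dot(hidden, full_lexicon[w]) for w in sub_lexicon}
    peak = max(sub_scores.values())
    exps = {w: math.exp(s - peak) for w, s in sub_scores.items()}
    total = sum(exps.values())
    best = max(exps, key=exps.get)
    if exps[best] / total >= threshold:
        # Confident on the small lexicon: skip scoring the full lexicon.
        return best, True
    full_scores = {w: dot(hidden, vec) for w, vec in full_lexicon.items()}
    return max(full_scores, key=full_scores.get), False


# Hypothetical toy lexicon with two-dimensional word vectors.
full_lexicon = {"the": [1.0, 0.0], "cat": [0.0, 1.0], "axolotl": [-1.0, 2.0]}
sub_lexicon = ["the", "cat"]

common = predict_word([4.0, 0.0], full_lexicon, sub_lexicon)
rare = predict_word([0.0, 0.5], full_lexicon, sub_lexicon)
```

In this toy setup the first query resolves within the sub-set lexicon, while the second falls back to the full lexicon and selects a word that the sub-set does not contain.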

6.

Granted invention patent
Deep learning inference efficiency technology with early exit and speculative execution (status: in force)

Publication (announcement) number: US11562200B2

Publication (announcement) date: 2023-01-24

Application number: US16266880

Filing date: 2019-02-04

Applicant: Intel Corporation

Inventors: Haim Barad, Barak Hurwitz, Uzi Sarel, Eran Geva, Eli Kfir, Moshe Island

IPC classification: G06N3/04, G06F30/33

Abstract: Systems, apparatuses and methods may provide for technology that processes an inference workload in a first subset of layers of a neural network that prevents or inhibits data dependent branch operations, conducts an exit determination as to whether an output of the first subset of layers satisfies one or more exit criteria, and selectively bypasses processing of the output in a second subset of layers of the neural network based on the exit determination. The technology may also speculatively initiate the processing of the output in the second subset of layers while the exit determination is pending. Additionally, when the inference workloads include a plurality of batches, the technology may mask one or more of the plurality of batches from processing in the second subset of layers.

7.

Invention application
EARLY EXIT FOR NATURAL LANGUAGE PROCESSING MODELS (status: under examination, published)

Publication (announcement) number: US20190266236A1

Publication (announcement) date: 2019-08-29

Application number: US16411763

Filing date: 2019-05-14

Applicant: Intel Corporation

Inventors: Barak Battach, Amit Bleiweiss, Haim Barad

IPC classification: G06F17/27

Abstract: The disclosure provides a natural language processing (NLP) model arranged to operate on two lexicons, where one lexicon is a sub-set of the other lexicon. The NLP model can be arranged to generate output based on the sub-set lexicon and exit processing of the NLP model, to potentially save computation cycles.
