-
公开(公告)号:US11869232B2
公开(公告)日:2024-01-09
申请号:US18151914
申请日:2023-01-09
Applicant: Intel Corporation
Inventor: Haim Barad , Barak Hurwitz , Uzi Sarel , Eran Geva , Eli Kfir , Moshe Island
CPC classification number: G06V10/82 , G06F30/33 , G06N3/04 , G06V10/454 , G06V10/955
Abstract: Systems, apparatuses and methods may provide for technology that processes an inference workload in a first subset of layers of a neural network that prevents or inhibits data dependent branch operations, conducts an exit determination as to whether an output of the first subset of layers satisfies one or more exit criteria, and selectively bypasses processing of the output in a second subset of layers of the neural network based on the exit determination. The technology may also speculatively initiate the processing of the output in the second subset of layers while the exit determination is pending. Additionally, when the inference workloads include a plurality of batches, the technology may mask one or more of the plurality of batches from processing in the second subset of layers.
-
2.
公开(公告)号:US20240104916A1
公开(公告)日:2024-03-28
申请号:US18519674
申请日:2023-11-27
Applicant: Intel Corporation
Inventor: Haim Barad , Barak Hurwitz , Uzi Sarel , Eran Geva , Eli Kfir , Moshe Island
CPC classification number: G06V10/82 , G06F30/33 , G06N3/04 , G06V10/454 , G06V10/955
Abstract: Systems, apparatuses and methods may provide for technology that processes an inference workload in a first subset of layers of a neural network that prevents or inhibits data dependent branch operations, conducts an exit determination as to whether an output of the first subset of layers satisfies one or more exit criteria, and selectively bypasses processing of the output in a second subset of layers of the neural network based on the exit determination. The technology may also speculatively initiate the processing of the output in the second subset of layers while the exit determination is pending. Additionally, when the inference workloads include a plurality of batches, the technology may mask one or more of the plurality of batches from processing in the second subset of layers.
-
3.
公开(公告)号:US20190180168A1
公开(公告)日:2019-06-13
申请号:US16266880
申请日:2019-02-04
Applicant: Intel Corporation
Inventor: Haim Barad , Barak Hurwitz , Uzi Sarel , Eran Geva , Eli Kfir , Moshe Island
Abstract: Systems, apparatuses and methods may provide for technology that processes an inference workload in a first subset of layers of a neural network that prevents or inhibits data dependent branch operations, conducts an exit determination as to whether an output of the first subset of layers satisfies one or more exit criteria, and selectively bypasses processing of the output in a second subset of layers of the neural network based on the exit determination. The technology may also speculatively initiate the processing of the output in the second subset of layers while the exit determination is pending. Additionally, when the inference workloads include a plurality of batches, the technology may mask one or more of the plurality of batches from processing in the second subset of layers.
-
公开(公告)号:US11562200B2
公开(公告)日:2023-01-24
申请号:US16266880
申请日:2019-02-04
Applicant: Intel Corporation
Inventor: Haim Barad , Barak Hurwitz , Uzi Sarel , Eran Geva , Eli Kfir , Moshe Island
Abstract: Systems, apparatuses and methods may provide for technology that processes an inference workload in a first subset of layers of a neural network that prevents or inhibits data dependent branch operations, conducts an exit determination as to whether an output of the first subset of layers satisfies one or more exit criteria, and selectively bypasses processing of the output in a second subset of layers of the neural network based on the exit determination. The technology may also speculatively initiate the processing of the output in the second subset of layers while the exit determination is pending. Additionally, when the inference workloads include a plurality of batches, the technology may mask one or more of the plurality of batches from processing in the second subset of layers.
-
公开(公告)号:US12211260B2
公开(公告)日:2025-01-28
申请号:US18519674
申请日:2023-11-27
Applicant: Intel Corporation
Inventor: Haim Barad , Barak Hurwitz , Uzi Sarel , Eran Geva , Eli Kfir , Moshe Island
Abstract: Systems, apparatuses and methods may provide for technology that processes an inference workload in a first subset of layers of a neural network that prevents or inhibits data dependent branch operations, conducts an exit determination as to whether an output of the first subset of layers satisfies one or more exit criteria, and selectively bypasses processing of the output in a second subset of layers of the neural network based on the exit determination. The technology may also speculatively initiate the processing of the output in the second subset of layers while the exit determination is pending. Additionally, when the inference workloads include a plurality of batches, the technology may mask one or more of the plurality of batches from processing in the second subset of layers.
-
6.
公开(公告)号:US20230215158A1
公开(公告)日:2023-07-06
申请号:US18151914
申请日:2023-01-09
Applicant: Intel Corporation
Inventor: Haim Barad , Barak Hurwitz , Uzi Sarel , Eran Geva , Eli Kfir , Moshe Island
CPC classification number: G06V10/82 , G06N3/04 , G06F30/33 , G06V10/454 , G06V10/955
Abstract: Systems, apparatuses and methods may provide for technology that processes an inference workload in a first subset of layers of a neural network that prevents or inhibits data dependent branch operations, conducts an exit determination as to whether an output of the first subset of layers satisfies one or more exit criteria, and selectively bypasses processing of the output in a second subset of layers of the neural network based on the exit determination. The technology may also speculatively initiate the processing of the output in the second subset of layers while the exit determination is pending. Additionally, when the inference workloads include a plurality of batches, the technology may mask one or more of the plurality of batches from processing in the second subset of layers.
-
-
-
-
-