-
公开(公告)号:US20210350585A1
公开(公告)日:2021-11-11
申请号:US17344639
申请日:2021-06-10
Applicant: INTEL CORPORATION
Inventor: Tomer Bar-On , Jacob Subag , Yaniv Fais , Jeremie Dreyfuss , Gal Novik , Gal Leibovich , Tomer Schwartz , Ehud Cohen , Lev Faivishevsky , Uzi Sarel , Amitai Armon , Yahav Shadmiy
IPC: G06T9/00 , H04N19/42 , G06N3/04 , H04N19/436 , G06N3/08
Abstract: In an example, an apparatus comprises logic, at least partially including hardware logic, to implement a lossy compression algorithm which utilizes a data transform and quantization process to compress data in a convolutional neural network (CNN) layer. Other embodiments are also disclosed and claimed.
-
公开(公告)号:US11113053B2
公开(公告)日:2021-09-07
申请号:US16579394
申请日:2019-09-23
Applicant: Intel Corporation
Inventor: Asit K. Mishra , Edward T. Grochowski , Jonathan D. Pearce , Deborah T. Marr , Ehud Cohen , Elmoustapha Ould-Ahmed-Vall , Jesus Corbal San Adrian , Robert Valentine , Mark J. Charney , Christopher J. Hughes , Milind B. Girkar
IPC: G06F9/30
Abstract: A processor includes a decode unit to decode an instruction that is to indicate a first source packed data operand that is to include at least four data elements, to indicate a second source packed data operand that is to include at least four data elements, and to indicate one or more destination storage locations. The execution unit, in response to the instruction, is to store at least one result mask operand in the destination storage location(s). The at least one result mask operand is to include a different mask element for each corresponding data element in one of the first and second source packed data operands in a same relative position. Each mask element is to indicate whether the corresponding data element in said one of the source packed data operands equals any of the data elements in the other of the source packed data operands.
-
公开(公告)号:US11037330B2
公开(公告)日:2021-06-15
申请号:US15482725
申请日:2017-04-08
Applicant: Intel Corporation
Inventor: Tomer Bar-On , Jacob Subag , Yaniv Fais , Jeremie Dreyfuss , Gal Novik , Gal Leibovich , Tomer Schwartz , Ehud Cohen , Lev Faivishevsky , Uzi Sarel , Amitai Armon , Yahav Shadmiy
IPC: G06T9/00 , H04N19/42 , G06N3/04 , H04N19/436 , G06N3/08
Abstract: In an example, an apparatus comprises logic, at least partially including hardware logic, to implement a lossy compression algorithm which utilizes a data transform and quantization process to compress data in a convolutional neural network (CNN) layer. Other embodiments are also disclosed and claimed.
-
14.
公开(公告)号:US20210049804A1
公开(公告)日:2021-02-18
申请号:US17006253
申请日:2020-08-28
Applicant: Intel Corporation
Inventor: Uzi Sarel , Ehud Cohen , Tomer Schwartz , Amitai Armon , Yahav Shadmiy , Itamar Ben-Ari , Amit Bleiweiss , Lev Faivishevsky , Tomer Bar-On , Yaniv Fais , Jacob Subag , Michael Behar , Guy Jacob , Gal Leibovich , Jeremie Dreyfuss
Abstract: In an example, an apparatus comprises a plurality of execution units; and logic, at least partially including hardware logic, to determine a sub-graph of a network that can be executed in a frequency domain and apply computations in the sub-graph in the frequency domain. Other embodiments are also disclosed and claimed.
-
公开(公告)号:US20190102671A1
公开(公告)日:2019-04-04
申请号:US15720982
申请日:2017-09-29
Applicant: Intel Corporation
Inventor: Ehud Cohen , Moshe Maor , Ashutosh Parkhi , Michael Behar , Yaniv Fais
Abstract: A convolutional neural network (CNN) accelerator, including: a CNN circuit for performing a multiple-layer CNN computation, wherein the multiple layers are to receive an input feature according to an input feature map (IFM) and a weight matrix per output feature, wherein an output of a first layer provides an input for a next layer; and a mapping circuit to access a three-dimensional input matrix stored as a Z-major matrix; wherein the CNN circuit is to perform an inner-product direct convolution on the Z-major matrix, wherein the direct convolution lacks a lowering operation.
-
16.
公开(公告)号:US10120691B2
公开(公告)日:2018-11-06
申请号:US15214270
申请日:2016-07-19
Applicant: Intel Corporation
Inventor: Boris Ginzburg , Ronny Ronen , Eliezer Weissmann , Karthikeyan Vaithianathan , Ehud Cohen
Abstract: In one embodiment, a processor includes an accelerator, a decoder to decode a first instruction into a decoded first instruction, and a second instruction into a decoded second instruction, and an execution unit to execute the first decoded instruction to, for a thread executing on the accelerator that is to be placed in an inactive state, cause a save of context information for the thread, and a save of a vector identifying the accelerator corresponding to the context information, and execute the second decoded instruction to read the vector to determine the accelerator to restore saved context information into for the thread, read the saved context information, and restore the saved context information into the accelerator.
-
公开(公告)号:US20180314932A1
公开(公告)日:2018-11-01
申请号:US15499898
申请日:2017-04-28
Applicant: Intel Corporation
Inventor: Tomer Schwartz , Ehud Cohen , Uzi Sarel , Amitai Armon , Yaniv Fais , Lev Faivishevsky , Amit Bleiweiss , Yahav Shadmiy , Jacob Subag
CPC classification number: G06N3/06 , G06N3/0454
Abstract: In an example, an apparatus comprises a plurality of execution units and logic, at least partially including hardware logic, to generate synthetic data for a generative adversarial network (GAN) using the plurality of execution units. Other embodiments are also disclosed and claimed.
-
公开(公告)号:US12189479B2
公开(公告)日:2025-01-07
申请号:US17993591
申请日:2022-11-23
Applicant: Intel Corporation
Inventor: Theodros Yigzaw , Geeyarpuram N. Santhanakrishnan , Ganapati N. Srinivasa , Jose A. Vargas , Hisham Shafi , Michael Mishaeli , Ehud Cohen , Zeev Sperber , Shlomo Raikin , Mohan J. Kumar , Julius Y. Mandelblat
Abstract: An apparatus and method are described for detecting and correcting data fetch errors within a processor core. For example, one embodiment of an instruction processing apparatus for detecting and recovering from data fetch errors comprises: at least one processor core having a plurality of instruction processing stages including a data fetch stage and a retirement stage; and error processing logic in communication with the processing stages to perform the operations of: detecting an error associated with data in response to a data fetch operation performed by the data fetch stage; and responsively performing one or more operations to ensure that the error does not corrupt an architectural state of the processor core within the retirement stage.
-
公开(公告)号:US20240160910A1
公开(公告)日:2024-05-16
申请号:US18527734
申请日:2023-12-04
Applicant: Intel Corporation
Inventor: Uzi Sarel , Ehud Cohen , Tomer Schwartz , Amitai Armon , Yahav Shadmiy , Amit Bleiweiss , Gal Leibovich , Jeremie Dreyfuss , Lev Faivishevsky , Tomer Bar-On , Yaniv Fais , Jacob Subag
CPC classification number: G06N3/063 , G06F9/30014 , G06F9/30025 , G06F9/30043 , G06N3/044 , G06N3/045 , G06N3/084
Abstract: In an example, an apparatus comprises a plurality of execution units comprising at least a first type of execution unit and a second type of execution unit and logic, at least partially including hardware logic, to expose embedded cast operations in at least one of a load instruction or a store instruction; determine a target precision level for the cast operations; and load the cast operations at the target precision level. Other embodiments are also disclosed and claimed.
-
公开(公告)号:US11620766B2
公开(公告)日:2023-04-04
申请号:US17344639
申请日:2021-06-10
Applicant: INTEL CORPORATION
Inventor: Tomer Bar-On , Jacob Subag , Yaniv Fais , Jeremie Dreyfuss , Gal Novik , Gal Leibovich , Tomer Schwartz , Ehud Cohen , Lev Faivishevsky , Uzi Sarel , Amitai Armon , Yahav Shadmiy
Abstract: In an example, an apparatus comprises logic, at least partially including hardware logic, to implement a lossy compression algorithm which utilizes a data transform and quantization process to compress data in a convolutional neural network (CNN) layer.
-
-
-
-
-
-
-
-
-