Patent search ap:("Intel Corporation") AND inv:"Yuval Yosef" Page 2

11.

Granted invention patent
Apparatus and method for invocation of a multi threaded accelerator (in force)

Publication number: US10346195B2

Publication date: 2019-07-09

Application number: US13730971

Filing date: 2012-12-29

Applicant: Intel Corporation

Inventor: Oren Ben-Kiki, Ilan Pardo, Eliezer Weissmann, Robert Valentine, Yuval Yosef

IPC: G06F9/46, G06F9/38, G06F9/48

Abstract: A processor is described having logic circuitry of a general purpose CPU core to save multiple copies of context of a thread of the general purpose CPU core to prepare multiple micro-threads of a multi-threaded accelerator for execution to accelerate operations for the thread through parallel execution of the micro-threads.

12.

Granted invention patent
Apparatus and method for a hybrid latency-throughput processor (in force)

Publication number: US10255077B2

Publication date: 2019-04-09

Application number: US15226875

Filing date: 2016-08-02

Applicant: Intel Corporation

Inventor: Oren Ben-Kiki, Yuval Yosef, Ilan Pardo, Dror Markovich

IPC: G06F9/38, G06F9/30, G06F9/46, G06F15/80, G06F15/78

Abstract: An apparatus and method are described for executing both latency-optimized execution logic and throughput-optimized execution logic on a processing device. For example, a processor according to one embodiment comprises: latency-optimized execution logic to execute a first type of program code; throughput-optimized execution logic to execute a second type of program code, wherein the first type of program code and the second type of program code are designed for the same instruction set architecture; logic to identify the first type of program code and the second type of program code within a process and to distribute the first type of program code for execution on the latency-optimized execution logic and the second type of program code for execution on the throughput-optimized execution logic.

13.

Granted invention patent
Processing core having shared front end unit (in force)

Publication number: US10140129B2

Publication date: 2018-11-27

Application number: US13730719

Filing date: 2012-12-28

Applicant: Intel Corporation

Inventor: Ilan Pardo, Dror Markovich, Oren Ben-Kiki, Yuval Yosef

IPC: G06F9/38, G06F9/30

Abstract: A processor having one or more processing cores is described. Each of the one or more processing cores has front end logic circuitry and a plurality of processing units. The front end logic circuitry is to fetch respective instructions of threads and decode the instructions into respective micro-code and input operand and resultant addresses of the instructions. Each of the plurality of processing units is to be assigned at least one of the threads, is coupled to said front end unit, and has a respective buffer to receive and store microcode of its assigned at least one of the threads. Each of the plurality of processing units also comprises: i) at least one set of functional units corresponding to a complete instruction set offered by the processor, the at least one set of functional units to execute its respective processing unit's received microcode; ii) registers coupled to the at least one set of functional units to store operands and resultants of the received microcode; iii) data fetch circuitry to fetch input operands for the at least one functional units' execution of the received microcode.

14.

Granted invention patent
Apparatus and method for low-latency invocation of accelerators (in force)

Publication number: US10095521B2

Publication date: 2018-10-09

Application number: US15145748

Filing date: 2016-05-03

Applicant: Intel Corporation

Inventor: Oren Ben-Kiki, Ilan Pardo, Robert Valentine, Eliezer Weissmann, Dror Markovich, Yuval Yosef

IPC: G06F9/30, G06F9/38, G06F11/07, G06F9/54, G06F12/0875

Abstract: An apparatus and method are described for providing low-latency invocation of accelerators. For example, a processor according to one embodiment comprises execution logic to execute a plurality of instructions including an accelerator invocation instruction to invoke one or more accelerator commands. The accelerator invocation instruction stores command data specifying the command within a command register. One or more accelerators read the command data from the command register and responsively attempt to execute the command identified by the command data. Upon a switch from a first context to a second context, an accelerator context save/restore pointer identifies a region within system memory where the accelerator is to save its state and later the accelerator context save/restore pointer aids in restoring its state upon returning to the first context.

15.

Invention application
Apparatus and Method for a Hybrid Latency-Throughput Processor (pending, published)

Publication number: US20160342419A1

Publication date: 2016-11-24

Application number: US15226875

Filing date: 2016-08-02

Applicant: Intel Corporation

Inventor: Oren Ben-Kiki, Yuval Yosef, Ilan Pardo, Dror Markovich

IPC: G06F9/38, G06F9/30, G06F15/80

CPC classification number: G06F9/3851, G06F9/30079, G06F9/3836, G06F9/3855, G06F9/3861, G06F9/3877, G06F9/46, G06F15/7867, G06F15/7892, G06F15/80

Abstract: An apparatus and method are described for executing both latency-optimized execution logic and throughput-optimized execution logic on a processing device. For example, a processor according to one embodiment comprises: latency-optimized execution logic to execute a first type of program code; throughput-optimized execution logic to execute a second type of program code, wherein the first type of program code and the second type of program code are designed for the same instruction set architecture; logic to identify the first type of program code and the second type of program code within a process and to distribute the first type of program code for execution on the latency-optimized execution logic and the second type of program code for execution on the throughput-optimized execution logic.

16.

Granted invention patent
Instruction set architecture-based inter-sequencer communications with a heterogeneous resource (in force)

Publication number: US09459874B2

Publication date: 2016-10-04

Application number: US14541933

Filing date: 2014-11-14

Applicant: Intel Corporation

Inventor: Hong Wang, John Shen, Hong Jiang, Richard Hankins, Per Hammarlund, Dion Rodgers, Gautham Chinya, Baiju Patel, Shiv Kaushik, Bryant Bigbee, Gad Sheaffer, Yoav Talgam, Yuval Yosef, James P. Held

IPC: G06F9/38, G06F9/30, G06T1/20

CPC classification number: G06F9/30145, G06F9/30, G06F9/30181, G06F9/3877, G06F9/3879, G06T1/20

Abstract: In one embodiment, the present invention includes a method for directly communicating between an accelerator and an instruction sequencer coupled thereto, where the accelerator is a heterogeneous resource with respect to the instruction sequencer. An interface may be used to provide the communication between these resources. Via such a communication mechanism a user-level application may directly communicate with the accelerator without operating system support. Further, the instruction sequencer and the accelerator may perform operations in parallel. Other embodiments are described and claimed.
