-
公开(公告)号:US10459858B2
公开(公告)日:2019-10-29
申请号:US15804939
申请日:2017-11-06
Applicant: INTEL CORPORATION
Inventor: Hong Wang , Per Hammarlund , Xiang Zou , John P. Shen , Xinmin Tian , Milind Girkar , Perry H. Wang , Piyush N. Desai
Abstract: Method, apparatus, and program means for a programmable event driven yield mechanism that may activate other threads. In one embodiment, an apparatus includes execution resources to execute a plurality of instructions and a monitor to detect a condition indicating a low level of progress. The monitor can disrupt processing of a program by transferring to a handler in response to detecting the condition indicating a low level of progress. In another embodiment, thread switch logic may be coupled to a plurality of event monitors which monitor events within the multithreading execution logic. The thread switch logic switches threads based at least partially on a programmable condition of one or more of the performance monitors.
-
公开(公告)号:US20190243684A1
公开(公告)日:2019-08-08
申请号:US15890984
申请日:2018-02-07
Applicant: Intel Corporation
Inventor: Pooja Roy , Jayesh Gaur , Sreenivas Subramoney , Zeev Sperber , Alexandr Titov , Lihu Rappoport , Stanislav Shwartsman , Hong Wang , Adi Yoaz , Ronak Singhal , Robert S. Chappell
Abstract: A processor including an execution unit, an instruction scheduler circuit to identify a first instruction of an instruction stream, identify a second instruction on which execution of the first instruction depends, and assign a first dispatch priority value to the first instruction and the second instruction, and a dispatch circuit to dispatch, based on the first dispatch priority value, the first instruction and the second instruction to an instruction execution circuit.
-
公开(公告)号:US10331582B2
公开(公告)日:2019-06-25
申请号:US15430765
申请日:2017-02-13
Applicant: INTEL CORPORATION
Inventor: Ishwar S. Bhati , Huichu Liu , Jayesh Gaur , Kunal Korgaonkar , Sasikanth Manipatruni , Sreenivas Subramoney , Tanay Karnik , Hong Wang , Ian A. Young
IPC: G06F13/16 , G06F12/0811
Abstract: A processor includes a processing core and a cache controller including a read queue and a separate write queue. The read queue is to buffer read requests of the processing core to a non-volatile memory, last level cache (NVM-LLC), and the write queue is to buffer write requests to the NVM-LLC. The cache controller is to detect whether the write queue is full. The cache controller further prioritizes a first order of sending requests to the NVM-LLC when the write queue contains an empty slot, the first order specifying a first pattern of sending the read requests before the write requests, and prioritizes a second order of sending requests to the NVM-LLC in response to a determination that the write queue is full, the second order specifying a second pattern of alternating between sending a write request from the write queue and a read request from the read queue.
-
公开(公告)号:US20190004802A1
公开(公告)日:2019-01-03
申请号:US15637562
申请日:2017-06-29
Applicant: Intel Corporation
Inventor: Stephen J. Tarsa , Gokce Keskin , Gautham N. Chinya , Hong Wang
Abstract: A processor, including: an execution unit including branching circuitry; a branch predictor, including a hard-to-predict (HTP) branch filter to identify an HTP branch; and a special branch predictor to receive identification of an HTP branch from the HTP branch filter, the special branch predictor including a convolutional neural network (CNN) branch predictor to predict a branching action for the HTP branch.
-
公开(公告)号:US20180321936A1
公开(公告)日:2018-11-08
申请号:US15943609
申请日:2018-04-02
Applicant: Intel Corporation
Inventor: Edward Grochowski , Hong Wang , John P. Shen , Perry H. Wang , Jamison D. Collins , James Held , Partha Kundu , Raya Leviathan , Tin-Fook Ngai
CPC classification number: G06F9/30003 , G06F9/30087 , G06F9/3009 , G06F9/30101 , G06F9/3013 , G06F9/384 , G06F9/3851
Abstract: A method and system to provide user-level multithreading are disclosed. The method according to the present techniques comprises receiving programming instructions to execute one or more shared resource threads (shreds) via an instruction set architecture (ISA). One or more instruction pointers are configured via the ISA; and the one or more shreds are executed simultaneously with a microprocessor, wherein the microprocessor includes multiple instruction sequencers.
-
公开(公告)号:US20180307484A1
公开(公告)日:2018-10-25
申请号:US15900030
申请日:2018-02-20
Applicant: Intel Corporation
Inventor: Ed Grochowski , Hong Wang , John P. Shen , Perry H. Wang , Jamison D. Collins , James Held , Partha Kundu , Raya Leviathan , Tin-Fook Ngai
CPC classification number: G06F9/30003 , G06F9/30087 , G06F9/3009 , G06F9/30101 , G06F9/3013 , G06F9/384 , G06F9/3851
Abstract: A method and system to provide user-level multithreading are disclosed. The method according to the present techniques comprises receiving programming instructions to execute one or more shared resource threads (shreds) via an instruction set architecture (ISA). One or more instruction pointers are configured via the ISA; and the one or more shreds are executed simultaneously with a microprocessor, wherein the microprocessor includes multiple instruction sequencers.
-
公开(公告)号:US09785576B2
公开(公告)日:2017-10-10
申请号:US14227178
申请日:2014-03-27
Applicant: Intel Corporation
Inventor: Thiam Wah Loh , Per Hammarlund , Andreas Wasserbauer , Swee Chong Peter Kuan , Eckhard Delfs , Deepak A. Mathaikutty , Stephen J. Robinson , Gautham N. Chinya , Perry H. Wang , Chee Weng Tan , Hong Wang , Reza Fortas
CPC classification number: G06F12/1408 , G06F12/1491 , G06F21/10 , G06F21/575 , G06F2212/1052 , G06F2221/032 , Y02D10/13
Abstract: Systems and methods for employing hardware-assisted virtualization for implementing a secure video output path. An example processing system comprises: a memory; a shared interconnect; and a processing core communicatively coupled to the memory via the shared interconnect, the processing core to: initialize a first virtual machine and a second virtual machine; responsive to receiving a memory access transaction initiated by the first virtual machine to access a memory buffer, tag the memory access transaction with an identifier of the first virtual machine; and responsive to receiving a digital content decoder access transaction initiated by the second virtual machine, tag the digital decoder access transaction with an identifier of the second virtual machine.
-
公开(公告)号:US20170091655A1
公开(公告)日:2017-03-30
申请号:US14865124
申请日:2015-09-25
Applicant: Intel Corporation
Inventor: Tsung-Han Lin , Gokce Keskin , Hsiang-Tsung Kung , She-Hwa Yen , Hong Wang
IPC: G06N99/00
CPC classification number: G06N20/00 , G06F9/3836 , G06F15/76
Abstract: A processor includes a front end to decode an instruction, an allocator to pass the instruction to a nearest neighbor logic unit (NNLU) to execute the instruction, and a retirement unit to retire the instruction. The NNLU includes logic to determine input of the instruction for which nearest neighbors will be calculated, transform the input, retrieve candidate atoms for which the nearest neighbors will be calculated, compute distance between the candidate atoms and the input, and determine the nearest neighbors for the input based upon the computed distance.
-
公开(公告)号:US20160274910A1
公开(公告)日:2016-09-22
申请号:US15166469
申请日:2016-05-27
Applicant: Intel Corporation
Inventor: Gautham Chinya , Hong Wang , Prashant Sethi , Shivnandan Kaushik , Bryant Bigbee , John Shen , Richard Hankins , Xiang Zou , Baiju V. Patel , Jason W. Brandt , Anil Aggarwal , John L. Reid
CPC classification number: G06F9/3005 , G06F9/3009 , G06F9/3851 , G06F9/3861 , G06F9/3877 , G06F9/3885 , G06F9/461
Abstract: Embodiments of the invention provide a method of creating, based on an operating-system-scheduled thread running on an operating-system-visible sequencer and using an instruction set extension, a persistent user-level thread to run on an operating-system-sequestered sequencer independently of context switch activities on the operating-system-scheduled thread. The operating-system-scheduled thread and the persistent user-level thread may share a common virtual address space. Embodiments of the invention may also provide a method of causing a service thread running on an additional operating-system-visible sequencer to provide operating system services to the persistent user-level thread. Embodiments of the invention may further provide apparatus, system, and machine-readable medium thereof.
-
公开(公告)号:US09442721B2
公开(公告)日:2016-09-13
申请号:US13722481
申请日:2012-12-20
Applicant: Intel Corporation
Inventor: Edward T. Grochowski , Hong Wang , John P. Shen , Perry H. Wang , Jamison D. Colins , James P. Held , Partha Kundu , Raya Leviathan , Tin-Fook Ngai
CPC classification number: G06F9/30003 , G06F9/30087 , G06F9/3009 , G06F9/30101 , G06F9/3013 , G06F9/384 , G06F9/3851
Abstract: A method and system to provide user-level multithreading are disclosed. The method according to the present techniques comprises receiving programming instructions to execute one or more shared resource threads (shreds) via an instruction set architecture (ISA). One or more instruction pointers are configured via the ISA; and the one or more shreds are executed simultaneously with a microprocessor, wherein the microprocessor includes multiple instruction sequencers.
Abstract translation: 公开了提供用户级多线程的方法和系统。 根据本技术的方法包括接收经由指令集架构(ISA)执行一个或多个共享资源线程(碎片)的编程指令。 一个或多个指令指针通过ISA配置; 并且一个或多个碎片与微处理器同时执行,其中微处理器包括多个指令定序器。
-
-
-
-
-
-
-
-
-