Patent search ap:("INTEL CORPORATION") AND inv:"Fernando Latorre" Page 2

11.

发明授权
Instruction and logic for support of code modification in translation lookaside buffers 有权
Title translation: 用于支持翻译后备缓冲区中的代码修改的指令和逻辑

公开(公告)号：US09582432B2

公开(公告)日：2017-02-28

申请号：US15178008

申请日：2016-06-09

Applicant: Intel Corporation

Inventor： Jaroslaw Topp , Niranjan L. Cooray , Fernando Latorre

IPC: G06F12/00 , G06F13/00 , G06F12/10 , G06F9/30

CPC classification number: G06F12/1027 , G06F9/30003 , G06F9/30181 , G06F9/30185 , G06F9/3877 , G06F9/3887 , G06F12/0815 , G06F12/0888 , G06F12/1081 , G06F2212/1016 , G06F2212/30 , G06F2212/302 , G06F2212/50 , G06F2212/68

Abstract: A processor includes a core with logic to execute a translated instruction. The translated instruction is translated from an instruction stored in a memory location. The processor further includes a translation lookaside buffer including logic to store translation indicators from a physical map. Each translation indicator indicates whether a corresponding memory location includes translated code to be protected. The processor further includes a translation indicator agent including logic to determine whether the buffer indicates whether the memory location has been modified subsequent to translation of the instruction.

Abstract translation: 处理器包括具有执行翻译指令的逻辑的核心。转换后的指令从存储在存储单元中的指令转换。处理器还包括翻译后备缓冲器，其包括用于存储来自物理图的转换指示符的逻辑。每个翻译指示符指示对应的存储器位置是否包括要保护的翻译代码。处理器还包括翻译指示剂代理，其包括用于确定缓冲器是否指示在指令的翻译之后是否已经修改了存储器位置的逻辑。

12.

发明授权
Two-level cache locking mechanism 有权
Title translation: 两级缓存锁定机制

公开(公告)号：US09558121B2

公开(公告)日：2017-01-31

申请号：US13729840

申请日：2012-12-28

Applicant: INTEL CORPORATION

Inventor： Li-Gao Zei , Fernando Latorre , Steffen Kosinski , Jaroslaw Topp , Varun Mohandru , Lutz Naethke

IPC: G06F12/00 , G06F12/08 , G06F12/10

CPC classification number: G06F12/0846 , G06F12/0864 , G06F12/1063

Abstract: A virtually tagged cache may be configured to index virtual address entries in the cache into lockable sets based on a page offset value. When a memory operation misses on the virtually tagged cache, only the one set of virtual address entries with the same page offset may be locked. Thereafter, this general lock may be released and only an address stored in the physical tag array matching the physical address and a virtual address in the virtual tag array corresponding to the matching address stored in the physical tag array may be locked to reduce the amount and duration of locked addresses. The machine may be stalled only if a particular memory address request hits and/or tries to access one or more entries in a locked set. Devices, systems, methods, and computer readable media are provided.

Abstract translation: 虚拟标记的高速缓存可以被配置为基于页面偏移值将高速缓存中的虚拟地址条目索引到可锁定集合。当内存操作错过虚拟标记的缓存时，只有一组具有相同页偏移量的虚拟地址条目可能被锁定。此后，可以解除该通用锁定，并且仅锁定与物理地址匹配的物理标签阵列中存储的地址和与物理标签阵列中存储的匹配地址相对应的虚拟标签阵列中的虚拟地址，以减少数量和锁定地址的持续时间。只有当特定的存储器地址请求命中和/或尝试访问锁定集中的一个或多个条目时，才可能停止该机器。提供了设备，系统，方法和计算机可读介质。

13.

发明授权
Method and apparatus for distributed and cooperative computation in artificial neural networks 有权

公开(公告)号：US12032653B2

公开(公告)日：2024-07-09

申请号：US17306877

申请日：2021-05-03

Applicant: Intel Corporation

Inventor： Frederico C. Pratas , Ayose J. Falcon , Marc Lupon , Fernando Latorre , Pedro Lopez , Enric Herrero Abellanas , Georgios Tournavitis

IPC: G06F17/15 , G06F12/0875 , G06N3/04 , G06N3/063

CPC classification number: G06F17/153 , G06F12/0875 , G06N3/04 , G06N3/063 , G06F2212/1024

Abstract: An apparatus and method are described for distributed and cooperative computation in artificial neural networks. For example, one embodiment of an apparatus comprises: an input/output (I/O) interface; a plurality of processing units communicatively coupled to the I/O interface to receive data for input neurons and synaptic weights associated with each of the input neurons, each of the plurality of processing units to process at least a portion of the data for the input neurons and synaptic weights to generate partial results; and an interconnect communicatively coupling the plurality of processing units, each of the processing units to share the partial results with one or more other processing units over the interconnect, the other processing units using the partial results to generate additional partial results or final results. The processing units may share data including input neurons and weights over the shared input bus.

14.

发明授权
Method and apparatus for distributed and cooperative computation in artificial neural networks 有权

公开(公告)号：US10997273B2

公开(公告)日：2021-05-04

申请号：US15521856

申请日：2015-11-19

Applicant: Intel Corporation

Inventor： Frederico C. Pratas , Ayose J. Falcon , Marc Lupon , Fernando Latorre , Pedro Lopez , Enric Herrero Abellanas , Georgios Tournavitis

IPC: G06F17/15 , G06N3/063 , G06F12/0862 , G06F12/0875 , G06N3/04

Abstract: An apparatus and method are described for distributed and cooperative computation in artificial neural networks. For example, one embodiment of an apparatus comprises: an input/output (I/O) interface; a plurality of processing units communicatively coupled to the I/O interface to receive data for input neurons and synaptic weights associated with each of the input neurons, each of the plurality of processing units to process at least a portion of the data for the input neurons and synaptic weights to generate partial results; and an interconnect communicatively coupling the plurality of processing units, each of the processing units to share the partial results with one or more other processing units over the interconnect, the other processing units using the partial results to generate additional partial results or final results. The processing units may share data including input neurons and weights over the shared input bus.

15.

发明授权
Storage device and method for performing convolution operations 有权

公开(公告)号：US09971540B2

公开(公告)日：2018-05-15

申请号：US14861701

申请日：2015-09-22

Applicant: INTEL CORPORATION

Inventor： Enric Herrero Abellanas , Georgios Tournavitis , Frederico C. Pratas , Marc Lupon , Fernando Latorre , Pedro Lopez , Ayose J. Falcon

IPC: G06F3/06 , G06F17/15 , G06T1/60 , G06N3/063

CPC classification number: G06F3/0644 , G06F3/0604 , G06F3/0683 , G06F17/153 , G06N3/063 , G06T1/60

Abstract: A storage device and method are described for performing convolution operations. For example, one embodiment of an apparatus to perform convolution operations comprises a plurality of processing units to execute convolution operations on input data and partial results; a unified scratchpad memory comprising a plurality of memory banks communicatively coupled to the plurality of processing units through a plurality of read/write ports, each of the plurality of memory banks partitioned to store both the input data and partial results; a control unit to allocate the input data and partial results to the memory banks to ensure a minimum quality of service in accordance with the specified number of read/write ports and the specified convolution operation to be performed.

16.

发明授权
Store forwarding for data caches 有权
Title translation: 存储转发数据缓存

公开(公告)号：US09507725B2

公开(公告)日：2016-11-29

申请号：US13729945

申请日：2012-12-28

Applicant: INTEL CORPORATION

Inventor： Steffen Kosinski , Fernando Latorre , Niranjan Cooray , Stanislav Shwartsman , Ethan Kalifon , Varun Mohandru , Pedro Lopez , Tom Aviram-Rosenfeld , Jaroslav Topp , Li-Gao Zei

IPC: G06F12/00 , G06F12/08

CPC classification number: G06F12/0895 , G06F12/0855 , G06F12/0866

Abstract: A bit or other vector may be used to identify whether an address range entered into an intermediate buffer corresponds to most recently updated data associated with the address range. A bit or other vector may also be used to identify whether an address range entered into an intermediate buffer overlaps with an address range of data that is to be loaded. A processing device may then determine whether to obtain data that is to be loaded entirely from a cache, entirely from an intermediate buffer which temporarily buffers data destined for a cache until the cache is ready to accept the data, or from both the cache and the intermediate buffer depending on the particular vector settings. Systems, devices, methods, and computer readable media are provided.

Abstract translation: 可以使用位或其他向量来识别输入中间缓冲器的地址范围是否对应于与地址范围相关联的最近更新的数据。还可以使用位或其他向量来识别输入中间缓冲器的地址范围是否与要加载的数据的地址范围重叠。然后，处理设备可以完全从中间缓冲区获得要从缓存中完全加载的数据，该中间缓冲器临时缓冲目的地为高速缓存的数据，直到高速缓存准备好接受数据，或者从高速缓存和中间缓冲区取决于特定的向量设置。提供了系统，设备，方法和计算机可读介质。

17.

发明申请
PROFILING ASYNCHRONOUS EVENTS RESULTING FROM THE EXECUTION OF SOFTWARE AT CODE REGION GRANULARITY 审中-公开

公开(公告)号：US20190004916A1

公开(公告)日：2019-01-03

申请号：US16026870

申请日：2018-07-03

Applicant: Intel Corporation

Inventor： Raul Martinez , Enric Gibert Codina , Pedro Lopez , Marti Torrents Lapuerta , Polychronis Xekalakis , Georgios Tournavitis , Kyriakos A. Stavrou , Demos Pavlou , Daniel Ortega , Alejandro Martinez Vicente , Pedro Marcuello , Grigorios Magklis , Josep M. Codina , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Christos Kotselidis , Fernando Latorre , Marc Lupon , Carlos Madriles

IPC: G06F11/30 , G06F12/0862 , G06F11/34

Abstract: A combination of hardware and software collect profile data for asynchronous events, at code region granularity. An exemplary embodiment is directed to collecting metrics for prefetching events, which are asynchronous in nature. Instructions that belong to a code region are identified using one of several alternative techniques, causing a profile bit to be set for the instruction, as a marker. Each line of a data block that is prefetched is similarly marked. Events corresponding to the profile data being collected and resulting from instructions within the code region are then identified. Each time that one of the different types of events is identified, a corresponding counter is incremented. Following execution of the instructions within the code region, the profile data accumulated in the counters are collected, and the counters are reset for use with a new code region.

18.

发明授权
Processing device for performing convolution operations 有权

公开(公告)号：US09613001B2

公开(公告)日：2017-04-04

申请号：US14136302

申请日：2013-12-20

Applicant: Intel Corporation

Inventor： Enric Herrero Abellanas , Marc Lupon , Ayose J. Falcon , Frederico C. Pratas , Fernando Latorre , Pedro Lopez

IPC: G06F17/15 , G06K9/62

CPC classification number: G06F17/15 , G06F17/153 , G06K9/4609 , G06K9/627 , G06N3/04

Abstract: Systems and methods for performing convolution operations. An example processing system comprises: a processing core; and a convolver unit to apply a convolution filter to a plurality of input data elements represented by a two-dimensional array, the convolver unit comprising a plurality of multipliers coupled to two or more sets of latches, wherein each set of latches is to store a plurality of data elements of a respective one-dimensional section of the two-dimensional array.

19.

发明申请
Instruction and Logic for Support of Code Modification In Translation Lookaside Buffers 有权
Title translation: 在翻译后备缓冲区中支持代码修改的指令和逻辑

公开(公告)号：US20160292081A1

公开(公告)日：2016-10-06

申请号：US15178008

申请日：2016-06-09

Applicant: Intel Corporation

Inventor： Jaroslaw Topp , Niranjan L. Cooray , Fernando Latorre

IPC: G06F12/10 , G06F9/30

CPC classification number: G06F12/1027 , G06F9/30003 , G06F9/30181 , G06F9/30185 , G06F9/3877 , G06F9/3887 , G06F12/0815 , G06F12/0888 , G06F12/1081 , G06F2212/1016 , G06F2212/30 , G06F2212/302 , G06F2212/50 , G06F2212/68

Abstract: A processor includes a core with logic to execute a translated instruction. The translated instruction is translated from an instruction stored in a memory location. The processor further includes a translation lookaside buffer including logic to store translation indicators from a physical map. Each translation indicator indicates whether a corresponding memory location includes translated code to be protected. The processor further includes a translation indicator agent including logic to determine whether the buffer indicates whether the memory location has been modified subsequent to translation of the instruction.

Abstract translation: 处理器包括具有执行翻译指令的逻辑的核心。转换后的指令从存储在存储单元中的指令转换。处理器还包括翻译后备缓冲器，其包括用于存储来自物理图的转换指示符的逻辑。每个翻译指示符指示对应的存储器位置是否包括要保护的翻译代码。处理器还包括翻译指示剂代理，其包括用于确定缓冲器是否指示在指令的翻译之后是否已经修改了存储器位置的逻辑。

20.

发明授权
Adaptive data prefetching 有权
Title translation: 自适应数据预取

公开(公告)号：US09280474B2

公开(公告)日：2016-03-08

申请号：US13976325

申请日：2013-01-03

Applicant: Intel Corporation

Inventor： Demos Pavlou , Pedro Lopez , Mirem Hyuseinova , Fernando Latorre , Steffen Kosinski , Ralf Goettsche , Varun K. Mohandru

IPC: G06F12/08 , G06F12/02 , G06F9/06 , G06F9/30 , G06F9/345 , G06F9/38

CPC classification number: G06F12/0862 , G06F9/06 , G06F9/30 , G06F9/3455 , G06F9/383 , G06F12/02 , G06F2212/6026

Abstract: A system and method for adaptive data prefetching in a processor enables adaptive modification of parameters associated with a prefetch operation. A stride pattern in successive addresses of a memory operation may be detected, including determining a stride length (L). Prefetching of memory operations may be based on a prefetch address determined from a base memory address, the stride length L, and a prefetch distance (D). A number of prefetch misses may be counted at a miss prefetch count (C). Based on the value of the miss prefetch count C, the prefetch distance D may be modified. As a result of adaptive modification of the prefetch distance D, an improved rate of cache hits may be realized.

Abstract translation: 用于处理器中自适应数据预取的系统和方法使得能够对与预取操作相关联的参数进行自适应修改。可以检测存储器操作的连续地址中的步幅图案，包括确定步幅长度（L）。存储器操作的预取可以基于从基本存储器地址确定的预取地址，步幅长度L和预取距离（D）。可以以错误预取计数（C）计数多个预取缺失。基于缺省预取计数C的值，可以修改预取距离D. 作为预取距离D的自适应修改的结果，可以实现改进的高速缓存命中率。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification