专利检索 ap:"Marc Lupon" 第 1 页

1.

发明授权
Mechanism for facilitating dynamic and efficient fusion of computing instructions in software programs 有权
标题翻译：促进软件程序中计算指令的动态和有效融合的机制

公开(公告)号：US09329848B2

公开(公告)日：2016-05-03

申请号：US14129956

申请日：2013-03-27

申请人： Marc Lupon , Raul Martinez , Enric Gibert Codina , Kyriakos A. Stavrou , Grigorios Magklis , Sridhar Samudrala

发明人： Marc Lupon , Raul Martinez , Enric Gibert Codina , Kyriakos A. Stavrou , Grigorios Magklis , Sridhar Samudrala

IPC分类号： G06F9/45

CPC分类号： G06F8/443 , G06F8/4432 , G06F8/4434 , G06F8/4441 , Y02D10/41

摘要： A mechanism is described for facilitating dynamic and efficient fusion of computing instructions according to one embodiment. A method of embodiments, as described herein, includes monitoring a software program for a program region having fusion candidate instructions for a fusion operation at a computing system; evaluating whether the macro operation of the candidate instructions is valuable to the software program; and performing the fusion operation if it is evaluated to be valuable.

摘要翻译： 描述了根据一个实施例的用于促进计算指令的动态和有效融合的机制。如本文所述的实施例的方法包括监视具有用于在计算系统处的融合操作的融合候选指令的程序区域的软件程序; 评估候选指令的宏操作是否对软件程序有价值; 如果评估为有价值，则进行融合操作。

2.

发明申请
METHOD, APPARATUS AND SYSTEM FOR SELECTIVE EXECUTION OF A COMMIT INSTRUCTION 审中-公开
标题翻译：方法，选择性执行委托指令的方法和系统

公开(公告)号：US20140156976A1

公开(公告)日：2014-06-05

申请号：US13995441

申请日：2011-12-22

申请人： Enric Gibert Codina , Josep M. Codina , Fernando Latorre , Pedro Marcuello , Pedro Lopez , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Christos E. Kotselidis , Marc Lupon , Carlos Madriles Gimeno , Grigorios Magklis , Alejandro Martinez Vicente , Raul Martinez , Daniel Ortega , Demos Pavlou , Kyriakos A. Stavrou , Georgios Tournavitis , Polychronis Xekalakis

发明人： Enric Gibert Codina , Josep M. Codina , Fernando Latorre , Pedro Marcuello , Pedro Lopez , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Christos E. Kotselidis , Marc Lupon , Carlos Madriles Gimeno , Grigorios Magklis , Alejandro Martinez Vicente , Raul Martinez , Daniel Ortega , Demos Pavlou , Kyriakos A. Stavrou , Georgios Tournavitis , Polychronis Xekalakis

IPC分类号： G06F9/30

CPC分类号： G06F9/30145 , G06F9/30087 , G06F9/30185 , G06F9/3842 , G06F9/3857 , G06F9/3863

摘要： Techniques and mechanisms for a processor to determine whether a commit action is to be performed. In an embodiment, a processor performs operations to determine whether a commit instruction is for contingent performance of a commit action. In another embodiment, one or more conditions of processor state are evaluated in response to determining that the commit instruction is for contingent performance of the commit action, where the evaluation is performed to determine whether the commit action indicated by the commit instruction is to be performed.

摘要翻译： 处理器确定是否执行提交动作的技术和机制。在一个实施例中，处理器执行操作以确定提交指令是否用于或然执行提交动作。在另一个实施例中，响应于确定提交指令用于提交动作的或然执行来评估处理器状态的一个或多个条件，其中执行评估以确定是否要执行由提交指令指示的提交动作。

3.

发明授权
Propagating a prefetching profile bit from a prefetch queue to a data cache to indicate that a line was prefetched in response to an instruction within a code region 有权

公开(公告)号：US10013326B2

公开(公告)日：2018-07-03

申请号：US13993054

申请日：2011-12-29

申请人： Raul Martinez , Enric Gibert Codina , Pedro Lopez , Marti Torrents Lapuerta , Polychronis Xekalakis , Georgios Tournavitis , Kyriakos A. Stavrou , Demos Pavlou , Daniel Ortega , Alejandro Martinez Vicente , Pedro Marcuello , Grigorios Magklis , Josep M. Codina , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Christos Kotselidis , Fernando Latorre , Marc Lupon , Carlos Madriles

发明人： Raul Martinez , Enric Gibert Codina , Pedro Lopez , Marti Torrents Lapuerta , Polychronis Xekalakis , Georgios Tournavitis , Kyriakos A. Stavrou , Demos Pavlou , Daniel Ortega , Alejandro Martinez Vicente , Pedro Marcuello , Grigorios Magklis , Josep M. Codina , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Christos Kotselidis , Fernando Latorre , Marc Lupon , Carlos Madriles

IPC分类号： G06F11/30 , G06F11/34 , G06F12/0862

CPC分类号： G06F11/3037 , G06F11/3409 , G06F11/3466 , G06F12/0862 , G06F2201/86 , G06F2201/865 , G06F2201/88 , G06F2201/885

摘要： A combination of hardware and software collect profile data for asynchronous events, at code region granularity. An exemplary embodiment is directed to collecting metrics for prefetching events, which are asynchronous in nature. Instructions that belong to a code region are identified using one of several alternative techniques, causing a profile bit to be set for the instruction, as a marker. Each line of a data block that is prefetched is similarly marked. Events corresponding to the profile data being collected and resulting from instructions within the code region are then identified. Each time that one of the different types of events is identified, a corresponding counter is incremented. Following execution of the instructions within the code region, the profile data accumulated in the counters are collected, and the counters are reset for use with a new code region.

4.

发明授权
Double rounded combined floating-point multiply and add 有权
标题翻译：双圆形组合浮点乘法和加法

公开(公告)号：US09213523B2

公开(公告)日：2015-12-15

申请号：US13539198

申请日：2012-06-29

申请人： Sridhar Samudrala , Grigorios Magklis , Marc Lupon , David R. Ditzel

发明人： Sridhar Samudrala , Grigorios Magklis , Marc Lupon , David R. Ditzel

IPC分类号： G06F7/38 , G06F7/483 , G06F7/544

CPC分类号： G06F7/4876 , G06F7/483 , G06F7/485 , G06F7/4991 , G06F7/49915 , G06F7/5443 , G06F2207/4802

摘要： Methods, apparatus, instructions and logic are disclosed providing double rounded combined floating-point multiply and add functionality as scalar or vector SIMD instructions or as fused micro-operations. Embodiments include detecting floating-point (FP) multiplication operations and subsequent FP operations specifying as source operands results of the FP multiplications. The FP multiplications and the subsequent FP operations are encoded as combined FP operations including rounding of the results of FP multiplication followed by the subsequent FP operations. The encoding of said combined FP operations may be stored and executed as part of an executable thread portion using fused-multiply-add hardware that includes overflow detection for the product of FP multipliers, first and second FP adders to add third operand addend mantissas and the products of the FP multipliers with different rounding inputs based on overflow, or no overflow, in the products of the FP multiplier. Final results are selected respectively using overflow detection.

摘要翻译： 公开了提供双向组合浮点乘法和附加功能作为标量或向量SIMD指令或作为融合微操作的方法，装置，指令和逻辑。实施例包括检测浮点（FP）乘法运算和指定作为FP乘法的源操作数结果的后续FP操作。 FP乘法和随后的FP操作被编码为组合FP操作，包括对FP乘法的结果进行舍入，随后是随后的FP操作。所述组合FP操作的编码可以作为可执行线程部分的一部分使用融合乘法硬件来存储和执行，所述融合乘法加法器包括用于FP乘法器的乘积的溢出检测，第一和第二FP加法器来添加第三操作数加法尾数，基于FP乘法器产品中溢出或不溢出的FP乘法器的不同舍入输入的产品。分别使用溢出检测选择最终结果。

5.

发明申请
SUPPORT FOR SPECULATIVE OWNERSHIP WITHOUT DATA 审中-公开
标题翻译：支持无数据的统计所有权

公开(公告)号：US20130268735A1

公开(公告)日：2013-10-10

申请号：US13994686

申请日：2011-12-29

申请人： Enric Gibert Codina , Fernando Latorre , Josep M. Codina , Crispin Gomez Requena , Antonio Gonzalez , Meyrem Hyuseinova , Christos E. Kotselidis , Pedro Lopez , Marc Lupon , Carlos Madriles , Grigorios Magklis , Pedro Marcuello , Alejandro Martinez Vicente , Raul Martinez , Daniel Ortega , Demos Pavlou , Kyriakos A. Stavrou , Georgios Tournavitis , Polychronis Xekalakis

发明人： Enric Gibert Codina , Fernando Latorre , Josep M. Codina , Crispin Gomez Requena , Antonio Gonzalez , Meyrem Hyuseinova , Christos E. Kotselidis , Pedro Lopez , Marc Lupon , Carlos Madriles , Grigorios Magklis , Pedro Marcuello , Alejandro Martinez Vicente , Raul Martinez , Daniel Ortega , Demos Pavlou , Kyriakos A. Stavrou , Georgios Tournavitis , Polychronis Xekalakis

IPC分类号： G06F12/08

CPC分类号： G06F12/0815 , G06F2212/507

摘要： Techniques are described for providing an enhanced cache coherency protocol for a multi-core processor that includes a Speculative Request For Ownership Without Data (SRFOWD) for a portion of cache memory. With a SRFOWD, only an acknowledgement message may be provided as an answer to a requesting core. The contents of the affected cache line are not required to be a part of the answer. The enhanced cache coherency protocol may assure that a valid copy of the current cache line exists in case of misspeculation by the requesting core. Thus, an owner of the current copy of the cache line may maintain a copy of the old contents of the cache line. The old contents of the cache line may be discarded if speculation by the requesting core turns out to be correct. Otherwise, in case of misspeculation by the requesting core, the old contents of the cache line may be set back to a valid state.

摘要翻译： 描述了用于为多核处理器提供增强的高速缓存一致性协议的技术，所述多核处理器包括用于高速缓冲存储器的一部分的无数据的投机请求（SRFOWD）。使用SRFOWD，可以仅提供确认消息作为请求内核的答案。受影响的缓存行的内容不需要作为答案的一部分。增强的高速缓存一致性协议可以确保在请求核心的错误配置的情况下存在当前高速缓存行的有效副本。因此，高速缓存行的当前副本的所有者可以维护高速缓存行的旧内容的副本。如果请求的核心的推测证明是正确的，则可以丢弃该高速缓存行的旧内容。否则，在请求核心发生错误的情况下，可以将高速缓存行的旧内容设置回有效状态。

6.

发明申请
PARTIAL COMMITS IN DYNAMIC BINARY TRANSLATION BASED SYSTEMS 有权
标题翻译：基于动态二进制翻译的系统的部分组合

公开(公告)号：US20150007153A1

公开(公告)日：2015-01-01

申请号：US13929360

申请日：2013-06-27

申请人： Raul Martinez , Enric Gibert Codina , Marc Lupon , Kyriakos A. Stavrou

发明人： Raul Martinez , Enric Gibert Codina , Marc Lupon , Kyriakos A. Stavrou

IPC分类号： G06F9/45 , G06F11/14

CPC分类号： G06F8/443 , G06F8/4434

摘要： Described herein are technologies for optimizing computer code. A code generator can optimize a portion of original code to create optimized code. The code generator can create a partial commit point to indicate that execution of the optimized code produces an invalid architectural state. The code generator can create recovery information recover a valid architectural state at a recovery point. The code generator can associate the partial commit point and recovery information with the optimized code.

摘要翻译： 这里描述的是用于优化计算机代码的技术。代码生成器可以优化原始代码的一部分以创建优化的代码。代码生成器可以创建部分提交点，以指示优化代码的执行产生无效的架构状态。代码生成器可以创建恢复信息，以在恢复点恢复有效的体系结构状态。代码生成器可以将部分提交点和恢复信息与优化的代码相关联。

7.

发明申请
INSTRUCTION AND LOGIC FOR OPTIMIZATION LEVEL AWARE BRANCH PREDICTION 审中-公开
标题翻译：指导和逻辑优化水平分支预测

公开(公告)号：US20140095849A1

公开(公告)日：2014-04-03

申请号：US13631402

申请日：2012-09-28

申请人： Polychronis Xekalakis , Pedro Marcuello , Alejandro Vicente Martinez , Christos E. Kotselidis , Grigorios Magklis , Fernando Latorre , Raul Martinez , Josep M. Codina , Enric Gibert Codina , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Pedro Lopez , Marc Lupon , Carlos Madriles , Daniel Ortega , Demos Pavlou , Kyriakos A. Stavrou , Georgios Tournavitis

发明人： Polychronis Xekalakis , Pedro Marcuello , Alejandro Vicente Martinez , Christos E. Kotselidis , Grigorios Magklis , Fernando Latorre , Raul Martinez , Josep M. Codina , Enric Gibert Codina , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Pedro Lopez , Marc Lupon , Carlos Madriles , Daniel Ortega , Demos Pavlou , Kyriakos A. Stavrou , Georgios Tournavitis

IPC分类号： G06F9/38

CPC分类号： G06F9/38 , G06F9/30036 , G06F9/3848 , G06F9/3877 , G06F9/3887 , G06F9/3889

摘要： A computer-readable storage medium, method and system for optimization-level aware branch prediction is described. A gear level is assigned to a set of application instructions that have been optimized. The gear level is also stored in a register of a branch prediction unit of a processor. Branch prediction is then performed by the processor based upon the gear level.

摘要翻译： 描述了用于优化级感知分支预测的计算机可读存储介质，方法和系统。齿轮级别被分配给已优化的一组应用指令。齿轮级也存储在处理器的分支预测单元的寄存器中。然后由处理器基于齿轮级执行分支预测。

8.

发明申请
PROFILING ASYNCHRONOUS EVENTS RESULTING FROM THE EXECUTION OF SOFTWARE AT CODE REGION GRANULARITY 有权
标题翻译：根据代码区域格式的软件执行情况分析异常事件

公开(公告)号：US20130332705A1

公开(公告)日：2013-12-12

申请号：US13993054

申请日：2011-12-29

申请人： Raul Martinez , Enric Gibert Codina , Pedro Lopez , Marti Torrents Lapuerta , Polychronis Xekalakis , Georgios Tournavitis , Kyriakos A. Stavrou , Demos Pavlou , Daniel Ortega , Alejandro Martinez Vicente , Pedro Marcuello , Grigorios Magklis , Josep M. Codina , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Christos Kotselidis , Fernando Latorre , Marc Lupon , Carlos Madriles

发明人： Raul Martinez , Enric Gibert Codina , Pedro Lopez , Marti Torrents Lapuerta , Polychronis Xekalakis , Georgios Tournavitis , Kyriakos A. Stavrou , Demos Pavlou , Daniel Ortega , Alejandro Martinez Vicente , Pedro Marcuello , Grigorios Magklis , Josep M. Codina , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Christos Kotselidis , Fernando Latorre , Marc Lupon , Carlos Madriles

IPC分类号： G06F9/30

CPC分类号： G06F11/3037 , G06F11/3409 , G06F11/3466 , G06F12/0862 , G06F2201/86 , G06F2201/865 , G06F2201/88 , G06F2201/885

摘要： A combination of hardware and software collect profile data for asynchronous events, at code region granularity. An exemplary embodiment is directed to collecting metrics for prefetching events, which are asynchronous in nature. Instructions that belong to a code region are identified using one of several alternative techniques, causing a profile bit to be set for the instruction, as a marker. Each line of a data block that is prefetched is similarly marked. Events corresponding to the profile data being collected and resulting from instructions within the code region are then identified. Each time that one of the different types of events is identified, a corresponding counter is incremented. Following execution of the instructions within the code region, the profile data accumulated in the counters are collected, and the counters are reset for use with a new code region.

摘要翻译： 硬件和软件的组合以代码区域粒度收集用于异步事件的简档数据。示例性实施例涉及收集在本质上是异步的预取事件的度量。使用几种替代技术之一来标识属于代码区域的指令，使得为指令设置配置文件位作为标记。预取数据块的每一行都被类似地标记。然后识别与正在收集并且由代码区域内的指令产生的简档数据相对应的事件。每次识别出不同类型的事件之一时，相应的计数器将被递增。在代码区域中执行指令之后，收集在计数器中累积的简档数据，并且重置计数器以与新的代码区域一起使用。

9.

发明授权
Managed instruction cache prefetching 有权

公开(公告)号：US09811341B2

公开(公告)日：2017-11-07

申请号：US13995649

申请日：2011-12-29

申请人： Kyriakos A. Stavrou , Enric Gibert Codina , Josep M. Codina , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Christos E. Kotselidis , Fernando Latorre , Pedro Lopez , Marc Lupon , Carlos Madriles Gimeno , Grigorios Magklis , Pedro Marcuello , Alejandro Martinez Vicente , Raul Martinez , Daniel Ortega , Demos Pavlou , Georgios Tournavitis , Polychronis Xekalakis

发明人： Kyriakos A. Stavrou , Enric Gibert Codina , Josep M. Codina , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Christos E. Kotselidis , Fernando Latorre , Pedro Lopez , Marc Lupon , Carlos Madriles Gimeno , Grigorios Magklis , Pedro Marcuello , Alejandro Martinez Vicente , Raul Martinez , Daniel Ortega , Demos Pavlou , Georgios Tournavitis , Polychronis Xekalakis

IPC分类号： G06F9/38 , G06F9/30 , G06F12/0862

CPC分类号： G06F9/3804 , G06F9/30047 , G06F9/3017 , G06F9/3802 , G06F9/3806 , G06F9/3848 , G06F12/0862 , G06F2212/452

摘要： Disclosed is an apparatus and method to manage instruction cache prefetching from an instruction cache. A processor may comprise: a prefetch engine; a branch prediction engine to predict the outcome of a branch; and dynamic optimizer. The dynamic optimizer may be used to control: identifying common instruction cache misses and inserting a prefetch instruction from the prefetch engine to the instruction cache.

10.

发明申请
RECONFIGURABLE PROCESSING UNIT 有权
标题翻译：可重构加工单元

公开(公告)号：US20150170021A1

公开(公告)日：2015-06-18

申请号：US14133192

申请日：2013-12-18

申请人： Marc Lupon , Enric Herrero Abellanas , Ayose Falcon , Fernando Latorre , Pedro Lopez , Frederico Pratas

发明人： Marc Lupon , Enric Herrero Abellanas , Ayose Falcon , Fernando Latorre , Pedro Lopez , Frederico Pratas

IPC分类号： G06N3/02

CPC分类号： G06N3/063

摘要： A processing device includes a processor core and a number of calculation modules that each is configurable to perform any one of operations for a convolutional neuron network system. A first set of the calculation modules are configured to perform convolution operations, a second set of the calculation modules are reconfigured to perform averaging operations, and a third set of the calculation modules are reconfigured to perform dot product operations.

摘要翻译： 处理设备包括处理器核心和多个计算模块，每个计算模块可配置为执行卷积神经网络系统的任何一个操作。第一组计算模块被配置为执行卷积运算，第二组计算模块被重新配置以执行平均运算，并且第三组计算模块被重新配置以执行点积运算。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类