专利检索 ap:("Lee Evan Eisen" OR "David Stephen Levitan" OR "Francis Patrick O'Connell" OR "Wolfram M. Sauer") AND inv:"Francis Patrick O'Connell" 第 1 页

1.

发明授权
Method and logical apparatus for managing processing system resource use for speculative execution 失效
标题翻译：用于管理用于投机执行的处理系统资源使用的方法和逻辑装置

公开(公告)号：US07890738B2

公开(公告)日：2011-02-15

申请号：US11039498

申请日：2005-01-20

申请人： Lee Evan Eisen , David Stephen Levitan , Francis Patrick O'Connell , Wolfram M. Sauer

发明人： Lee Evan Eisen , David Stephen Levitan , Francis Patrick O'Connell , Wolfram M. Sauer

IPC分类号： G06F9/50 , G06F9/42

CPC分类号： G06F9/5011 , G06F9/3844 , G06F9/3851 , G06F2209/507 , Y02D10/22

摘要： A method and logical apparatus for managing processing system resource use for speculative execution reduces the power and performance burden associated with inefficient speculative execution of program instructions. A measure of the efficiency of speculative execution is used to reduce resources allocated to a thread while the speculation efficiency is low. The resource control applied may be the number of instruction fetches allocated to the thread or the number of execution time slices. Alternatively, or in combination, the size of a prefetch instruction storage allocated to the thread may be limited. The control condition may be comparison of the number of correct or incorrect speculations to a threshold, comparison of the number of correct to incorrect speculations, or a more complex evaluator such as the size of a ratio of incorrect to total speculations.

摘要翻译： 用于管理用于推测性执行的处理系统资源使用的方法和逻辑装置降低与程序指令的无效推测执行相关联的功率和性能负担。投机执行效率的度量用于减少分配给线程的资源，同时投机效率低。应用的资源控制可以是分配给线程的指令获取的数量或执行时间片的数量。或者或组合地，分配给线程的预取指令存储器的大小可能受到限制。控制条件可以是正确的或不正确的猜测的数量与阈值的比较，正确到不正确的猜测的数量的比较，或比较复杂的评估者，比如不正确比例与总猜测的比例。

2.

发明授权
Selectively prohibiting speculative execution of conditional branch type based on instruction bit 失效
标题翻译：选择性地禁止基于指令位的推测性执行条件分支类型

公开(公告)号：US07254693B2

公开(公告)日：2007-08-07

申请号：US11002522

申请日：2004-12-02

申请人： Lee Evan Eisen , Francis Patrick O'Connell

发明人： Lee Evan Eisen , Francis Patrick O'Connell

IPC分类号： G06F9/38

CPC分类号： G06F9/3842 , G06F9/30058 , G06F9/30181 , G06F9/3844 , G06F9/3851 , G06F9/3867

摘要： A method, apparatus, and computer program product are disclosed for selectively prohibiting speculative conditional branch execution. A particular type of conditional branch instruction is selected. An indication is stored within each instruction that is the particular type of conditional branch instruction. A processor then fetches a first instruction from code that is to be executed. A determination is made regarding whether the first instruction includes the indication. In response to determining that the instruction includes the indication: speculative execution of the first instruction is prohibited, an actual location to which the first instruction will branch is resolved, and execution of the code is branched to the actual location. In response to determining that the instruction does not include the indication, the first instruction is speculatively executed.

摘要翻译： 公开了用于选择性地禁止推测性条件分支执行的方法，装置和计算机程序产品。选择特定类型的条件分支指令。指示存储在作为条件分支指令的特定类型的每个指令内。然后处理器从要执行的代码中获取第一条指令。确定第一指令是否包括指示。响应于确定指令包括指示：禁止第一指令的推测执行，第一指令将分支的实际位置被解析，并且代码的执行被分支到实际位置。响应于确定指令不包括指示，推测性地执行第一指令。

3.

发明授权
Fine-grained software-directed data prefetching using integrated high-level and low-level code analysis optimizations 失效
标题翻译：使用集成的高级和低级代码分析优化进行细粒度的软件导向数据预取

公开(公告)号：US07669194B2

公开(公告)日：2010-02-23

申请号：US10926595

申请日：2004-08-26

申请人： Roch Georges Archambault , Robert James Blainey , Yaoqing Gao , Allan Russell Martin , James Lawrence McInnes , Francis Patrick O'Connell

发明人： Roch Georges Archambault , Robert James Blainey , Yaoqing Gao , Allan Russell Martin , James Lawrence McInnes , Francis Patrick O'Connell

IPC分类号： G06F9/44 , G06F9/45 , G06F9/30

CPC分类号： G06F8/4442

摘要： A mechanism for minimizing effective memory latency without unnecessary cost through fine-grained software-directed data prefetching using integrated high-level and low-level code analysis and optimizations is provided. The mechanism identifies and classifies streams, identifies data that is most likely to incur a cache miss, exploits effective hardware prefetching to determine the proper number of streams to be prefetched, exploits effective data prefetching on different types of streams in order to eliminate redundant prefetching and avoid cache pollution, and uses high-level transformations with integrated lower level cost analysis in the instruction scheduler to schedule prefetch instructions effectively.

摘要翻译： 提供了一种通过使用集成高级和低级代码分析和优化的细粒度软件导向数据预取来最小化有效存储器延迟而不需要成本的机制。该机制识别和分类流，识别最可能引起缓存未命中的数据，利用有效的硬件预取来确定要预取的流的适当数量，利用不同类型的流上的有效数据预取，以消除冗余预取和避免高速缓存污染，并在指令调度程序中使用集成较低级别成本分析的高级转换，有效地调度预取指令。

4.

发明授权
Method and apparatus for mapping software prefetch instructions to hardware prefetch logic 失效
标题翻译：将软件预取指令映射到硬件预取逻辑的方法和装置

公开(公告)号：US06915415B2

公开(公告)日：2005-07-05

申请号：US10042102

申请日：2002-01-07

申请人： Michael John Mayfield , Francis Patrick O'Connell , David Scott Ray

发明人： Michael John Mayfield , Francis Patrick O'Connell , David Scott Ray

IPC分类号： G06F9/00 , G06F9/30 , G06F9/318 , G06F9/38

CPC分类号： G06F9/383 , G06F9/30047 , G06F9/3017 , G06F9/3455

摘要： A method and apparatus for mapping some software prefetch instructions in a microprocessor system to a modified set of hardware prefetch instructions and executing the software prefetch by invoking the corresponding modified hardware prefetch instruction. For common software prefetch access patterns, by mapping the software prefetches to hardware, improved prefetching can be achieved without the need for additional hardware.

摘要翻译： 一种用于将微处理器系统中的一些软件预取指令映射到修改的硬件预取指令集并且通过调用相应的修改的硬件预取指令来执行软件预取的方法和装置。对于常见的软件预取访问模式，通过将软件预取映射到硬件，可以实现改进的预取，而无需额外的硬件。

5.

发明授权
Software prefetch system and method for predetermining amount of streamed data 失效
标题翻译：软件预取系统和预测流数据量的方法

公开(公告)号：US06574712B1

公开(公告)日：2003-06-03

申请号：US09550180

申请日：2000-04-14

申请人： James Allan Kahle , Michael John Mayfield , Francis Patrick O'Connell , David Scott Ray , Edward John Silha , Joel M. Tendler

发明人： James Allan Kahle , Michael John Mayfield , Francis Patrick O'Connell , David Scott Ray , Edward John Silha , Joel M. Tendler

IPC分类号： G06F1208

CPC分类号： G06F9/383 , G06F9/30047 , G06F9/3802 , G06F12/0862 , G06F12/0897 , G06F2212/6028

摘要： A data processing system includes a processor having a first level cache and a prefetch engine. Coupled to the processor are a second level cache and a third level cache and a system memory. Prefetching of cache lines is performed into each of the first, second, and third level caches by the prefetch engine. Prefetch requests from the prefetch engine to the second and third level caches is performed over a private prefetch request bus, which is separate from the bus system that transfers data from the various cache levels to the processor. A software instruction is used to accelerate the prefetch process by overriding the normal functionality of the hardware prefetch engine. The instruction also limits the amount of data to be prefetched.

摘要翻译： 数据处理系统包括具有第一级高速缓存和预取引擎的处理器。耦合到处理器的是二级缓存和第三级缓存和系统存储器。通过预取引擎对高速缓存行的预取执行到第一，第二和第三级高速缓存中的每一个。从预取引擎到第二和第三级高速缓存的预取请求通过专用预取请求总线执行，该专用预取请求总线与将数据从各种高速缓存级别传送到处理器的总线系统分开。软件指令用于通过覆盖硬件预取引擎的正常功能来加速预取过程。该指令还限制了要预取的数据量。

6.

发明授权
Data stream prefetching in a microprocessor 失效
标题翻译：数据流在微处理器中预取

公开(公告)号：US07904661B2

公开(公告)日：2011-03-08

申请号：US11953637

申请日：2007-12-10

申请人： Eric Jason Fluhr , Bradly George Frey , John Barry Griswell, Jr. , Hung Qui Le , Cathy May , Francis Patrick O'Connell , Edward John Silha , Albert Thomas Williams

发明人： Eric Jason Fluhr , Bradly George Frey , John Barry Griswell, Jr. , Hung Qui Le , Cathy May , Francis Patrick O'Connell , Edward John Silha , Albert Thomas Williams

IPC分类号： G06F12/00 , G06F13/00

CPC分类号： G06F12/0862 , G06F2212/6028

摘要： A method of prefetching data in a microprocessor includes identifying a data stream associated with a process and determining a depth associated with the data stream based upon prefetch factors including the number of currently concurrent data streams and data consumption rates associated with the concurrent data streams. Data prefetch requests are allocated with the data stream to reflect the determined depth of the data stream. Allocating data prefetch requests may include allocating prefetch requests for a number of cache lines away from the cache line currently being referenced, wherein the number of cache lines is equal to the determined depth. The method may include, responsive to determining the depth associated with a data stream, configuring prefetch hardware to reflect the determined depth for the identified data stream. Prefetch control bits in an instruction executed by the processor control the prefetch hardware configuration.

摘要翻译： 在微处理器中预取数据的方法包括基于包括当前并发数据流的数量和与并发数据流相关联的数据消耗速率的预取因子来识别与进程相关联的数据流并确定与数据流相关联的深度。数据预取请求被分配与数据流以反映确定的数据流的深度。分配数据预取请求可以包括为当前被引用的高速缓存行分配多个高速缓存行的预取请求，其中高速缓存行的数量等于所确定的深度。该方法可以响应于确定与数据流相关联的深度，配置预取硬件以反映所识别的数据流的确定的深度。由处理器执行的指令中的预取控制位控制预取硬件配置。

7.

发明授权
System using stream prefetching history to improve data prefetching performance 失效
标题翻译：系统使用流预取历史来提高数据预取性能

公开(公告)号：US07689775B2

公开(公告)日：2010-03-30

申请号：US12400052

申请日：2009-03-09

申请人： John Barry Griswell, Jr. , Francis Patrick O'Connell

发明人： John Barry Griswell, Jr. , Francis Patrick O'Connell

IPC分类号： G06F12/00

CPC分类号： G06F12/0862 , G06F12/0866 , G06F2212/6024

摘要： Computer implemented method, system and computer program product for prefetching data in a data processing system. A computer implemented method for prefetching data in a data processing system includes generating attribute information of prior data streams by associating attributes of each prior data stream with a storage access instruction which caused allocation of the data stream, and then recording the generated attribute information. The recorded attribute information is accessed, and a behavior of a new data stream is modified using the accessed recorded attribute information.

摘要翻译： 计算机实现方法，系统和计算机程序产品，用于在数据处理系统中预取数据。一种用于在数据处理系统中预取数据的计算机实现方法包括通过将每个先前数据流的属性与导致数据流分配的存储访问指令相关联，然后记录所生成的属性信息来生成先前数据流的属性信息。访问记录的属性信息，并且使用所访问的记录的属性信息来修改新的数据流的行为。

8.

发明授权
Method and apparatus for managing cache line replacement within a computer system 有权
标题翻译：用于在计算机系统内管理高速缓存线更换的方法和装置

公开(公告)号：US06510493B1

公开(公告)日：2003-01-21

申请号：US09354127

申请日：1999-07-15

申请人： Peichun Peter Liu , Francis Patrick O'Connell

发明人： Peichun Peter Liu , Francis Patrick O'Connell

IPC分类号： G06F1200

CPC分类号： G06F12/128 , G06F12/0897

摘要： A cache memory having a mechanism for managing cache lines replacement is disclosed. The cache memory comprises multiple cache lines partitioned into a first group and a second group. The number of cache lines in the second group is preferably larger than the number of cache lines in the first group. A replacement logic block selectively chooses a cache line from one of the two groups of cache lines for replacement during an allocation cycle.

摘要翻译： 公开了一种具有用于管理高速缓存行替换的机制的高速缓冲存储器。高速缓冲存储器包括分割成第一组和第二组的多个高速缓存行。第二组中的高速缓存行的数量优选地大于第一组中的高速缓存行的数量。替换逻辑块在分配周期期间有选择地从两组高速缓存线之一中选择一条高速缓存行进行替换。

9.

发明授权
Method and apparatus for software-assisted data cache and prefetch control 有权
标题翻译：用于软件辅助数据缓存和预取控制的方法和装置

公开(公告)号：US08490065B2

公开(公告)日：2013-07-16

申请号：US11250054

申请日：2005-10-13

申请人： Roch Archambault , Yaoqing Gao , Francis Patrick O'Connell , Robert Brett Tremaine , Michael Edward Wazlowski , Steven Wayne White , Lixin Zhang

发明人： Roch Archambault , Yaoqing Gao , Francis Patrick O'Connell , Robert Brett Tremaine , Michael Edward Wazlowski , Steven Wayne White , Lixin Zhang

IPC分类号： G06F9/44 , G06F9/45

CPC分类号： G06F9/3879 , G06F8/4442 , G06F9/30047 , G06F9/3455 , G06F12/0862 , G06F2212/6028

摘要： The present invention provides a computer implemented method, apparatus, and computer usable program code for compiling instructions to manage a cache system. Loop constructs are analyzed to identify data usage characteristics for cache and prefetching conditions in instructions to form identified prefetch conditions. A set of control instructions are inserted into the instructions based on the data usage characteristics and the identified prefetch conditions to form multiple modified instructions. The set of multiple modified instructions are compiled to generate code for execution to form compiled instructions. The set of control instructions in the compiled instructions form a cache management policy to control movement of data in a memory system during execution of the compiled instructions.

摘要翻译： 本发明提供了一种用于编译用于管理高速缓存系统的指令的计算机实现的方法，装置和计算机可用程序代码。分析循环结构以识别指令中的缓存和预取条件的数据使用特征以形成识别的预取条件。基于数据使用特性和识别的预取条件将一组控制指令插入到指令中以形成多个修改的指令。编译多组修改指令的集合以生成用于执行的代码以形成编译指令。编译指令中的一组控制指令形成高速缓存管理策略以在执行编译指令期间控制存储器系统中数据的移动。

10.

发明授权
Store stream prefetching in a microprocessor 失效
标题翻译：在微处理器中存储流预取

公开(公告)号：US07716427B2

公开(公告)日：2010-05-11

申请号：US11969677

申请日：2008-01-04

申请人： John Barry Griswell, Jr. , Hung Qui Le , Francis Patrick O'Connell , William J. Starke , Jeffrey Adam Stuecheli , Albert Thomas Williams

发明人： John Barry Griswell, Jr. , Hung Qui Le , Francis Patrick O'Connell , William J. Starke , Jeffrey Adam Stuecheli , Albert Thomas Williams

IPC分类号： G06F12/00 , G06F9/38

CPC分类号： G06F12/0862 , G06F9/30043 , G06F9/30047 , G06F9/383 , G06F12/0811

摘要： In a microprocessor having a load/store unit and prefetch hardware, the prefetch hardware includes a prefetch queue containing entries indicative of allocated data streams. A prefetch engine receives an address associated with a store instruction executed by the load/store unit. The prefetch engine determines whether to allocate an entry in the prefetch queue corresponding to the store instruction by comparing entries in the queue to a window of addresses encompassing multiple cache blocks, where the window of addresses is derived from the received address. The prefetch engine compares entries in the prefetch queue to a window of 2M contiguous cache blocks. The prefetch engine suppresses allocation of a new entry when any entry in the prefetch queue is within the address window. The prefetch engine further suppresses allocation of a new entry when the data address of the store instruction is equal to an address in a border area of the address window.

摘要翻译： 在具有加载/存储单元和预取硬件的微处理器中，预取硬件包括预取队列，其包含指示分配的数据流的条目。预取引擎接收与由加载/存储单元执行的存储指令相关联的地址。预取引擎通过将队列中的条目与包含多个高速缓存块的地址的窗口进行比较来确定是否对与存储指令相对应的预取队列中的条目进行分配，其中地址窗口从接收到的地址导出。预取引擎将预取队列中的条目与2M个连续高速缓存块的窗口进行比较。当预取队列中的任何条目都在地址窗口内时，预取引擎抑制新条目的分配。当存储指令的数据地址等于地址窗口的边界区域中的地址时，预取引擎进一步抑制新条目的分配。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类