专利检索 ap:("Lakshminarayana Baba Arimilli" OR "Ravi K. Arimilli" OR "Balaram Sinharoy") AND inv:"Balaram Sinharoy" 第 1 页

1.

发明授权
Techniques for cache injection in a processor system with replacement policy position modification 失效
标题翻译：在具有替换政策位置修改的处理器系统中缓存注入的技术

公开(公告)号：US08429349B2

公开(公告)日：2013-04-23

申请号：US12212977

申请日：2008-09-18

申请人： Lakshminarayana Baba Arimilli , Ravi K. Arimilli , Balaram Sinharoy

发明人： Lakshminarayana Baba Arimilli , Ravi K. Arimilli , Balaram Sinharoy

IPC分类号： G06F12/00 , G06F13/00 , G06F13/28

CPC分类号： G06F12/0862 , G06F12/0835 , G06F12/127 , G06F2212/6028

摘要： A technique for performing cache injection includes monitoring, at a cache, addresses on a bus. Ownership of input/output data on the bus is then acquired by the cache when an address on the bus (that is associated with the input/output data) corresponds to an address of a data block stored in the cache. A replacement policy position of the data block is then modified (to increase a probability that the data block is consumed prior to ejection from the cache).

摘要翻译： 用于执行高速缓存注入的技术包括在缓存上监视总线上的地址。当总线上的地址（与输入/输出数据相关联）对应于存储在高速缓存中的数据块的地址时，总线上的输入/输出数据的所有权由高速缓存获取。然后修改数据块的替换策略位置（以增加数据块在从高速缓存弹出之前消耗的概率）。

2.

发明授权
Techniques for cache injection in a processor system using a cache injection instruction 有权
标题翻译：使用高速缓存注入指令在处理器系统中缓存注入的技术

公开(公告)号：US09256540B2

公开(公告)日：2016-02-09

申请号：US12212935

申请日：2008-09-18

申请人： Lakshminarayana Baba Arimilli , Ravi K. Arimilli , Balaram Sinharoy

发明人： Lakshminarayana Baba Arimilli , Ravi K. Arimilli , Balaram Sinharoy

IPC分类号： G06F12/00 , G06F12/08 , G06F3/06

CPC分类号： G06F12/0862 , G06F3/0604 , G06F3/061 , G06F3/0653 , G06F12/0815 , G06F12/0831 , G06F12/0835 , G06F2212/6028

摘要： A technique for performing cache injection includes monitoring addresses on a bus in response to a cache injection instruction. Ownership of input/output data on the bus is acquired by a cache when an address on the bus (that is associated with the input/output data) corresponds to an address of a data block associated with the cache injection instruction.

摘要翻译： 用于执行高速缓存注入的技术包括响应于高速缓存注入指令监视总线上的地址。当总线上的地址（与输入/输出数据相关联）对应于与高速缓存注入指令相关联的数据块的地址时，通过高速缓存获取总线上的输入/输出数据的所有权。

3.

发明授权
Techniques for cache injection in a processor system responsive to a specific instruction sequence 失效
标题翻译：用于响应特定指令序列的处理器系统中缓存注入的技术

公开(公告)号：US08443146B2

公开(公告)日：2013-05-14

申请号：US12212961

申请日：2008-09-18

申请人： Lakshminarayana Baba Arimilli , Ravi K. Arimilli , Balaram Sinharoy

发明人： Lakshminarayana Baba Arimilli , Ravi K. Arimilli , Balaram Sinharoy

IPC分类号： G06F12/00 , G06F13/00 , G06F13/28

CPC分类号： G06F12/0862 , G06F12/0831 , G06F12/0897 , G06F2212/6024

摘要： A technique for performing cache injection includes monitoring an instruction stream for a specific instruction sequence. Addresses on a bus are then monitored, at a cache, in response to detecting the specific instruction sequence a determined number of times. Ownership of input/output data on the bus is then acquired by the cache when an address on the bus (that is associated with the input/output data) corresponds to an address of a data block stored in the cache.

摘要翻译： 用于执行高速缓存注入的技术包括监视特定指令序列的指令流。响应于检测到特定指令序列确定的次数，在总线上的地址被监视在高速缓存。当总线上的地址（与输入/输出数据相关联）对应于存储在高速缓存中的数据块的地址时，总线上的输入/输出数据的所有权由高速缓存获取。

4.

发明授权
Thread partitioning in a multi-core environment 有权
标题翻译：多核环境中的线程分区

公开(公告)号：US08707016B2

公开(公告)日：2014-04-22

申请号：US12024211

申请日：2008-02-01

申请人： Ravi K. Arimilli , Juan C. Rubio , Balaram Sinharoy

发明人： Ravi K. Arimilli , Juan C. Rubio , Balaram Sinharoy

IPC分类号： G06F9/30

CPC分类号： G06F9/4843 , G06F9/3851

摘要： A set of helper thread binaries is created to retrieve data used by a set of main thread binaries. The set of helper thread binaries and the set of main thread binaries are partitioned according to common instruction boundaries. As a first partition in the set of main thread binaries executes within a first core, a second partition in the set of helper thread binaries executes within a second core, thus “warming up” the cache in the second core. When the first partition of the main completes execution, a second partition of the main core moves to the second core, and executes using the warmed up cache in the second core.

摘要翻译： 创建一组辅助线程二进制文件来检索一组主线程二进制文件使用的数据。辅助线程二进制文件集和主线程二进制文件集合根据公共指令边界进行分区。作为主线程二进制文件集合中的第一分区在第一核心内执行，该辅助线程二进制文件集中的第二分区在第二核心内执行，从而“预热”第二核心中的高速缓存。当主要的第一分区完成执行时，主核心的第二分区移动到第二核心，并使用第二核心中的预热高速缓存执行。

5.

发明授权
Speculative popcount data creation 有权
标题翻译：投机性的popcount数据创建

公开(公告)号：US08387065B2

公开(公告)日：2013-02-26

申请号：US12425343

申请日：2009-04-16

申请人： Ravi K. Arimilli , Ronald N. Kalla , Balaram Sinharoy

发明人： Ravi K. Arimilli , Ronald N. Kalla , Balaram Sinharoy

IPC分类号： G06F9/46 , G06F9/45 , G06F9/30 , G06F9/40

CPC分类号： G06F9/3001 , G06F9/30018 , G06F9/3842

摘要： A method and a data processing system by which population count (popcount) operations are efficiently performed without incurring the latency and loss of critical processing cycles and bandwidth of real time processing. The method comprises: identifying data to be stored to memory for which a popcount may need to be determined; speculatively performing a popcount operation on the data as a background process of the processor while the data is being stored to memory; storing the data to a first memory location; and storing a value of the popcount generated by the popcount operation within a second memory location. The method further comprises: determining a size of data; determining a granular level at which the popcount operation on the data will be performed; and reserving a size of said second memory location that is sufficiently large to hold the value of the popcount.

摘要翻译： 一种方法和数据处理系统，通过该方法和数据处理系统有效地执行人口计数（popcount）操作，而不会导致关键处理周期的延迟和丢失以及实时处理的带宽。该方法包括：识别要存储到可能需要确定一个弹出窗口的存储器的数据; 在将数据存储到存储器中的情况下，作为处理器的后台处理推测性地对数据进行弹出数据操作; 将数据存储到第一存储器位置; 以及将由所述popcount操作生成的所述popcount的值存储在第二存储器位置内。该方法还包括：确定数据的大小; 确定将执行对数据的弹出数据操作的粒度级别; 以及保留所述第二存储器位置的大小足够大以保持所述用户名的值。

6.

发明授权
Helper thread for pre-fetching data 失效
标题翻译：辅助线程用于预取数据

公开(公告)号：US08359589B2

公开(公告)日：2013-01-22

申请号：US12024191

申请日：2008-02-01

申请人： Ravi K. Arimilli , Juan C. Rubio , Balaram Sinharoy

发明人： Ravi K. Arimilli , Juan C. Rubio , Balaram Sinharoy

IPC分类号： G06F9/44 , G06F9/45 , G06F15/167 , G06F9/30 , G06F9/46

CPC分类号： G06F8/41 , G06F9/383 , G06F9/3851

摘要： A set of helper thread binaries is created to retrieve data used by a set of main thread binaries. If executing a portion of the set of helper thread binaries results in the retrieval of data needed by the set of main thread binaries, then that retrieved data is utilized by the set of main thread binaries.

摘要翻译： 创建一组辅助线程二进制文件来检索一组主线程二进制文件使用的数据。如果执行一组辅助线程二进制文件的一部分导致检索主线程二进制文件集所需的数据，那么该检索的数据由主线程二进制文件集合使用。

7.

发明授权
Block driven computation with an address generation accelerator 失效
标题翻译：使用地址生成加速器进行块驱动计算

公开(公告)号：US08285971B2

公开(公告)日：2012-10-09

申请号：US12336315

申请日：2008-12-16

申请人： Ravi K. Arimilli , Balaram Sinharoy

发明人： Ravi K. Arimilli , Balaram Sinharoy

IPC分类号： G06F12/00

CPC分类号： G06F9/383 , G06F9/30094 , G06F9/345

摘要： A processor includes at least one execution unit that executes instructions, at least one register file, coupled to the at least one execution unit, that buffers operands for access by the at least one execution unit, an instruction sequencing unit that fetches instructions for execution by the at least one execution unit, and an address generation accelerator. The address generation accelerator, responsive to an initiation signal received from the instruction sequencing unit, computes and outputs first and second effective addresses of operands of an operation.

摘要翻译： 处理器包括执行指令的至少一个执行单元，耦合到所述至少一个执行单元的至少一个寄存器文件，其缓冲由所述至少一个执行单元访问的操作数，指令排序单元，其通过所述至少一个执行单元和地址生成加速器。地址产生加速器响应于从指令排序单元接收的发起信号，计算并输出操作的操作数的第一和第二有效地址。

8.

发明授权
Varying an amount of data retrieved from memory based upon an instruction hint 失效
标题翻译：根据指令提示改变从存储器检索的数据量

公开(公告)号：US08266381B2

公开(公告)日：2012-09-11

申请号：US12024170

申请日：2008-02-01

申请人： Ravi K. Arimilli , Gheorghe C. Cascaval , Balaram Sinharoy , William E. Speight , Lixin Zhang

发明人： Ravi K. Arimilli , Gheorghe C. Cascaval , Balaram Sinharoy , William E. Speight , Lixin Zhang

IPC分类号： G06F12/08

CPC分类号： G06F12/0862 , G06F12/0822 , G06F2212/507 , G06F2212/6028

摘要： In at least one embodiment, a processor detects during execution of program code whether a load instruction within the program code is associated with a hint. In response to detecting that the load instruction is not associated with a hint, the processor retrieves a full cache line of data from the memory hierarchy into the processor in response to the load instruction. In response to detecting that the load instruction is associated with a hint, a processor retrieves a partial cache line of data into the processor from the memory hierarchy in response to the load instruction.

摘要翻译： 在至少一个实施例中，处理器在执行程序代码期间检测程序代码内的加载指令是否与提示相关联。响应于检测到加载指令不与提示相关联，处理器响应于加载指令从存储器层次结构检索完整的高速缓存行数据到处理器。响应于检测到加载指令与提示相关联，处理器响应于加载指令从存储器层次结构检索数据的部分高速缓存行到处理器中。

9.

发明授权
Remote asynchronous data mover 失效
标题翻译：远程异步数据移动器

公开(公告)号：US07996564B2

公开(公告)日：2011-08-09

申请号：US12425093

申请日：2009-04-16

申请人： Lakshminarayana B. Arimilli , Ravi K. Arimilli , Ronald N. Kalla , Ramakrishnan Rajamony , Balaram Sinharoy , William E. Speight , William J. Starke

发明人： Lakshminarayana B. Arimilli , Ravi K. Arimilli , Ronald N. Kalla , Ramakrishnan Rajamony , Balaram Sinharoy , William E. Speight , William J. Starke

IPC分类号： G06F12/00

CPC分类号： G06F9/54 , G06F12/10 , G06F12/1081

摘要： A distributed data processing system executes multiple tasks within a parallel job, including a first local task on a local node and at least one task executing on a remote node, with a remote memory having real address (RA) locations mapped to one or more of the source effective addresses (EA) and destination EA of a data move operation initiated by a task executing on the local node. On initiation of the data move operation, remote asynchronous data move (RADM) logic identifies that the operation moves data to/from a first EA that is memory mapped to an RA of the remote memory. The local processor/RADM logic initiates a RADM operation that moves a copy of the data directly from/to the first remote memory by completing the RADM operation using the network interface cards (NICs) of the source and destination processing nodes, determined by accessing a data center for the node IDs of remote memory.

摘要翻译： 分布式数据处理系统在并行作业中执行多个任务，包括本地节点上的第一本地任务和在远程节点上执行的至少一个任务，具有映射到以下的一个或多个的实地址（RA）位置的远程存储器由本地节点上执行的任务启动的数据移动操作的源有效地址（EA）和目标EA。在启动数据移动操作时，远程异步数据移动（RADM）逻辑识别该操作将数据移动到/从第一个EA，该第一个EA是映射到远程存储器的RA的存储器。本地处理器/ RADM逻辑启动RADM操作，其通过使用源和目的地处理节点的网络接口卡（NIC）完成RADM操作，直接从/向第一远程存储器移动数据的副本，其通过访问数据中心为远程存储器的节点ID。

10.

发明授权
Method for enabling direct prefetching of data during asychronous memory move operation 失效
标题翻译：用于在异步存储器移动操作期间直接预取数据的方法

公开(公告)号：US07921275B2

公开(公告)日：2011-04-05

申请号：US12024598

申请日：2008-02-01

申请人： Ravi K. Arimilli , Robert S. Blackmore , Chulho Kim , Balaram Sinharoy , Hanhong Xue

发明人： Ravi K. Arimilli , Robert S. Blackmore , Chulho Kim , Balaram Sinharoy , Hanhong Xue

IPC分类号： G06F12/00

CPC分类号： G06F12/0862 , G06F9/30032 , G06F9/3004 , G06F9/30043 , G06F9/30076 , G06F9/30087 , G06F9/3017 , G06F9/30185 , G06F9/3834 , G06F9/3877

摘要： While an asynchronous memory move (AMM) operation is ongoing, a prefetch request for data from the source effective address or the destination effective address triggers cache injection by the AMM mover of relevant data from the stream of data being moved in the physical memory. The memory controller forwards the first prefetched line to the prefetch engine and L1 cache, the next cache lines in the sequence of data to the L2 cache, and a subsequent set of cache lines to the L3 cache. The memory controller then forwards the remaining data to the destination memory location. Quick access to prefetch data is enabled by buffering the stream of data in the upper caches rather than placing all the moved data within the memory. Also, the memory controller places moved data into only a subset of the available cache lines of the upper level cache.

摘要翻译： 当异步存储器移动（AMM）操作正在进行时，来自源有效地址或目的地有效地址的数据的预取请求触发AMM移动器对来自物理存储器中移动的数据流的相关数据的高速缓存注入。存储器控制器将第一预取行转发到预取引擎和L1高速缓存，将数据序列中的下一个高速缓存行转发到L2高速缓存，以及将后续的一组高速缓存行转发到L3高速缓存。存储器控制器然后将剩余的数据转发到目的地存储器位置。通过缓存高速缓存中的数据流，而不是将所有移动的数据放在内存中，可以快速访问预取数据。此外，存储器控制器将移动的数据仅放置在高级缓存的可用高速缓存行的子集中。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类