Patent search: ap:("Venkat R. Indukuru" OR "Brian R. Konigsburg" OR "Alexander E. Mericas" OR "Benjamin W. Stolt") AND inv:"Venkat R. Indukuru" (page 1)

1.

Granted patent
Determining each stall reason for each stalled instruction within a group of instructions during a pipeline stall (status: lapsed)

Publication number: US08635436B2

Publication date: 2014-01-21

Application number: US13097284

Filing date: 2011-04-29

Applicants: Venkat R. Indukuru, Brian R. Konigsburg, Alexander E. Mericas, Benjamin W. Stolt

Inventors: Venkat R. Indukuru, Brian R. Konigsburg, Alexander E. Mericas, Benjamin W. Stolt

IPC classification: G06F11/30

CPC classification: G06F9/3867, G06F9/3853, G06F9/3855, G06F9/3857

Abstract: During a pipeline stall in an out-of-order processor, until a next-to-complete instruction group completes, a monitoring unit receives, from a completion unit of a processor, a next-to-finish indicator indicating the finish of an oldest previously unfinished instruction from among a plurality of instructions of a next-to-complete instruction group. The monitoring unit receives, from a plurality of functional units of the processor, a plurality of finish reports including completion reasons for a plurality of separate instructions. The monitoring unit determines at least one stall reason from among multiple stall reasons for the oldest instruction from a selection of completion reasons from a selection of finish reports aligned with the next-to-finish indicator from among the plurality of finish reports. Once the monitoring unit receives a complete indicator from the completion unit, indicating the completion of the next-to-complete instruction group, the monitoring unit stores each determined stall reason aligned with each next-to-finish indicator in memory.

2.

Patent application
DELAY IDENTIFICATION IN DATA PROCESSING SYSTEMS (status: in force)

Publication number: US20130151816A1

Publication date: 2013-06-13

Application number: US13314052

Filing date: 2011-12-07

Applicants: Venkat R. Indukuru, Alexander E. Mericas

Inventors: Venkat R. Indukuru, Alexander E. Mericas

IPC classification: G06F9/30, G06F9/312

CPC classification: G06F9/3853, G06F9/3836, G06F9/3857

Abstract: Methods, systems, and computer program products may provide delay-identification in data processing systems. An apparatus may include a delay-identification unit having a delay counter, a threshold register, a delay register, and a delay detector. The delay detector may be configured to start the delay counter in response to detecting that one group of instructions is delayed, and stop the delay counter in response to detecting that the one group of instructions is no longer delayed. The delay detector may additionally be configured to compare the number of cycles counted by the delay counter with a threshold number of cycles in the threshold register, and store at least one effective address of one of the instructions of the one group of instructions when the number of cycles counted by the delay counter is greater than the threshold number of cycles stored in the threshold register.
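
Illustrative sketch (not taken from the patent): the counter-and-threshold behaviour described in this abstract can be modelled in a few lines of Python. The class name `DelayIdentificationUnit`, the per-cycle `tick` interface, and the choice to record the first effective address of the group are all assumptions made for illustration; the abstract only requires that at least one effective address be stored.

```python
from dataclasses import dataclass, field


@dataclass
class DelayIdentificationUnit:
    """Toy software model of a delay counter, threshold register and delay register."""
    threshold_cycles: int                                # threshold register
    delay_counter: int = 0                               # cycles counted in the current delay
    delay_register: list = field(default_factory=list)   # recorded effective addresses
    _counting: bool = False
    _group_addresses: tuple = ()

    def tick(self, delayed: bool, group_addresses=()):
        """Advance one cycle; `delayed` says whether the instruction group is stalled."""
        if delayed:
            if not self._counting:
                # Delay detected: start the counter and remember the group.
                self._counting = True
                self.delay_counter = 0
                self._group_addresses = tuple(group_addresses)
            self.delay_counter += 1
        elif self._counting:
            # Group no longer delayed: stop counting and compare with the threshold.
            self._counting = False
            if self.delay_counter > self.threshold_cycles and self._group_addresses:
                # Assumption: keep the first effective address of the group.
                self.delay_register.append(self._group_addresses[0])


unit = DelayIdentificationUnit(threshold_cycles=3)
for _ in range(5):                       # a 5-cycle delay exceeds the threshold
    unit.tick(True, (0x1000, 0x1004))
unit.tick(False)
for _ in range(2):                       # a 2-cycle delay does not
    unit.tick(True, (0x2000,))
unit.tick(False)
```

After this run, `unit.delay_register` holds only `0x1000`, because only the first delay lasted longer than the threshold.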

3.

Patent application
IDENTIFYING LOAD-HIT-STORE CONFLICTS (status: in force)

Publication number: US20140075158A1

Publication date: 2014-03-13

Application number: US13611006

Filing date: 2012-09-12

Applicants: Venkat R. Indukuru, Alexander E. Mericas, Satish K. Sadasivam, Madhavi G. Valluri

Inventors: Venkat R. Indukuru, Alexander E. Mericas, Satish K. Sadasivam, Madhavi G. Valluri

IPC classification: G06F9/312

CPC classification: G06F9/44552, G06F9/3834

Abstract: A computing device identifies a load instruction and store instruction pair that causes a load-hit-store conflict. A processor tags a first load instruction that instructs the processor to load a first data set from memory. The processor stores an address at which the first load instruction is located in memory in a special purpose register. The processor determines whether the first load instruction has a load-hit-store conflict with a first store instruction. If the processor determines the first load instruction has a load-hit-store conflict with the first store instruction, the processor stores an address at which the first data set is located in memory in a second special purpose register, tags the first data set being stored by the first store instruction, stores an address at which the first store instruction is located in memory in a third special purpose register, and increases a conflict counter.
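
Illustrative sketch (not taken from the patent): the bookkeeping in this abstract, three special purpose registers plus a conflict counter, might be modelled as below. The names `LhsTracker` and `observe_load`, and the representation of in-flight stores as a list of `(store_pc, data_addr)` pairs, are hypothetical. A conflict is assumed here to mean that a tagged load reads an address that a pending store is still writing; the tagging of the stored data itself is not modelled.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class LhsTracker:
    """Toy model of the three special purpose registers and the conflict counter."""
    load_instr_spr: Optional[int] = None    # address of the tagged load instruction
    data_addr_spr: Optional[int] = None     # address of the conflicting data
    store_instr_spr: Optional[int] = None   # address of the conflicting store instruction
    conflict_counter: int = 0

    def observe_load(self, load_pc, data_addr, pending_stores):
        """Record a tagged load; `pending_stores` lists (store_pc, data_addr), oldest first."""
        self.load_instr_spr = load_pc
        # Check the youngest pending store first.
        for store_pc, store_addr in reversed(pending_stores):
            if store_addr == data_addr:
                self.data_addr_spr = data_addr
                self.store_instr_spr = store_pc
                self.conflict_counter += 1
                return True
        return False


tracker = LhsTracker()
hit = tracker.observe_load(0x400, 0x9000, [(0x3F0, 0x8000), (0x3F8, 0x9000)])
```

Here `hit` is true and the tracker records the load at `0x400`, the data address `0x9000`, and the store at `0x3F8`, which is the instruction pair a profiling tool would want to report.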

4.

Patent application
HARDWARE ASSIST THREAD FOR DYNAMIC PERFORMANCE PROFILING (status: lapsed)

Publication number: US20110302395A1

Publication date: 2011-12-08

Application number: US12796124

Filing date: 2010-06-08

Applicants: Ronald P. Hall, Venkat R. Indukuru, Alexander E. Mericas, Balaram Sinharoy, Zhong L. Wang

Inventors: Ronald P. Hall, Venkat R. Indukuru, Alexander E. Mericas, Balaram Sinharoy, Zhong L. Wang

IPC classification: G06F9/30

CPC classification: G06F9/3851, G06F9/3009, G06F9/327, G06F11/3466, G06F2201/865, G06F2201/88

Abstract: A method and data processing system for managing running of instructions in a program. A processor of the data processing system receives a monitoring instruction of a monitoring unit. The processor determines if at least one secondary thread of a set of secondary threads is available for use as an assist thread. The processor selects the at least one secondary thread from the set of secondary threads to become the assist thread in response to a determination that the at least one secondary thread of the set of secondary threads is available for use as an assist thread. The processor changes profiling of running of instructions in the program from the main thread to the assist thread.

5.

Patent application
Method and Apparatus For Evaluating Integrated Circuit Design Performance Using Basic Block Vectors, Cycles Per Instruction (CPI) Information and Microarchitecture Dependent Information (status: in force)

Publication number: US20090276190A1

Publication date: 2009-11-05

Application number: US12112034

Filing date: 2008-04-30

Applicants: Robert H. Bell, Jr., Thomas W. Chen, Venkat R. Indukuru, Alexander E. Mericas, Pattabi M. Seshadri, Madhavi G. Valluri

Inventors: Robert H. Bell, Jr., Thomas W. Chen, Venkat R. Indukuru, Alexander E. Mericas, Pattabi M. Seshadri, Madhavi G. Valluri

IPC classification: G06F17/50

CPC classification: G06F17/5022, G01R31/318357, G01R31/318364

Abstract: A test system or simulator includes an integrated circuit (IC) benchmark software program that executes workload program software on a semiconductor die IC design model. The benchmark software program includes trace, simulation point, basic block vector (BBV) generation, cycles per instruction (CPI) error, clustering and other programs. The test system also includes CPI stack program software that generates CPI stack data that includes microarchitecture dependent information for each instruction interval of workload program software. The CPI stack data may also include an overall analysis of CPI data for the entire workload program. IC designers may utilize the benchmark software and CPI stack program to develop a reduced representative workload program that includes CPI data as well as microarchitecture dependent information.

6.

Granted patent
Floating-point event counters with automatic prescaling (status: in force)

Publication number: US08514999B2

Publication date: 2013-08-20

Application number: US13312715

Filing date: 2011-12-06

Applicants: Giles R. Frazier, Venkat R. Indukuru, Alexander E. Mericas, John F. Spannaus

Inventors: Giles R. Frazier, Venkat R. Indukuru, Alexander E. Mericas, John F. Spannaus

IPC classification: H03K23/00

CPC classification: G06F11/2284, G06F11/0724, G06F11/076, G06F11/3024, G06F11/3058, G06F11/3082, G06F11/3409, G06F11/348, G06F2201/835, G06F2201/86, G06F2201/88

Abstract: Occurrences of a particular event in an electronic device are counted by incrementing an event counter each time a variable number of the particular events have occurred, and automatically increasing that variable number as the total count increases. The variable number (prescale value) can increase geometrically according to a programmable counter base each time the count mantissa overflows. The event counter thereby provides hardware-implemented automatic prescaling while significantly reducing the number of interface bits required to support very large count ranges, and retaining high accuracy at very large event counts.
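
Illustrative sketch (not taken from the patent): one way to picture the prescaling scheme is a counter that stores a small mantissa and an exponent, with the prescale value equal to `base ** exponent`. The class `FloatingPointEventCounter`, the 4-bit mantissa, and the rule of dividing the mantissa by the base on overflow (which keeps the represented count unchanged when the base is 2) are assumptions chosen for illustration, not details confirmed by the abstract.

```python
class FloatingPointEventCounter:
    """Toy model: represented count = mantissa * base**exponent."""

    def __init__(self, mantissa_bits=4, base=2):
        self.mantissa_bits = mantissa_bits
        self.base = base
        self.mantissa = 0
        self.exponent = 0
        self._pending = 0  # events seen since the last mantissa increment

    @property
    def prescale(self):
        # Number of events needed for one mantissa increment.
        return self.base ** self.exponent

    def record_event(self):
        self._pending += 1
        if self._pending < self.prescale:
            return
        self._pending = 0
        self.mantissa += 1
        if self.mantissa == 1 << self.mantissa_bits:
            # Mantissa overflow: rescale so the represented count is preserved
            # (exact for base 2) and grow the prescale value geometrically.
            self.mantissa //= self.base
            self.exponent += 1

    def estimate(self):
        return self.mantissa * self.prescale


counter = FloatingPointEventCounter(mantissa_bits=4, base=2)
for _ in range(1000):
    counter.record_event()
```

With these settings the counter ends at mantissa 15 and exponent 6, so `counter.estimate()` is 960: the 1000 events are tracked with a handful of bits, and the error stays below the current prescale value of 64.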

7.

Patent application
TEMPORAL LOCALITY AWARE INSTRUCTION SAMPLING (status: under examination, published)

Publication number: US20140075164A1

Publication date: 2014-03-13

Application number: US13610958

Filing date: 2012-09-12

Applicants: Venkat R. Indukuru, Alexander E. Mericas

Inventors: Venkat R. Indukuru, Alexander E. Mericas

IPC classification: G06F9/30, G06F11/30

CPC classification: G06F11/3419, G06F11/3466, G06F2201/81, G06F2201/86, G06F2201/865, G06F2201/88

Abstract: A method and system are disclosed for sampling instructions executing on a computer processor. A computer processor determines a number of times a specified event has occurred within a specified temporal window. The computer processor determines to mark an instruction to be executed for monitoring based on the number of times the specified event has occurred within the temporal window, and in response, the computer processor marks the instruction.
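
Illustrative sketch (not taken from the patent): the abstract says only that the marking decision is based on how often the event occurred within the window. The version below assumes a sliding window measured in cycles and a simple "at least `min_events`" rule; the class `TemporalSampler` and its methods are hypothetical names.

```python
from collections import deque


class TemporalSampler:
    """Toy model: mark an instruction when enough events fall in the recent window."""

    def __init__(self, window_cycles, min_events):
        self.window_cycles = window_cycles
        self.min_events = min_events
        self._event_cycles = deque()  # cycle numbers of recent events, oldest first

    def record_event(self, cycle):
        self._event_cycles.append(cycle)

    def should_mark(self, cycle):
        # Drop events that have fallen out of the window ending at `cycle`.
        while self._event_cycles and self._event_cycles[0] <= cycle - self.window_cycles:
            self._event_cycles.popleft()
        # Assumed rule: mark when the in-window count reaches the threshold.
        return len(self._event_cycles) >= self.min_events


sampler = TemporalSampler(window_cycles=100, min_events=3)
for c in (10, 20, 30):
    sampler.record_event(c)
mark_early = sampler.should_mark(50)    # three events within the last 100 cycles
mark_late = sampler.should_mark(125)    # only the event at cycle 30 is still in the window
```

In this run `mark_early` is true and `mark_late` is false, so the instruction is marked only while the events are clustered in time.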

8.

Granted patent
Identifying load-hit-store conflicts (status: in force)

Publication number: US09229745B2

Publication date: 2016-01-05

Application number: US13611006

Filing date: 2012-09-12

Applicants: Venkat R. Indukuru, Alexander E. Mericas, Satish K. Sadasivam, Madhavi G. Valluri

Inventors: Venkat R. Indukuru, Alexander E. Mericas, Satish K. Sadasivam, Madhavi G. Valluri

IPC classification: G06F9/00, G06F9/445, G06F9/38

CPC classification: G06F9/44552, G06F9/3834

Abstract: A computing device identifies a load instruction and store instruction pair that causes a load-hit-store conflict. A processor tags a first load instruction that instructs the processor to load a first data set from memory. The processor stores an address at which the first load instruction is located in memory in a special purpose register. The processor determines whether the first load instruction has a load-hit-store conflict with a first store instruction. If the processor determines the first load instruction has a load-hit-store conflict with the first store instruction, the processor stores an address at which the first data set is located in memory in a second special purpose register, tags the first data set being stored by the first store instruction, stores an address at which the first store instruction is located in memory in a third special purpose register, and increases a conflict counter.

9.

Patent application
FLOATING-POINT EVENT COUNTERS WITH AUTOMATIC PRESCALING (status: in force)

Publication number: US20130142301A1

Publication date: 2013-06-06

Application number: US13312715

Filing date: 2011-12-06

Applicants: Giles R. Frazier, Venkat R. Indukuru, Alexander E. Mericas, John F. Spannaus

Inventors: Giles R. Frazier, Venkat R. Indukuru, Alexander E. Mericas, John F. Spannaus

IPC classification: H03K23/00

CPC classification: G06F11/2284, G06F11/0724, G06F11/076, G06F11/3024, G06F11/3058, G06F11/3082, G06F11/3409, G06F11/348, G06F2201/835, G06F2201/86, G06F2201/88

Abstract: Occurrences of a particular event in an electronic device are counted by incrementing an event counter each time a variable number of the particular events have occurred, and automatically increasing that variable number as the total count increases. The variable number (prescale value) can increase geometrically according to a programmable counter base each time the count mantissa overflows. The event counter thereby provides hardware-implemented automatic prescaling while significantly reducing the number of interface bits required to support very large count ranges, and retaining high accuracy at very large event counts.

10.

Granted patent
Workload performance projection for future information handling systems using microarchitecture dependent data (status: in force)

Publication number: US09135142B2

Publication date: 2015-09-15

Application number: US12343482

Filing date: 2008-12-24

Applicants: Robert H. Bell, Jr., Luigi Brochard, Donald Robert DeSota, Venkat R. Indukuru, Rajendra D. Panda, Sameh S. Sharkawi

Inventors: Robert H. Bell, Jr., Luigi Brochard, Donald Robert DeSota, Venkat R. Indukuru, Rajendra D. Panda, Sameh S. Sharkawi

IPC classification: G06F3/01, G01C7/00, G01C7/04, G06F3/00, G06F11/36, G06F11/34

CPC classification: G06F11/3612, G06F11/3428, G06F11/3457, G06F11/3466, G06F2201/88

Abstract: A performance projection system includes a test IHS and a currently existing IHS. The performance projection system includes surrogate programs and user application software. The test IHS employs a memory that includes a virtual future IHS, currently existing IHS, surrogate programs, and user application software for determination of runtime and HW counter performance data. The user application software and surrogate programs execute on the currently existing IHS to provide designers with runtime data and HW counter or microarchitecture dependent data. Designers execute surrogate programs on the future IHS to provide runtime and HW counter data. Designers normalize and weight the runtime and HW counter data to provide a representative surrogate program for comparison to user application software performance on the future IHS. Using a scaling factor, designers may generate a projection of runtime performance for the user application software executing on the future IHS.

