Autonomic Hotspot Profiling Using Paired Performance Sampling
    1.
    Patent application
    Autonomic Hotspot Profiling Using Paired Performance Sampling (status: in force)

    Publication number: US20140059334A1

    Publication date: 2014-02-27

    Application number: US14067212

    Filing date: 2013-10-30

    IPC classification: G06F9/38

    Abstract: A processor performance profiler is enabled to identify specific instructions causing performance issues within a program being executed by a microprocessor, using random sampling to find the worst-case offenders of a particular event type such as a cache miss or a branch mis-prediction. Tracking all instructions causing a particular event generates large data logs, creates performance penalties, and makes code analysis more difficult. However, identifying and tracking the worst offenders within a random sample of events, without having to hash all events, results in smaller memory requirements for the performance profiler, lower performance impact while profiling, and decreased complexity in analyzing the program to identify major performance issues, which, in turn, enables better optimization of the program in shorter developer time.

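The abstract's central idea, tracking only the worst offenders found in a random sample of events instead of hashing every event, can be illustrated with a minimal sketch. Everything in it is hypothetical: the function name `sample_hotspots`, the parameters `sample_rate` and `table_size`, and the evict-the-weakest policy are assumptions chosen for illustration, not the mechanism claimed in the patent.

```python
import random
from collections import Counter

def sample_hotspots(events, sample_rate=0.05, table_size=4, seed=0):
    """Return (address, count) pairs for the most frequent sampled offenders.

    events: iterable of instruction addresses that triggered an event
            such as a cache miss (hypothetical input format).
    """
    rng = random.Random(seed)
    table = Counter()  # small candidate table, at most table_size entries
    for addr in events:
        if rng.random() >= sample_rate:
            continue  # unsampled events are never hashed or stored
        if addr in table or len(table) < table_size:
            table[addr] += 1
        else:
            # Table full: evict the weakest candidate to make room.
            weakest = min(table, key=table.get)
            del table[weakest]
            table[addr] = 1
    return table.most_common()
```

Because the table never grows beyond `table_size` entries, memory use stays bounded no matter how many distinct instructions raise the event, which is the trade-off the abstract describes.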

    Hardware assist thread for dynamic performance profiling
    2.
    Granted patent
    Hardware assist thread for dynamic performance profiling (status: lapsed)

    Publication number: US08612730B2

    Publication date: 2013-12-17

    Application number: US12796124

    Filing date: 2010-06-08

    IPC classification: G06F9/00

    Abstract: A method and data processing system for managing running of instructions in a program. A processor of the data processing system receives a monitoring instruction of a monitoring unit. The processor determines if at least one secondary thread of a set of secondary threads is available for use as an assist thread. The processor selects the at least one secondary thread from the set of secondary threads to become the assist thread in response to a determination that the at least one secondary thread of the set of secondary threads is available for use as an assist thread. The processor changes profiling of running of instructions in the program from the main thread to the assist thread.

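The selection step in this abstract (check whether a secondary thread is available, pick it as the assist thread, then move profiling off the main thread) can be sketched as follows. The `HwThread` type, its fields, and the fallback of keeping profiling on the main thread when no secondary thread is free are assumptions made for illustration; the abstract does not specify them.

```python
from dataclasses import dataclass
from typing import List, Optional

@dataclass
class HwThread:
    name: str
    busy: bool = False       # True if the thread is running other work
    profiling: bool = False  # True if this thread currently runs the profiler

def select_assist_thread(secondary: List[HwThread]) -> Optional[HwThread]:
    """Return the first idle secondary thread, or None if all are busy."""
    for thread in secondary:
        if not thread.busy:
            return thread
    return None

def on_monitoring_instruction(main: HwThread, secondary: List[HwThread]) -> HwThread:
    """Move profiling from the main thread to an assist thread if one is free."""
    assist = select_assist_thread(secondary)
    if assist is None:
        main.profiling = True  # assumed fallback: keep profiling on main
        return main
    assist.busy = True
    assist.profiling = True
    main.profiling = False
    return assist
```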

    Autonomic Hotspot Profiling Using Paired Performance Sampling
    3.
    Patent application
    Autonomic Hotspot Profiling Using Paired Performance Sampling (status: lapsed)

    Publication number: US20120124560A1

    Publication date: 2012-05-17

    Application number: US12946959

    Filing date: 2010-11-16

    IPC classification: G06F9/44

    Abstract: A processor performance profiler is enabled to identify specific instructions causing performance issues within a program being executed by a microprocessor, using random sampling to find the worst-case offenders of a particular event type such as a cache miss or a branch mis-prediction. Tracking all instructions causing a particular event generates large data logs, creates performance penalties, and makes code analysis more difficult. However, identifying and tracking the worst offenders within a random sample of events, without having to hash all events, results in smaller memory requirements for the performance profiler, lower performance impact while profiling, and decreased complexity in analyzing the program to identify major performance issues, which, in turn, enables better optimization of the program in shorter developer time.


    Method and apparatus for instruction completion stall identification in an information handling system
    4.
    Granted patent
    Method and apparatus for instruction completion stall identification in an information handling system (status: in force)

    Publication number: US08832416B2

    Publication date: 2014-09-09

    Application number: US11753005

    Filing date: 2007-05-24

    IPC classification: G06F11/34

    Abstract: An information handling system includes a processor that executes multiple instructions or instruction threads within a software application program. The information handling system includes operating system software that manages processor system hardware and software in a multi-tasking environment. In one embodiment, the operating system manages instruction completion stall analysis software to determine the cause or causes of instruction stalls. In another embodiment, the stall analysis software cooperates with the operating system software to store instruction completion stall event data on a per instruction basis while the application program executes. The operating system software may cooperate with the stall analysis software to store instruction completion stall data in memory for later manipulation by system users or other software.

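Storing completion stall event data "on a per instruction basis" for later analysis can be pictured with a small in-memory log. This is only a sketch under stated assumptions: the `StallLog` class, the cause labels, and the cycle counts are hypothetical and do not come from the patent.

```python
from collections import Counter, defaultdict

class StallLog:
    """Accumulates completion-stall cycles per instruction address and cause."""

    def __init__(self):
        self._by_instr = defaultdict(Counter)

    def record(self, address, cause, cycles):
        """Add `cycles` stall cycles with the given cause to one instruction."""
        self._by_instr[address][cause] += cycles

    def dominant_cause(self, address):
        """Return (cause, cycles) for the largest stall source, or None."""
        causes = self._by_instr.get(address)
        if not causes:
            return None
        return causes.most_common(1)[0]

    def worst_instructions(self, n=3):
        """Return up to n (address, total_cycles) pairs, worst first."""
        totals = {a: sum(c.values()) for a, c in self._by_instr.items()}
        return sorted(totals.items(), key=lambda kv: kv[1], reverse=True)[:n]
```

A later analysis pass could call `worst_instructions()` to rank instructions by total stall cycles and `dominant_cause()` to see why each one stalled.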

    Autonomic hotspot profiling using paired performance sampling
    5.
    Granted patent
    Autonomic hotspot profiling using paired performance sampling (status: lapsed)

    Publication number: US08615742B2

    Publication date: 2013-12-24

    Application number: US12946959

    Filing date: 2010-11-16

    IPC classification: G06F9/44 G06F9/45

    Abstract: A processor performance profiler is enabled to identify specific instructions causing performance issues within a program being executed by a microprocessor, using random sampling to find the worst-case offenders of a particular event type such as a cache miss or a branch mis-prediction. Tracking all instructions causing a particular event generates large data logs, creates performance penalties, and makes code analysis more difficult. However, identifying and tracking the worst offenders within a random sample of events, without having to hash all events, results in smaller memory requirements for the performance profiler, lower performance impact while profiling, and decreased complexity in analyzing the program to identify major performance issues, which, in turn, enables better optimization of the program in shorter developer time.


    Quantifying Completion Stalls Using Instruction Sampling
    7.
    Patent application
    Quantifying Completion Stalls Using Instruction Sampling (status: lapsed)

    Publication number: US20090259830A1

    Publication date: 2009-10-15

    Application number: US12099944

    Filing date: 2008-04-09

    IPC classification: G06F9/30

    Abstract: A method, computer program product, and data processing system for collecting metrics regarding completion stalls in an out-of-order superscalar processor with branch prediction is disclosed. A preferred embodiment of the present invention selectively samples particular instructions (or classes of instructions). Each selected instruction, as it passes through the processor datapath, is marked (tagged) for monitoring by a performance monitoring unit. The progress of marked instructions is monitored by the performance monitoring unit, and various stall counters are triggered by the progress of the marked instructions and the instruction groups they form a part of. The stall counters count cycles to give an indication of when certain delays associated with particular instructions occur and how serious the delays are.

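The marking-and-counting scheme in this abstract (sample some instructions, tag them, and let stall counters count cycles only for the tagged ones) can be approximated in software. The sketch below is a simplified model, not the hardware design: the trace format, the every-Nth marking rule, and the name `count_marked_stalls` are assumptions for illustration.

```python
from collections import Counter

def count_marked_stalls(trace, sample_every=4):
    """Mark every `sample_every`-th instruction and count its stall cycles.

    trace: list of (instr_id, cycles), where cycles is a list holding one
           entry per cycle: a stall reason string, or None if not stalled.
    Returns (marked_ids, counters), with counters keyed by stall reason.
    """
    counters = Counter()
    marked = []
    for index, (instr_id, cycles) in enumerate(trace):
        if index % sample_every != 0:
            continue  # unmarked instructions are not monitored
        marked.append(instr_id)
        for reason in cycles:
            if reason is not None:
                counters[reason] += 1  # one stall cycle for this reason
    return marked, counters
```

The per-reason totals indicate both which delays occur for the sampled instructions and how many cycles each one costs.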

    Processor core having a saturating event counter for making performance measurements

    Publication number: US10169187B2

    Publication date: 2019-01-01

    Application number: US12858497

    Filing date: 2010-08-18

    IPC classification: G06F11/34

    Abstract: A performance monitor including a saturating counter provides a relative measure of event frequency without requiring a minimum polling rate or periodic reset to avoid or account for counter overflow. The saturating counter is incremented upon detection of an event and decremented if an event is not detected within a predetermined period. The period of detecting may be programmable and may be determined by real time clock, processor or instruction cycles. Multiple event types may be selected for detection and input to a single counter, or alternatively multiple event counters may be provided for various event types. The saturating counter may additionally be periodically reset in a selected operating mode, in combination with the decrementing action performed on the counter.
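The counter behavior described in this abstract (increment on an event, decrement when no event arrives within a set period, never overflow or underflow) can be modeled in a few lines. This is a software sketch under assumptions: the class name, the default `max_value` and `period`, and the one-call-per-cycle `tick` interface are hypothetical, and the period is measured here in abstract cycles.

```python
class SaturatingEventCounter:
    """Software model of an up/down counter that saturates at both ends."""

    def __init__(self, max_value=15, period=4):
        self.max_value = max_value
        self.period = period  # cycles without an event before a decrement
        self.value = 0
        self._idle = 0        # cycles since the last event or decrement

    def tick(self, event):
        """Advance one cycle; `event` is True if the monitored event occurred."""
        if event:
            self.value = min(self.value + 1, self.max_value)  # saturate high
            self._idle = 0
        else:
            self._idle += 1
            if self._idle >= self.period:
                self.value = max(self.value - 1, 0)  # saturate low
                self._idle = 0
        return self.value
```

Because the value drifts back toward zero when events stop, a reader can poll it at any rate and still get a relative measure of recent event frequency, with no overflow handling.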

    PROCESSOR CORE HAVING A SATURATING EVENT COUNTER FOR MAKING PERFORMANCE MEASUREMENTS
    9.
    Patent application
    PROCESSOR CORE HAVING A SATURATING EVENT COUNTER FOR MAKING PERFORMANCE MEASUREMENTS (status: under examination, published)

    Publication number: US20120046912A1

    Publication date: 2012-02-23

    Application number: US12858497

    Filing date: 2010-08-18

    IPC classification: G06F15/00

    CPC classification: G06F11/348 G06F2201/88

    Abstract: A performance monitor including a saturating counter provides a relative measure of event frequency without requiring a minimum polling rate or periodic reset to avoid or account for counter overflow. The saturating counter is incremented upon detection of an event and decremented if an event is not detected within a predetermined period. The period of detecting may be programmable and may be determined by real time clock, processor or instruction cycles. Multiple event types may be selected for detection and input to a single counter, or alternatively multiple event counters may be provided for various event types. The saturating counter may additionally be periodically reset in a selected operating mode, in combination with the decrementing action performed on the counter.


    METHOD AND APPARATUS FOR INSTRUCTION COMPLETION STALL IDENTIFICATION IN AN INFORMATION HANDLING SYSTEM
    10.
    Patent application
    METHOD AND APPARATUS FOR INSTRUCTION COMPLETION STALL IDENTIFICATION IN AN INFORMATION HANDLING SYSTEM (status: in force)

    Publication number: US20080294881A1

    Publication date: 2008-11-27

    Application number: US11753005

    Filing date: 2007-05-24

    IPC classification: G06F11/34

    Abstract: An information handling system includes a processor that executes multiple instructions or instruction threads within a software application program. The information handling system includes operating system software that manages processor system hardware and software in a multi-tasking environment. In one embodiment, the operating system manages instruction completion stall analysis software to determine the cause or causes of instruction stalls. In another embodiment, the stall analysis software cooperates with the operating system software to store instruction completion stall event data on a per instruction basis while the application program executes. The operating system software may cooperate with the stall analysis software to store instruction completion stall data in memory for later manipulation by system users or other software.
