PROFILING APPLICATION PERFORMANCE ACCORDING TO DATA STRUCTURE
    1.
    发明申请
    PROFILING APPLICATION PERFORMANCE ACCORDING TO DATA STRUCTURE 失效
    根据数据结构分析应用性能

    公开(公告)号:US20100287536A1

    公开(公告)日:2010-11-11

    申请号:US12436894

    申请日:2009-05-07

    IPC分类号: G06F11/36 G06F9/44 G06F9/45

    摘要: During runtime of a binary program file, streams of instructions are executed and memory references, generated by instrumentation applied to given ones of the instructions that refer to memory locations, are collected. A transformation is performed, based on the executed streams of instructions and the collected memory references, to obtain a table. The table lists memory events of interest for active data structures for each function in the program file. The transformation is performed to translate memory addresses for given ones of the instructions and given ones of the data structures into locations and variable names in a source file corresponding to the binary file. At least the memory events of interest are displayed, and the display is organized so as to correlate the memory events of interest with corresponding ones of the data structures.

    摘要翻译: 在二进制程序文件的运行期间,执行指令流,并且收集通过应用于指向存储器位置的给定指令的仪器产生的存储器引用。 基于所执行的指令流和所收集的存储器引用执行变换以获得表。 该表列出了程序文件中每个功能的活动数据结构感兴趣的内存事件。 执行转换以将给定的指令的内存地址转换为与二进制文件相对应的源文件中的数据结构中的位置和变量名。 至少显示感兴趣的存储器事件,并且显示被组织以使感兴趣的存储器事件与相应的数据结构相关联。

    Profiling application performance according to data structure
    2.
    发明授权
    Profiling application performance according to data structure 失效
    根据数据结构分析应用性能

    公开(公告)号:US08490061B2

    公开(公告)日:2013-07-16

    申请号:US12436894

    申请日:2009-05-07

    摘要: During runtime of a binary program file, streams of instructions are executed and memory references, generated by instrumentation applied to given ones of the instructions that refer to memory locations, are collected. A transformation is performed, based on the executed streams of instructions and the collected memory references, to obtain a table. The table lists memory events of interest for active data structures for each function in the program file. The transformation is performed to translate memory addresses for given ones of the instructions and given ones of the data structures into locations and variable names in a source file corresponding to the binary file. At least the memory events of interest are displayed, and the display is organized so as to correlate the memory events of interest with corresponding ones of the data structures.

    摘要翻译: 在二进制程序文件的运行期间,执行指令流,并且收集通过应用于指向存储器位置的给定指令的仪器产生的存储器引用。 基于所执行的指令流和所收集的存储器引用执行变换以获得表。 该表列出了程序文件中每个功能的活动数据结构感兴趣的内存事件。 执行转换以将给定的指令的内存地址转换为与二进制文件相对应的源文件中的数据结构中的位置和变量名。 至少显示感兴趣的存储器事件,并且显示被组织以使感兴趣的存储器事件与相应的数据结构相关联。

    BINARY PROGRAMMABLE METHOD FOR APPLICATION PERFORMANCE DATA COLLECTION
    3.
    发明申请
    BINARY PROGRAMMABLE METHOD FOR APPLICATION PERFORMANCE DATA COLLECTION 有权
    应用性能数据收集的二进制可编程方法

    公开(公告)号:US20090150874A1

    公开(公告)日:2009-06-11

    申请号:US11952922

    申请日:2007-12-07

    IPC分类号: G06F9/45 G06F9/44

    摘要: A method for application performance data collection includes steps or acts of: customizing a performance tool for collecting application performance data of an application; modifying the application by inserting the performance tool while the application does not need to be rebuilt from the source; executing the application; and collecting the application execution performance data such that only interesting data is collected. Customizing the performance tool proceeds by implementing at least one configurable tracing function that can be programmed by the user; compiling the function(s) into an object file; and inserting the object file into the performance tool using binary instrumentation.

    摘要翻译: 应用性能数据收集的方法包括以下步骤或动作:定制用于收集应用的应用性能数据的性能工具; 通过插入性能工具来修改应用程序,同时不需要从源重建应用程序; 执行应用程序; 并收集应用执行性能数据,使得只收集有趣的数据。 通过实现可由用户编程的至少一个可配置跟踪功能来自定义性能工具; 将函数编译成对象文件; 并使用二进制检测将目标文件插入性能工具。

    Binary programmable method for application performance data collection
    4.
    发明授权
    Binary programmable method for application performance data collection 有权
    用于应用性能数据采集的二进制可编程方法

    公开(公告)号:US08527959B2

    公开(公告)日:2013-09-03

    申请号:US11952922

    申请日:2007-12-07

    IPC分类号: G06F9/44

    摘要: A method for application performance data collection includes steps or acts of: customizing a performance tool for collecting application performance data of an application; modifying the application by inserting the performance tool while the application does not need to be rebuilt from the source; executing the application; and collecting the application execution performance data such that only interesting data is collected. Customizing the performance tool proceeds by implementing at least one configurable tracing function that can be programmed by the user; compiling the function(s) into an object file; and inserting the object file into the performance tool using binary instrumentation.

    摘要翻译: 应用性能数据收集的方法包括以下步骤或动作:定制用于收集应用的应用性能数据的性能工具; 通过插入性能工具来修改应用程序,同时不需要从源重建应用程序; 执行应用程序; 并收集应用执行性能数据,使得只收集有趣的数据。 通过实现可由用户编程的至少一个可配置跟踪功能来自定义性能工具; 将函数编译成对象文件; 并使用二进制检测将目标文件插入性能工具。

    Control flow based compression of execution traces
    5.
    发明申请
    Control flow based compression of execution traces 有权
    基于控制流的压缩执行跟踪

    公开(公告)号:US20050091643A1

    公开(公告)日:2005-04-28

    申请号:US10695139

    申请日:2003-10-28

    IPC分类号: G06F3/00 G06F9/44 G06F11/34

    摘要: A method and apparatus for creating a compressed trace for a program, wherein events are compressed separately to provide improved compression and tracing. A sequence of events for a program is selected, and a sequence of values is then determined for each of the selected events occurring during an execution of the program. Each sequence of values is then compressed to generate a compressed sequence of values for each event. These values are then ordered in accordance with information stored in selected events (such as for example, branch events), where the ordered values correspond to the trace.

    摘要翻译: 一种用于为程序创建压缩轨迹的方法和装置,其中分别压缩事件以提供改进的压缩和跟踪。 选择程序的事件序列,然后针对在执行程序期间发生的每个所选择的事件确定一系列值。 然后将每个值序列压缩以产生每个事件的压缩的值序列。 这些值然后根据存储在所选事件(例如分支事件)中的信息进行排序,其中有序值对应于跟踪。

    Control flow based compression of execution traces
    6.
    发明授权
    Control flow based compression of execution traces 有权
    基于控制流的压缩执行跟踪

    公开(公告)号:US07308681B2

    公开(公告)日:2007-12-11

    申请号:US10695139

    申请日:2003-10-28

    IPC分类号: G06F9/44

    摘要: A method and apparatus for creating a compressed trace for a program, wherein events are compressed separately to provide improved compression and tracing. A sequence of events for a program is selected, and a sequence of values is then determined for each of the selected events occurring during an execution of the program. Each sequence of values is then compressed to generate a compressed sequence of values for each event. These values are then ordered in accordance with information stored in selected events (such as for example, branch events), where the ordered values correspond to the trace.

    摘要翻译: 一种用于为程序创建压缩轨迹的方法和装置,其中分别压缩事件以提供改进的压缩和跟踪。 选择程序的事件序列,然后针对在执行程序期间发生的每个所选择的事件确定一系列值。 然后将每个值序列压缩以产生每个事件的压缩的值序列。 这些值然后根据存储在所选事件(例如分支事件)中的信息进行排序,其中有序值对应于跟踪。

    OPERATING A STACK OF INFORMATION IN AN INFORMATION HANDLING SYSTEM
    7.
    发明申请
    OPERATING A STACK OF INFORMATION IN AN INFORMATION HANDLING SYSTEM 有权
    在信息处理系统中操作信息堆栈

    公开(公告)号:US20110314259A1

    公开(公告)日:2011-12-22

    申请号:US12817609

    申请日:2010-06-17

    IPC分类号: G06F9/30

    摘要: A pointer is for pointing to a next-to-read location within a stack of information. For pushing information onto the stack: a value is saved of the pointer, which points to a first location within the stack as being the next-to-read location; the pointer is updated so that it points to a second location within the stack as being the next-to-read location; and the information is written for storage at the second location. For popping the information from the stack: in response to the pointer, the information is read from the second location as the next-to-read location; and the pointer is restored to equal the saved value so that it points to the first location as being the next-to-read location.

    摘要翻译: 一个指针用于指向一堆信息中的下一个读取位置。 将信息推送到堆栈中:保存指针的值,该指针指向堆栈内的第一个位置作为下一个读取位置; 指针被更新,使得它指向堆栈内的第二位置作为下一个读取位置; 并且将信息写入第二位置处的存储。 为了从堆栈弹出信息:响应于指针,从第二位置读取信息作为下一个读取位置; 并且指针被恢复为等于保存的值,使得其指向作为下一个读取位置的第一位置。

    Method of maintaining data coherency in a computer system having a
plurality of interconnected nodes
    8.
    发明授权
    Method of maintaining data coherency in a computer system having a plurality of interconnected nodes 失效
    在具有多个互连节点的计算机系统中维护数据一致性的方法

    公开(公告)号:US06085295A

    公开(公告)日:2000-07-04

    申请号:US954496

    申请日:1997-10-20

    IPC分类号: G06F12/08 G06F12/16

    摘要: A method of providing coherent shared memory access among a plurality of shared memory multiprocessor nodes. For each line of data in each of the nodes, a list of those processors of the node that have copies of the line in their caches is maintained. If a memory command is issued from a processor of one node, and if the command is directed to a line of memory of another node, then the memory command is sent directly to an adapter of the one node. When the adapter receives the command, it forwards the command from the one adapter to another adapter of the other node. When the other adapter receives the command, the command is forwarded to the local memory of the other node. The list of processors is then updated in the local memory of the other node to include or exclude the other adapter depending on the command. If the memory command is issued from one of the processors of one of the nodes, and if the command is directed to a line of memory of the one node, then the command is sent directly to local memory. When the local memory receives the command and if the adapter of the node is in the list of processors for a line associated with the command and if the command is a write command, then the command is forwarded to the adapter of the one node. When the adapter receives the command, the command is forwarded to remote adapters in each of the remote nodes which have processors which have cache copies of the line. Finally, when the latter remote adapters receive the command, the command is forwarded to the processors having the cache copies of the line.

    摘要翻译: 一种在多个共享存储器多处理器节点之间提供一致的共享存储器访问的方法。 对于每个节点中的每一行数据,维护节点中具有其高速缓存中的行的副本的那些处理器的列表。 如果从一个节点的处理器发出存储器命令,并且如果命令被定向到另一个节点的存储器行,则存储器命令被直接发送到该一个节点的适配器。 当适配器接收到命令时,它将该命令从一个适配器转发到另一个节点的另一个适配器。 当另一个适配器接收到该命令时,该命令将转发到另一个节点的本地内存。 然后在另一个节点的本地存储器中更新处理器列表,以根据命令包括或排除另一个适配器。 如果从其中一个节点的一个处理器发出存储器命令,并且如果命令被定向到一个节点的存储器行,则该命令被直接发送到本地存储器。 当本地内存接收到该命令时,如果节点的适配器位于与该命令相关联的一行的处理器列表中,并且该命令是写入命令,则该命令将转发到该一个节点的适配器。 当适配器接收到该命令时,该命令将转发到具有具有该行的高速缓存副本的处理器的每个远程节点中的远程适配器。 最后,当后一个远程适配器接收到该命令时,该命令被转发到具有该行的缓存副本的处理器。

    Self-scheduling parallel computer system and method
    9.
    发明授权
    Self-scheduling parallel computer system and method 失效
    自调并行计算机系统及方法

    公开(公告)号:US5408658A

    公开(公告)日:1995-04-18

    申请号:US730365

    申请日:1991-07-15

    IPC分类号: G06F9/38 G06F9/45 G06F15/16

    摘要: An incremental method is described for distributing the instructions of an execution sequence among a plurality of processing elements for execution in parallel. The distribution is based upon anticipated availability times of the needed input values for each instruction as well as the anticipated availability times of each processing element for handling each instruction. A self-parallelizing computer system and method are also described for asynchronously processing the distributed instructions in two modes of execution on a set of processing elements which communicate with each other.

    摘要翻译: 描述了用于在多个处理元件之间并行执行执行序列的指令的分发方法。 该分配基于每个指令的所需输入值的预期可用时间以及用于处理每个指令的每个处理元件的预期可用时间。 还描述了一种自并行计算机系统和方法,用于在一组彼此通信的处理元件上以两种执行模式异步处理分布式指令。

    SEAMLESS INTERFACE FOR MULTI-THREADED CORE ACCELERATORS
    10.
    发明申请
    SEAMLESS INTERFACE FOR MULTI-THREADED CORE ACCELERATORS 有权
    多线程加速器的无缝接口

    公开(公告)号:US20120239904A1

    公开(公告)日:2012-09-20

    申请号:US13048214

    申请日:2011-03-15

    IPC分类号: G06F9/30 G06F12/10

    摘要: A method, system and computer program product are disclosed for interfacing between a multi-threaded processing core and an accelerator. In one embodiment, the method comprises copying from the processing core to the hardware accelerator memory address translations for each of multiple threads operating on the processing core, and simultaneously storing on the hardware accelerator one or more of the memory address translations for each of the threads. Whenever any one of the multiple threads operating on the processing core instructs the hardware accelerator to perform a specified operation, the hardware accelerator has stored thereon one or more of the memory address translations for the any one of the threads. This facilitates starting that specified operation without memory translation faults. In an embodiment, the copying includes, each time one of the memory address translations is updated on the processing core, copying the updated one of the memory address translations to the hardware accelerator.

    摘要翻译: 公开了用于在多线程处理核心和加速器之间进行接口的方法,系统和计算机程序产品。 在一个实施例中,该方法包括从处理核心复制到在处理核心上操作的多个线程中的每个线程的硬件加速器存储器地址转换,以及同时在硬件加速器上存储每个线程的一个或多个存储器地址转换 。 只要在处理核心上操作的多个线程中的任何一个指示硬件加速器执行指定的操作,则硬件加速器在其上存储有针对任何一个线程的一个或多个存储器地址转换。 这有助于启动指定的操作而不会出现内存转换错误 在一个实施例中,复制包括每次在处理核心上更新一个存储器地址转换时,将更新的一个存储器地址转换复制到硬件加速器。