System and method for real time instruction tracing

    公开(公告)号:US10275242B2

    公开(公告)日:2019-04-30

    申请号:US13997016

    申请日:2012-03-30

    IPC分类号: G06F9/30 G06F11/34 G06F11/36

    摘要: An apparatus and method are described for real time instruction tracing. For example, a method according to one embodiment comprises: recording user specified address ranges for which tracing is required; monitoring a next linear instruction pointer (NLIP) values and/or branch linear instruction pointer (BLIP) values to determine if address range has been entered; when the range is entered, compressing the NLIP and/or BLIP values and constructing fixed length packets containing the tracing data; and transferring the fixed length packets to a memory execution cluster.

    Optional logging of debug activities in a real time instruction tracing log
    2.
    发明授权
    Optional logging of debug activities in a real time instruction tracing log 有权
    在实时指令跟踪日志中选择性地记录调试活动

    公开(公告)号:US09003375B2

    公开(公告)日:2015-04-07

    申请号:US13993477

    申请日:2011-12-30

    摘要: In accordance with embodiments disclosed herein, there are provided methods, systems, mechanisms, techniques, and apparatuses for implementing optional logging of debug activities in a real time instruction tracing log. For example, in one embodiment, such means may include an integrated circuit having means for initiating instruction tracing for instructions of a traced application, mode, or code region, as the instructions are executed by the integrated circuit; means for generating a plurality of packets to a debug log describing the instruction tracing; means for initiating an alternative mode of execution within the integrated circuit; and means for suppressing indication of entering the alternative mode of execution. Additional and alternative means may be implemented for selectively causing an integrated circuit to operate in accordance with an invisible trace mode or a visible trace mode upon transition to the alternative mode of execution.

    摘要翻译: 根据本文公开的实施例,提供了用于在实时指令跟踪日志中实现可选日志记录调试活动的方法,系统,机制,技术和装置。 例如,在一个实施例中,这种装置可以包括集成电路,该集成电路具有用于对由跟踪的应用,模式或代码区域的指令进行指令跟踪的装置,因为该指令由集成电路执行; 用于将多个分组生成到描述所述指令跟踪的调试日志的装置; 用于在集成电路内启动替代执行模式的装置; 以及用于抑制进入替代执行模式的指示的装置。 可以实现附加和替代手段,用于在转换到替代执行模式时,根据不可见的跟踪模式或可视跟踪模式选择性地使集成电路进行操作。

    Method and apparatus for efficient integer transform

    公开(公告)号:US08346838B2

    公开(公告)日:2013-01-01

    申请号:US12560225

    申请日:2009-09-15

    IPC分类号: G06F7/38

    摘要: A method and apparatus for including in a processor instructions for performing integer transforms including multiply-add operations and horizontal-add operations on packed data. In one embodiment, a processor is coupled to a memory that stores a first packed byte data and a second packed byte data. The processor performs operations on said first packed byte data and said second packed byte data to generate a third packed data in response to receiving a multiply-add instruction. A plurality of the 16-bit data elements in this third packed data storing the result of performing multiply-add operations on data elements in the first and second packed byte data. The processor adds together at least a first and a second 16-bit data element of the third packed data in response to receiving an horizontal-add instruction to generate a 16-bit result as one of a plurality of data elements of a fourth packed data.

    SIMD integer multiply high with round and shift
    4.
    发明授权
    SIMD integer multiply high with round and shift 失效
    SIMD整数乘以高乘法和移位

    公开(公告)号:US07689641B2

    公开(公告)日:2010-03-30

    申请号:US10610833

    申请日:2003-06-30

    IPC分类号: G06F7/52

    摘要: Method, apparatus, and program means for performing a packed multiply high with round and shift operation. The method of one embodiment comprises receiving a first operand having a first set of L data elements. A second operand having a second set of L data elements is received. L pairs of data elements are multiplied together to generate a set of L products. Each of the L pairs includes a first data element from the first set of L data element and a second data element from a corresponding data element position of the second set of L data elements. Each of the L products are rounded to generate L rounded values. Each of said L rounded values are scaled to generate L scaled values. Each of the L scaled values are truncated for storage at a destination. Each truncated value is to be stored at a data element position corresponding to its pair of data elements.

    摘要翻译: 方法,装置和程序装置,用于执行具有循环和换档操作的压缩倍增。 一个实施例的方法包括接收具有第一组L个数据元素的第一操作数。 接收具有第二组L个数据元素的第二操作数。 将L对数据元素相乘以生成一组L个乘积。 L对中的每一个包括来自第一组L数据元素的第一数据元素和来自第二组L个数据元素的相应数据元素位置的第二数据元素。 每个L产品都被舍入,以产生L个舍入值。 每个所述L舍入值被缩放以产生L个缩放值。 L缩放值中的每一个都被截断以存储在目的地。 每个截断的值被存储在与其对数据元素对应的数据元素位置处。

    Method and apparatus for efficient integer transform
    5.
    发明授权
    Method and apparatus for efficient integer transform 有权
    用于有效整数变换的方法和装置

    公开(公告)号:US07624138B2

    公开(公告)日:2009-11-24

    申请号:US10749738

    申请日:2003-12-30

    IPC分类号: G06F7/38

    摘要: A method and apparatus for including in a processor instructions for performing integer transforms including multiply-add operations and horizontal-add operations on packed data. In one embodiment, a processor is coupled to a memory that stores a first packed byte data and a second packed byte data. The processor performs operations on said first packed byte data and said second packed byte data to generate a third packed data in response to receiving a multiply-add instruction. A plurality of the 16-bit data elements in this third packed data storing the result of performing multiply-add operations on data elements in the first and second packed byte data. The processor adds together at least a first and a second 16-bit data element of the third packed data in response to receiving an horizontal-add instruction to generate a 16-bit result as one of a plurality of data elements of a fourth packed data.

    摘要翻译: 一种用于在处理器中包括用于执行整数变换的指令的方法和装置,包括对打包数据的乘法运算和水平加法运算。 在一个实施例中,处理器耦合到存储第一打包字节数据和第二打包字节数据的存储器。 处理器对所述第一打包字节数据和所述第二打包字节数据执行操作,以响应于接收到加法指令而产生第三打包数据。 该第三打包数据中的多个16位数据元素存储对第一和第二打包字节数据中的数据元素执行加法运算的结果。 处理器响应于接收到水平加法指令而将第三打包数据的至少第一和第二16位数据元素加在一起,以生成作为第四打包数据的多个数据元素之一的16位结果 。

    Method and Apparatus for Efficient Integer Transform
    8.
    发明申请
    Method and Apparatus for Efficient Integer Transform 有权
    高效整数变换的方法和装置

    公开(公告)号:US20100011042A1

    公开(公告)日:2010-01-14

    申请号:US12560225

    申请日:2009-09-15

    IPC分类号: G06F17/14 G06F7/44 G06F7/42

    摘要: A method and apparatus for including in a processor instructions for performing integer transforms including multiply-add operations and horizontal-add operations on packed data. In one embodiment, a processor is coupled to a memory that stores a first packed byte data and a second packed byte data. The processor performs operations on said first packed byte data and said second packed byte data to generate a third packed data in response to receiving a multiply-add instruction. A plurality of the 16-bit data elements in this third packed data storing the result of performing multiply-add operations on data elements in the first and second packed byte data. The processor adds together at least a first and a second 16-bit data element of the third packed data in response to receiving an horizontal-add instruction to generate a 16-bit result as one of a plurality of data elements of a fourth packed data.

    摘要翻译: 一种用于在处理器中包括用于执行整数变换的指令的方法和装置,包括对打包数据的乘法运算和水平加法运算。 在一个实施例中,处理器耦合到存储第一打包字节数据和第二打包字节数据的存储器。 处理器对所述第一打包字节数据和所述第二打包字节数据执行操作,以响应于接收到加法指令而产生第三打包数据。 该第三打包数据中的多个16位数据元素存储对第一和第二打包字节数据中的数据元素执行加法运算的结果。 处理器响应于接收到水平加法指令而将第三打包数据的至少第一和第二16位数据元素加在一起,以生成作为第四打包数据的多个数据元素之一的16位结果 。