Staggered execution stack for vector processing
    2.
    Patent application
    Staggered execution stack for vector processing (status: granted)

    Publication number: US20070079179A1

    Publication date: 2007-04-05

    Application number: US11240982

    Filing date: 2005-09-30

    IPC classification: G06F11/00

    Abstract: In one embodiment, the present invention includes a method for executing an operation on low order portions of first and second source operands using a first execution stack of a processor and executing the operation on high order portions of the first and second source operands using a second execution stack of the processor, where the operation in the second execution stack is staggered by one or more cycles from the operation in the first execution stack. Other embodiments are described and claimed.
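
The staggering described in this abstract can be pictured with a small software model. This is only an illustrative sketch, not the patented hardware: the function name `staggered_execute`, the labels `stack0` and `stack1`, and the assumption that index 0 holds the lowest-order element are all hypothetical.

```python
from typing import Callable, List, Tuple

Schedule = List[Tuple[int, str, List[int]]]


def staggered_execute(
    op: Callable[[int, int], int],
    src1: List[int],
    src2: List[int],
    stagger: int = 1,
    start_cycle: int = 0,
) -> Tuple[List[int], Schedule]:
    """Apply `op` to the low halves on one stack and the high halves on another.

    Assumption: element 0 is the lowest-order element of each operand.
    The second stack starts `stagger` cycles after the first.
    """
    if len(src1) != len(src2) or len(src1) % 2 != 0:
        raise ValueError("operands must have equal, even length")
    if stagger < 1:
        raise ValueError("stagger must be one or more cycles")

    half = len(src1) // 2
    # Low-order portions go to the first execution stack.
    low = [op(a, b) for a, b in zip(src1[:half], src2[:half])]
    # High-order portions go to the second stack, delayed by `stagger` cycles.
    high = [op(a, b) for a, b in zip(src1[half:], src2[half:])]

    schedule: Schedule = [
        (start_cycle, "stack0", low),
        (start_cycle + stagger, "stack1", high),
    ]
    return low + high, schedule
```

For a four-element add with the default one-cycle stagger, the model records the low half at cycle 0 and the high half at cycle 1.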

    Flow optimization and prediction for VSSE memory operations
    5.
    Patent application
    Flow optimization and prediction for VSSE memory operations (status: granted)

    Publication number: US20070143575A1

    Publication date: 2007-06-21

    Application number: US11315964

    Filing date: 2005-12-21

    IPC classification: G06F15/00

    Abstract: In one embodiment, a method for flow optimization and prediction for vector streaming single instruction, multiple data (SIMD) extension (VSSE) memory operations is disclosed. The method comprises generating an optimized micro-operation (μop) flow for an instruction to operate on a vector if the instruction is predicted to be unmasked and unit-stride, the instruction to access elements in memory, and accessing via the optimized μop flow two or more of the elements at the same time without determining masks of the two or more elements. Other embodiments are also described.
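
The choice between the two flows can be sketched as follows. This is a rough model under stated assumptions: the function `build_uop_flow` and the μop names `load_group`, `check_mask`, and `load_if_set` are hypothetical, and the abstract does not say how many elements the optimized flow accesses together, so `group` is a free parameter here.

```python
from typing import List


def build_uop_flow(
    num_elements: int,
    predicted_unmasked: bool,
    predicted_unit_stride: bool,
    group: int = 2,
) -> List[str]:
    """Return a list of hypothetical μop names for a vector memory access."""
    if num_elements < 0 or group < 2:
        raise ValueError("need num_elements >= 0 and group >= 2")

    flow: List[str] = []
    if predicted_unmasked and predicted_unit_stride:
        # Optimized flow: access several elements together, with no
        # per-element mask check.
        for start in range(0, num_elements, group):
            end = min(start + group, num_elements) - 1
            flow.append(f"load_group e{start}..e{end}")
    else:
        # Fallback flow: check each element's mask before accessing it.
        for i in range(num_elements):
            flow.append(f"check_mask e{i}")
            flow.append(f"load_if_set e{i}")
    return flow
```

With four elements and `group=2`, the optimized flow has two μops, while the per-element flow has eight.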

    Dynamic online optimizer
    9.
    Patent application
    Dynamic online optimizer (status: under examination, published)

    Publication number: US20050149912A1

    Publication date: 2005-07-07

    Application number: US10748284

    Filing date: 2003-12-29

    IPC classification: G06F9/38 G06F9/44

    CPC classification: G06F9/3808 G06F8/443

    Abstract: A system and method for optimizing a series of traces to be executed by a processing core is disclosed. The lines of a trace are sent to an optimizer each time they are sent to a processing core to be executed. Runtime information may be collected on a line of a trace each time that trace is executed by a processing core. The runtime information may be used by the optimizer to better optimize the micro-operations of the lines of the trace. The optimizer optimizes a trace each time the trace is executed to improve the efficiency of future iterations of the trace. Most of the optimizations result in a reduction of the number of μops within the trace. The optimizer may optimize two or more lines at a time in order to find more opportunities to remove μops and shorten the trace. The two lines may be alternately offset so that each line has the maximum allowed number of micro-operations.
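
One way to picture optimizing two lines together is a filter-and-repack pass. This is a simplified sketch, not the optimizer the application describes: the abstract does not name the individual optimizations, so the caller-supplied predicate `is_removable`, the function `optimize_line_pair`, and the string μops are all hypothetical stand-ins.

```python
from typing import Callable, List


def optimize_line_pair(
    line_a: List[str],
    line_b: List[str],
    is_removable: Callable[[str], bool],
    max_uops: int,
) -> List[List[str]]:
    """Drop removable μops from two adjacent lines, then repack the rest.

    Repacking fills each output line up to `max_uops`, so removing μops
    can shorten the trace by whole lines.
    """
    if max_uops < 1:
        raise ValueError("max_uops must be at least 1")

    # Consider both lines together so removals in one line free slots
    # that μops from the other line can fill.
    kept = [u for u in line_a + line_b if not is_removable(u)]
    return [kept[i:i + max_uops] for i in range(0, len(kept), max_uops)]
```

For example, two three-μop lines that each contain one removable μop repack into one full line of three and a second line holding the single remaining μop.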
