PROCESSOR INSTRUCTIONS FOR DETERMINING TWO MINIMUM AND TWO MAXIMUM VALUES

    公开(公告)号:US20180181394A1

    公开(公告)日:2018-06-28

    申请号:US15387823

    申请日:2016-12-22

    IPC分类号: G06F9/30 G06F15/80

    摘要: Processor instructions for determining two minimum and two maximum values and associated apparatus and methods. The instructions include various 2MIN instructions for determining the two smallest values among three or four input values and 2MAX instructions for determining the two largest values among three or four input values. The 2MIN instructions employ two operands, with the first operand in some of the variations storing concatenated min1 and min2 values in a first register and a scr2 comparison value or two src2 concatenated src2 values in a second register. Comparators are used to implement hardware logic for determining whether the scr2 value(s) is/are less than each of min1 and min2. Based on the hardware logic, the smallest two values among min1, min2, and src2 (or both src2 values) are stored as concatenated values in the first register. The 2MAX instructions are implemented in a similar manner, except the comparisons are whether the scr2 value(s) is/are greater than each of max1 and max2 values. 128-bit 2MIN and 2MAX SIMD instructions are also provided for processing two 64-bit data-paths in parallel.

    METHOD AND APPARATUS TO PROCESS 4-OPERAND SIMD INTEGER MULTIPLY-ACCUMULATE INSTRUCTION
    64.
    发明申请
    METHOD AND APPARATUS TO PROCESS 4-OPERAND SIMD INTEGER MULTIPLY-ACCUMULATE INSTRUCTION 有权
    过程4操作的方法和装置SIMD INTEGER MULTIPLY-ACCUMULATE指令

    公开(公告)号:US20140082328A1

    公开(公告)日:2014-03-20

    申请号:US13617021

    申请日:2012-09-14

    IPC分类号: G06F9/302 G06F9/30

    摘要: According to one embodiment, a processor includes an instruction decoder to receive an instruction to process a multiply-accumulate operation, the instruction having a first operand, a second operand, a third operand, and a fourth operand. The first operand is to specify a first storage location to store an accumulated value; the second operand is to specify a second storage location to store a first value and a second value; and the third operand is to specify a third storage location to store a third value. The processor further includes an execution unit coupled to the instruction decoder to perform the multiply-accumulate operation to multiply the first value with the second value to generate a multiply result and to accumulate the multiply result and at least a portion of a third value to an accumulated value based on the fourth operand.

    摘要翻译: 根据一个实施例,处理器包括指令解码器,用于接收处理多重累积运算的指令,该指令具有第一操作数,第二操作数,第三操作数和第四操作数。 第一个操作数是指定一个存储累积值的第一个存储位置; 第二操作数是指定存储第一值和第二值的第二存储位置; 并且第三操作数是指定存储第三值的第三存储位置。 所述处理器还包括执行单元,其耦合到所述指令解码器以执行所述乘法运算,以将所述第一值乘以所述第二值以产生乘法结果,并将乘法结果和第三值的至少一部分累积到 基于第四操作数的累计值。

    Method and apparatus to process 4-operand SIMD integer multiply-accumulate instruction
    66.
    发明授权
    Method and apparatus to process 4-operand SIMD integer multiply-accumulate instruction 有权
    处理4操作数SIMD整数乘法累加指令的方法和装置

    公开(公告)号:US09292297B2

    公开(公告)日:2016-03-22

    申请号:US13617021

    申请日:2012-09-14

    IPC分类号: G06F9/00 G06F9/38 G06F9/30

    摘要: According to one embodiment, a processor includes an instruction decoder to receive an instruction to process a multiply-accumulate operation, the instruction having a first operand, a second operand, a third operand, and a fourth operand. The first operand is to specify a first storage location to store an accumulated value; the second operand is to specify a second storage location to store a first value and a second value; and the third operand is to specify a third storage location to store a third value. The processor further includes an execution unit coupled to the instruction decoder to perform the multiply-accumulate operation to multiply the first value with the second value to generate a multiply result and to accumulate the multiply result and at least a portion of a third value to an accumulated value based on the fourth operand.

    摘要翻译: 根据一个实施例,处理器包括指令解码器,用于接收处理多重累积运算的指令,该指令具有第一操作数,第二操作数,第三操作数和第四操作数。 第一个操作数是指定一个存储累积值的第一个存储位置; 第二操作数是指定存储第一值和第二值的第二存储位置; 并且第三操作数是指定存储第三值的第三存储位置。 所述处理器还包括执行单元,其耦合到所述指令解码器以执行所述乘法运算,以将所述第一值乘以所述第二值以产生乘法结果,并将乘法结果和第三值的至少一部分累积到 基于第四操作数的累计值。

    Debug system having assembler correcting register allocation errors
    68.
    发明授权
    Debug system having assembler correcting register allocation errors 失效
    具有汇编器校正寄存器分配错误的调试系统

    公开(公告)号:US07478374B2

    公开(公告)日:2009-01-13

    申请号:US10807218

    申请日:2004-03-22

    申请人: James D. Guilford

    发明人: James D. Guilford

    IPC分类号: G06F9/45

    CPC分类号: G06F11/3624

    摘要: An assembler, which can be provided as part of a debugger and/or development system, avoids register allocation errors, such as register bank conflicts and/or insufficient physical registers, automatically.

    摘要翻译: 可以作为调试器和/或开发系统的一部分提供的汇编器自动避免寄存器分配错误,如寄存器组冲突和/或不足的物理寄存器。

    Breakpoint method for parallel hardware threads in multithreaded processor
    69.
    发明授权
    Breakpoint method for parallel hardware threads in multithreaded processor 失效
    多线程处理器中并行硬件线程的断点方法

    公开(公告)号:US07020871B2

    公开(公告)日:2006-03-28

    申请号:US09747019

    申请日:2000-12-21

    IPC分类号: G06F9/44

    CPC分类号: G06F11/362

    摘要: A method of debugging code that executes in a multithreaded processor having a microengines includes receiving a program instruction and an identification representing a selected one of the microengines from a remote user interface connected to the processor pausing program execution in the threads executing in the selected microengine, inserting a breakpoint after a program instruction in the selected microengine that matches the program instruction received from the remote user interface, resuming program execution in the selected microengine, executing a breakpoint routine if program execution in the selected microengine encounters the breakpoint and resuming program execution in the microengine.

    摘要翻译: 调试在具有微引擎的多线程处理器中执行的代码的方法包括从连接到处理器的远程用户界面接收表示所选择的一个微引擎中的所选择的一个微指令的标识,所述远程用户界面暂停在所选择的微引擎中执行的线程中的程序执行, 在所选择的微引擎中的程序指令之后插入与从远程用户界面接收的程序指令相匹配的断点,恢复所选微引擎中的程序执行,执行断点程序,如果所选择的微引擎中的程序执行遇到断点并恢复程序执行 微型发动机。

    Multiple image dynamic bind and load procedure for a multi-processor
    70.
    发明授权
    Multiple image dynamic bind and load procedure for a multi-processor 失效
    多处理器的多图像动态绑定和加载过程

    公开(公告)号:US06684395B2

    公开(公告)日:2004-01-27

    申请号:US09753084

    申请日:2000-12-28

    IPC分类号: G06F944

    CPC分类号: G06F9/44521

    摘要: A method and mechanism for executing an application by a processor in a multi-processor configuration of processors, each having an associated instruction memory is presented. The application receives object code that includes an image for at least one other processor in the multi-processor configuration of processors. The application binds an import variable in the image to a parameter value and stores the image for the at least one other processor into the associated instruction memory.

    摘要翻译: 提出了一种用于在处理器的多处理器配置中由处理器执行应用的方法和机制,每个处理器具有相关联的指令存储器。 该应用接收处理器的多处理器配置中的至少一个其他处理器的图像的目标代码。 应用程序将图像中的导入变量绑定到参数值,并将至少一个其他处理器的映像存储到相关联的指令存储器中。