Register Indirect Access of Program Floating Point Registers by Millicode
    91.
    发明申请
    Register Indirect Access of Program Floating Point Registers by Millicode 失效
    通过Millicode寄存器间接访问程序浮点寄存器

    公开(公告)号:US20080126759A1

    公开(公告)日:2008-05-29

    申请号:US11531301

    申请日:2006-09-13

    IPC分类号: G06F9/302

    CPC分类号: G06F9/3017 G06F9/35

    摘要: Complex floating point instructions are executed under millicode control when it is not cost effective to implement its function in hardware. One of the disadvantages to executing complex instructions using millicode routines is that determining and accessing the instructions operands are costly for millicode performance. To determine what the source and target location are, the instruction text is parsed. Furthermore the millicode instruction stream must be modified to access the operand data from and write the result to the program registers specified by the complex floating point instruction. The invention overcomes these disadvantages by providing millicode with register indirect access to the program floating point registers.

    摘要翻译: 当在硬件中实现其功能不具有成本效益时,复杂的浮点指令在毫秒控制下执行。 使用millicode例程执行复杂指令的一个缺点是确定和访问指令操作数对于millicode性能来说是昂贵的。 要确定源和目标位置是什么,解释说明文本。 此外,必须修改millicode指令流以访问操作数数据,并将结果写入由复杂浮点指令指定的程序寄存器。 本发明通过向编程浮点寄存器提供寄存器间接访问的毫代码来克服这些缺点。

    Method and apparatus for mirroring units within a processor
    92.
    发明授权
    Method and apparatus for mirroring units within a processor 失效
    用于在处理器内镜像单元的方法和装置

    公开(公告)号:US07082550B2

    公开(公告)日:2006-07-25

    申请号:US10435914

    申请日:2003-05-12

    IPC分类号: G06F11/16

    CPC分类号: G06F11/1695 G06F11/1641

    摘要: A processor responsive to a clock cycle includes a base-unit, a mirror-unit that is a duplicate instance of the base-unit, a non-duplicate-unit in signal communication with the base and mirror units, a first staging register disposed at the input to the mirror-unit for delaying the input signal thereto by at least one clock cycle, and a second staging register disposed at the output of the mirror-unit for delaying the output signal therefrom by at least one clock cycle. The non-duplicate-unit includes a comparator for comparing the output signals of the base and mirror units.

    摘要翻译: 响应于时钟周期的处理器包括基本单元,作为基本单元的重复实例的镜像单元,与基础单元和镜像单元进行信号通信的非重复单元,第一分段寄存器,其布置在 用于将输入信号延迟至少一个时钟周期的反射镜单元的输入,以及设置在镜单元的输出处的第二分段寄存器,用于将其输出信号延迟至少一个时钟周期。 非重复单元包括用于比较基座和反射镜单元的输出信号的比较器。

    Superscalar microprocessor having multi-pipe dispatch and execution unit
    93.
    发明授权
    Superscalar microprocessor having multi-pipe dispatch and execution unit 失效
    超标量微处理器具有多管调度和执行单元

    公开(公告)号:US07082517B2

    公开(公告)日:2006-07-25

    申请号:US10435806

    申请日:2003-05-12

    IPC分类号: G06F9/30 G06F15/00

    摘要: In a computer system for use as a symetrical multiprocessor, a superscalar microprocessor apparatus allows dispatching and executing multi-cycle and complex instructions Some control signals are generated in the dispatch unit and dispatched with the instruction to the Fixed Point Unit (FXU). Multiple execution pipes correspond to the instruction dispatch ports and the execution unit is a Fixed Point Unit (FXU) which contains three execution dataflow pipes (X, Y and Z) and one control pipe (R). The FXU logic then execute these instructions on the available FXU pipes. This results in optimum performance with little or no other complications. The presented technique places the flexibility of how these instructions will be executed in the FXU, where the actual execution takes place, instead of in the instruction decode or dispatch units or cracking by the compiler.

    摘要翻译: 在用作对称多处理器的计算机系统中,超标量微处理器装置允许调度和执行多周期和复杂指令。在调度单元中生成一些控制信号,并且通过指令发送到定点单元(FXU)。 多个执行管道对应于指令调度端口,执行单元是包含三个执行数据流管道(X,Y和Z)和一个控制管道(R)的定点单元(FXU)。 然后,FXU逻辑在可用的FXU管道上执行这些说明。 这导致最佳性能,很少或没有其他并发症。 所提出的技术使得如何在实际执行的FXU中执行这些指令的灵活性,而不是在指令解码或调度单元中或由编译器破解。

    Copying character data having a termination character from one memory location to another
    96.
    发明授权
    Copying character data having a termination character from one memory location to another 有权
    将具有终止字符的字符数据从一个存储器位置复制到另一个存储器位置

    公开(公告)号:US09454366B2

    公开(公告)日:2016-09-27

    申请号:US13421498

    申请日:2012-03-15

    IPC分类号: G06F12/00 G06F9/30

    摘要: Copying characters of a set of terminated character data from one memory location to another memory location using parallel processing and without causing unwarranted exceptions. The character data to be copied is loaded within one or more vector registers. In particular, in one embodiment, an instruction (e.g., a Vector Load to block Boundary instruction) is used that loads data in parallel in a vector register to a specified boundary, and provides a way to determine the number of characters loaded. To determine the number of characters loaded (a count), another instruction (e.g., a Load Count to Block Boundary instruction) is used. Further, an instruction (e.g., a Vector Find Element Not Equal instruction) is used to find the index of the first delimiter character, i.e., the first termination character, such as a zero or null character within the character data. This instruction checks a plurality of bytes of data in parallel.

    摘要翻译: 使用并行处理将一组终止的字符数据的字符从一个存储器位置复制到另一个存储器位置,并且不引起无理的异常。 要复制的字符数据被加载到一个或多个向量寄存器中。 特别地,在一个实施例中,使用将矢量寄存器中并行的数据加载到指定边界的指令(例如,向量块向量边界指令),并且提供了确定加载的字符数的方法。 为了确定加载的字符数(计数),使用另一条指令(例如,向块边界指令的加载计数)。 此外,使用指令(例如,矢量查找元素不等于指令)来找到第一分隔符字符的索引,即第一终止字符,例如字符数据内的零或空字符。 该指令并行地检查多个字节的数据。