REDUCING THE FETCH TIME OF TARGET INSTRUCTIONS OF A PREDICTED TAKEN BRANCH INSTRUCTION

    公开(公告)号:US20080276070A1

    公开(公告)日:2008-11-06

    申请号:US12176385

    申请日:2008-07-20

    IPC分类号: G06F9/312

    CPC分类号: G06F9/3804 G06F9/3844

    摘要: A method and processor for reducing the fetch time of target instructions of a predicted taken branch instruction. Each entry in a buffer, referred to herein as a “branch target buffer”, may store an address of a branch instruction predicted taken and the instructions beginning at the target address of the branch instruction predicted taken. When an instruction is fetched from the instruction cache, a particular entry in the branch target buffer is indexed using particular bits of the fetched instruction. The address of the branch instruction in the indexed entry is compared with the address of the instruction fetched from the instruction cache. If there is a match, then the instructions beginning at the target address of that branch instruction are dispatched directly behind the branch instruction. In this manner, the fetch time of target instructions of a predicted taken branch instruction is reduced.

    Reducing the fetch time of target instructions of a predicted taken branch instruction
    32.
    发明授权
    Reducing the fetch time of target instructions of a predicted taken branch instruction 失效
    减少预测的分支指令的目标指令的获取时间

    公开(公告)号:US07437543B2

    公开(公告)日:2008-10-14

    申请号:US11109001

    申请日:2005-04-19

    IPC分类号: G06F9/40

    CPC分类号: G06F9/3804 G06F9/3844

    摘要: A method and processor for reducing the fetch time of target instructions of a predicted taken branch instruction. Each entry in a buffer, referred to herein as a “branch target buffer”, may store an address of a branch instruction predicted taken and the instructions beginning at the target address of the branch instruction predicted taken. When an instruction is fetched from the instruction cache, a particular entry in the branch target buffer is indexed using particular bits of the fetched instruction. The address of the branch instruction in the indexed entry is compared with the address of the instruction fetched from the instruction cache. If there is a match, then the instructions beginning at the target address of that branch instruction are dispatched directly behind the branch instruction. In this manner, the fetch time of target instructions of a predicted taken branch instruction is reduced.

    摘要翻译: 一种用于减少预测的分支指令的目标指令的获取时间的方法和处理器。 缓冲器中的每个条目(这里称为“分支目标缓冲器”)可以存储预测的分支指令的地址和从预测的分支指令的目标地址开始的指令。 当从指令高速缓存中取出指令时,使用获取的指令的特定位来对分支目标缓冲器中的特定条目进行索引。 将索引条目中的分支指令的地址与从指令高速缓存获取的指令的地址进行比较。 如果有匹配,则从该分支指令的目标地址开始的指令直接在分支指令的后面进行调度。 以这种方式,减少预测的分支指令的目标指令的获取时间。

    Wide shifting in the vector permute unit
    34.
    发明授权
    Wide shifting in the vector permute unit 有权
    矢量变换单位宽移

    公开(公告)号:US06327651B1

    公开(公告)日:2001-12-04

    申请号:US09149466

    申请日:1998-09-08

    IPC分类号: G06F1500

    CPC分类号: G06F7/766 G06F5/015

    摘要: A crossbar is implemented within multimedia facilities of a processor to perform vector permute operations, in which the bytes of a source operand are reordered in the target output. The crossbar is then reused for other instructions requiring multiplexing or shifting operations, particularly those in which the size of additional multiplexers or the size and delay of a barrel shifter is significant. A wide shift operation, for example, may be performed with one cycle latency by the crossbar and one additional layer of multiplexers or a small barrel shifter. The crossbar facility thus gets reused with improved performance of the instructions now sharing the crossbar and a reduction in the total area required by a multimedia facility within a processor.

    摘要翻译: 在处理器的多媒体设施内实现交叉开关以执行矢量置换操作,其中源操作数的字节在目标输出中重新排序。 然后,交叉开关重新用于需要复用或移位操作的其他指令,特别是其中附加多路复用器的大小或桶形移位器的大小和延迟是显着的那些指令。 例如,可以通过交叉开关和一个附加的多路复用器层或小桶形移位器以一个周期的等待时间来执行宽移位操作。 因此,交叉开关设备被重新使用,现在共享交叉开关的指令的性能得到改善,并且减少处理器内的多媒体设备所需的总面积。

    Method and system for vector processing utilizing selected vector
elements
    36.
    发明授权
    Method and system for vector processing utilizing selected vector elements 失效
    利用选定向量元素进行向量处理的方法和系统

    公开(公告)号:US5680338A

    公开(公告)日:1997-10-21

    申请号:US368172

    申请日:1995-01-04

    IPC分类号: G06F17/16

    CPC分类号: G06F17/16

    摘要: In a vector processing system for processing vector calculations utilizing a portion of a vector comprising a plurality of elements, means for receiving a vector and a vector processing command are provided. The vector processing system also includes means for receiving and storing a start-element value and an end-element value. An arithmetic logic unit is coupled to the means for receiving the vector, the means for receiving the vector processing command, and the means for receiving the start-element and end-element values. The arithmetic logic unit also includes means for executing the vector processing command utilizing only one or more of the elements in the vector, which are selected by the start-element value and the end-element value.

    摘要翻译: 在使用包括多个元素的向量的一部分来处理向量计算的向量处理系统中,提供了用于接收向量和向量处理命令的装置。 矢量处理系统还包括用于接收和存储起始元素值和终止元素值的装置。 算术逻辑单元耦合到用于接收向量的装置,用于接收向量处理命令的装置和用于接收起始元素和终止元素值的装置。 算术逻辑单元还包括仅利用向量中的一个或多个元素来执行向量处理命令的装置,其由起始元素值和终止元素值选择。

    Sharing data in internal and memory representations with dynamic data-driven conversion
    39.
    发明授权
    Sharing data in internal and memory representations with dynamic data-driven conversion 有权
    通过动态数据驱动转换在内部和内存表示中共享数据

    公开(公告)号:US07849294B2

    公开(公告)日:2010-12-07

    申请号:US12023768

    申请日:2008-01-31

    IPC分类号: G06F9/302 G06F9/305

    摘要: Illustrative embodiments determine the data type of the operand being accessed as well as analyze the data value subrange of the input operand data type. If the operand's data type does not match the required format of the instruction being processed, a determination is made as to whether a subrange of data values of the data type of the input operand is supported natively. If the subrange of data values of the input operand is not supported natively, then a format conversion is performed on the data and the instruction may then operate on the data. Otherwise, the data may be operated on directly by the instruction without a format conversion operation and thus, the conversion is not performed.

    摘要翻译: 说明性实施例确定被访问的操作数的数据类型以及分析输入操作数数据类型的数据值子范围。 如果操作数的数据类型与正在处理的指令的所需格式不匹配,则确定本地是否支持输入操作数的数据类型的数据值的子范围。 如果本地不支持输入操作数的数据值的子范围,则对数据进行格式转换,然后该指令可以对数据进行操作。 否则,数据可以直接由指令操作,而不进行格式转换操作,因此不进行转换。

    REDUCING THE FETCH TIME OF TARGET INSTRUCTIONS OF A PREDICTED TAKEN BRANCH INSTRUCTION
    40.
    发明申请
    REDUCING THE FETCH TIME OF TARGET INSTRUCTIONS OF A PREDICTED TAKEN BRANCH INSTRUCTION 审中-公开
    减少预期的分支指导目标指示的时间

    公开(公告)号:US20080276071A1

    公开(公告)日:2008-11-06

    申请号:US12176386

    申请日:2008-07-20

    IPC分类号: G06F9/312

    CPC分类号: G06F9/3804 G06F9/3844

    摘要: A method and processor for reducing the fetch time of target instructions of a predicted taken branch instruction. Each entry in a buffer, referred to herein as a “branch target buffer”, may store an address of a branch instruction predicted taken and the instructions beginning at the target address of the branch instruction predicted taken. When an instruction is fetched from the instruction cache, a particular entry in the branch target buffer is indexed using particular bits of the fetched instruction. The address of the branch instruction in the indexed entry is compared with the address of the instruction fetched from the instruction cache. If there is a match, then the instructions beginning at the target address of that branch instruction are dispatched directly behind the branch instruction. In this manner, the fetch time of target instructions of a predicted taken branch instruction is reduced.

    摘要翻译: 一种用于减少预测的分支指令的目标指令的获取时间的方法和处理器。 缓冲器中的每个条目(这里称为“分支目标缓冲器”)可以存储预测的分支指令的地址和从预测的分支指令的目标地址开始的指令。 当从指令高速缓存中取出指令时,使用获取的指令的特定位来对分支目标缓冲器中的特定条目进行索引。 将索引条目中的分支指令的地址与从指令高速缓存获取的指令的地址进行比较。 如果有匹配,则从该分支指令的目标地址开始的指令直接在分支指令的后面进行调度。 以这种方式,减少预测的分支指令的目标指令的获取时间。