Elimination of potential renaming stalls due to use of partial registers
    52.
    发明授权
    Elimination of potential renaming stalls due to use of partial registers 有权
    消除由于使用部分寄存器引起的潜在重命名失速

    公开(公告)号:US07162614B2

    公开(公告)日:2007-01-09

    申请号:US10608121

    申请日:2003-06-30

    IPC分类号: G06F9/38

    摘要: Two or more pointers, each of which indicates where values of a respective group of bits of a source of a particular micro-operation will be found when the particular micro-operation is executed, may not all point to the same register. Renaming of the source of the particular micro-operation may be enabled by generating one or more new micro-operations that merge the values into a single register. The one or more new micro-operations are inserted into a sequence of micro-operations that includes the particular micro-operation. Once the source of the particular micro-operation has been renamed, subsequent micro-operations in the sequence may be renamed, if appropriate, and executed, without having to wait for the values to be calculated.

    摘要翻译: 两个或更多个指针,每个指针指示当执行特定微操作时将发现特定微操作的源的相应组的位置的值,其可能不都指向相同的寄存器。 可以通过生成将值合并到单个寄存器中的一个或多个新的微操作来实现特定微操作的源的重命名。 一个或多个新的微操作被插入到包括特定微操作的微操作的序列中。 一旦特定微操作的源被重新命名,则可以重新命名该序列中的后续微操作,如果适用并被执行,而不必等待该值被计算。

    Apparatus and method for down conversion of data types

    公开(公告)号:US10474463B2

    公开(公告)日:2019-11-12

    申请号:US13997006

    申请日:2011-12-23

    IPC分类号: G06F9/30

    摘要: An apparatus and method are described for down-converting from a source operand to a destination operand with masking. For example, a method according to one embodiment includes the following operations: reading a source operand value to be down-converted from a first value to a down-converted value and stored in a destination location; reading each mask register bit stored in a mask register, the mask register bit(s) indicating whether to perform a masking operation or a conversion operation on the source operand value; if the mask register bit(s) indicates that a masking operation is to be performed, then performing a specified masking operation and storing the results of the masking operation in the destination location; and if the mask register bit indicates that a masking operation is not to be performed, then down-converting the source operand value and storing the down-converted value in the specified destination location.

    Apparatus and method for performing a permute operation
    57.
    发明授权
    Apparatus and method for performing a permute operation 有权
    用于执行置换操作的装置和方法

    公开(公告)号:US09495162B2

    公开(公告)日:2016-11-15

    申请号:US13996072

    申请日:2011-12-23

    IPC分类号: G06F9/30

    摘要: An apparatus and method are described for permuting data elements with masking. For example, a method according to one embodiment includes the following operations: reading values from a mask data structure to determine whether masking is implemented for each data element of a destination operand; if masking not implemented for a particular data element, then selecting data elements from the destination operand and a second source operand based on index values stored in a first source operand to be copied to data element positions within the destination operand, wherein any one of the data elements from either the destination operand and the second source operand may be copied to any one of the data element positions within the destination operand; if masking is implemented for a particular data element of the destination operand, then performing a designated masking operation with respect to that particular data element.

    摘要翻译: 描述了用掩模来置换数据元素的装置和方法。 例如,根据一个实施例的方法包括以下操作:从掩模数据结构读取值以确定是否对目的地操作数的每个数据元素实施掩蔽; 如果对特定数据元素没有实现掩蔽,则根据存储在第一源操作数中的索引值从目的地操作数和第二源操作数中选择要复制到目的地操作数内的数据元素位置的第二源操作数,其中, 来自目的地操作数和第二源操作数的数据元素可以被复制到目的地操作数中的任何一个数据元素位置; 如果针对目的地操作数的特定数据元素实现掩蔽,则对该特定数据元素执行指定的屏蔽操作。

    Vector shuffle instructions operating on multiple lanes each having a plurality of data elements using a same set of per-lane control bits
    59.
    发明授权
    Vector shuffle instructions operating on multiple lanes each having a plurality of data elements using a same set of per-lane control bits 有权
    在多个通道上操作的矢量洗牌指令,每个通道使用相同的每通道控制位集合具有多个数据元素

    公开(公告)号:US08914613B2

    公开(公告)日:2014-12-16

    申请号:US13219418

    申请日:2011-08-26

    IPC分类号: G06F15/16 G06F9/30 G06F9/38

    摘要: In-lane vector shuffle operations are described. In one embodiment a shuffle instruction specifies a field of per-lane control bits, a source operand and a destination operand, these operands having corresponding lanes, each lane divided into corresponding portions of multiple data elements. Sets of data elements are selected from corresponding portions of every lane of the source operand according to per-lane control bits. Elements of these sets are copied to specified fields in corresponding portions of every lane of the destination operand. Another embodiment of the shuffle instruction also specifies a second source operand, all operands having corresponding lanes divided into multiple data elements. A set selected according to per-lane control bits contains data elements from every lane portion of a first source operand and data elements from every corresponding lane portion of the second source operand. Set elements are copied to specified fields in every lane of the destination operand.

    摘要翻译: 描述车道内向量随机操作。 在一个实施例中,混洗指令指定每通道控制位,源操作数和目的地操作数的字段,这些操作数具有相应的通道,每个通道被划分为多个数据元素的相应部分。 根据每通道控制位,从源操作数的每个通道的相应部分中选择数据元素的集合。 这些集合的元素被复制到目标操作数的每个通道的相应部分中的指定字段。 混洗指令的另一实施例还指定第二源操作数,所有操作数具有被划分为多个数据元素的相应通道。 根据每通道控制位选择的集合包含来自第一源操作数的每个通道部分的数据元素和来自第二源操作数的每个对应通道部分的数据元素。 将元素复制到目标操作数的每个通道中的指定字段。