APPARATUS AND METHOD OF IMPROVED INSERT INSTRUCTIONS

    公开(公告)号:US20170357510A1

    公开(公告)日:2017-12-14

    申请号:US15668461

    申请日:2017-08-03

    Abstract: An apparatus is described having instruction execution logic circuitry to execute first, second, third and fourth instruction. Both the first instruction and the second instruction insert a first group of input vector elements to one of multiple first non overlapping sections of respective first and second resultant vectors. The first group has a first bit width. Each of the multiple first non overlapping sections have a same bit width as the first group. Both the third instruction and the fourth instruction insert a second group of input vector elements to one of multiple second non overlapping sections of respective third and fourth resultant vectors. The second group has a second bit width that is larger than said first bit width. Each of the multiple second non overlapping sections have a same bit width as the second group. The apparatus also includes masking layer circuitry to mask the first and third instructions at a first resultant vector granularity, and, mask the second and fourth instructions at a second resultant vector granularity.

    APPARATUS AND METHOD OF IMPROVED EXTRACT INSTRUCTIONS

    公开(公告)号:US20170242704A1

    公开(公告)日:2017-08-24

    申请号:US15452631

    申请日:2017-03-07

    Abstract: An apparatus is described that includes instruction execution circuitry to execute first, second, third, and fourth instructions, the first and second instructions select a first group of input vector elements from one of multiple first non-overlapping sections of respective first and second input vectors. Each of the multiple first non-overlapping sections have a same bit width as the first group. Both the third and fourth instructions select a second group of input vector elements from one of multiple second non-overlapping sections of respective third and fourth input vectors. The second group has a second bit width that is larger than the first bit width. Each of multiple second non-overlapping sections have a same bit width as the second group. The apparatus includes masking layer circuitry to mask the first and second groups at a first granularity a second granularity.

    METHOD AND APPARATUS FOR PERFORMING A VECTOR PERMUTE WITH AN INDEX AND AN IMMEDIATE
    25.
    发明申请
    METHOD AND APPARATUS FOR PERFORMING A VECTOR PERMUTE WITH AN INDEX AND AN IMMEDIATE 审中-公开
    用索引和立即执行矢量保护的方法和装置

    公开(公告)号:US20160188530A1

    公开(公告)日:2016-06-30

    申请号:US14583644

    申请日:2014-12-27

    Abstract: An apparatus and method for performing a vector permute. For example, one embodiment of a processor comprises: a source vector register to store a plurality of source data elements; a destination vector register to store a plurality of destination data elements; a control vector register to store a plurality of control data elements, each control data element corresponding to one of the destination data elements and including an N bit value indicating whether a source data element is to be copied to the corresponding destination data element; vector permute logic to compare the N bit value of each control data element to an N bit portion of an immediate to determine whether to copy a source data element to the corresponding destination data element, wherein if the N bit values match, then the vector permute logic is to identify a source data element using an index value included in the control data element and to responsively copy the source data element to the corresponding destination data element in the destination vector register.

    Abstract translation: 用于执行向量置换的装置和方法。 例如,处理器的一个实施例包括:源向量寄存器,用于存储多个源数据元素; 目的地向量寄存器,用于存储多个目的地数据元素; 用于存储多个控制数据元素的控制向量寄存器,与目的地数据元素之一对应的每个控制数据元素,并且包括指示源数据元素是否被复制到对应的目的地数据元素的N位值; 向量置换逻辑,以将每个控制数据元素的N位值与立即数的N位部分进行比较,以确定是否将源数据元素复制到对应的目标数据元素,其中如果N位值匹配,则向量置换 逻辑是使用包括在控制数据元素中的索引值来识别源数据元素,并且将源数据元素响应地复制到目的地向量寄存器中的相应目的地数据元素。

Patent Agency Ranking