Controlling the number of powered vector lanes via a register field

    公开(公告)号:US10732689B2

    公开(公告)日:2020-08-04

    申请号:US15638407

    申请日:2017-06-30

    Abstract: The vector data path is divided into smaller vector lanes. A register such as a memory mapped control register stores a vector lane number (VLX) indicating the number of vector lanes to be powered. A decoder converts this VLX into a vector lane control word, each bit controlling the ON of OFF state of the corresponding vector lane. This number of contiguous least significant vector lanes are powered. In the preferred embodiment the stored data VLX indicates that 2VLX contiguous least significant vector lanes are to be powered. Thus the number of vector lanes powered is limited to an integral power of 2. This manner of coding produces a very compact controlling bit field while obtaining substantially all the power saving advantage of individually controlling the power of all vector lanes.

    Conditional execution specification of instructions using conditional extension slots in the same execute packet in a VLIW processor

    公开(公告)号:US10402199B2

    公开(公告)日:2019-09-03

    申请号:US14920298

    申请日:2015-10-22

    Abstract: One embodiment of this invention provides two conditional execution auxiliary instructions directed to disparate subsets of the plural functional units. Depending on the conditional execution desired, only one of the two conditional execution auxiliary instructions may be required for a particular execute packet. Another embodiment of this invention employs only one of two possible register files for the condition registers. In a VLIW processor it may be advantageous to split the functional units into separate sets with corresponding register files. This limits the number of functional units that may simultaneously access the register files. In the preferred embodiment of this invention the functional units are divided into a scalar set which access scalar registers and a vector set which access vector registers. The data registers storing the conditions for both scalar and vector instructions are in the scalar data register file.

    BUTTERFLY NETWORK ON LOAD DATA RETURN
    67.
    发明申请

    公开(公告)号:US20190020360A1

    公开(公告)日:2019-01-17

    申请号:US15651055

    申请日:2017-07-17

    Abstract: A method is shown that is operable to transform and align a plurality of fields from an input to an output data stream using a multilayer butterfly or inverse butterfly network. Many transformations are possible with such a network which may include separate control of each multiplexer. This invention supports a limited set of multiplexer control signals, which enables a similarly limited set of data transformations. This limited capability is offset by the reduced complexity of the multiplexor control circuits.

    REGISTER FILE STRUCTURES COMBINING VECTOR AND SCALAR DATA WITH GLOBAL AND LOCAL ACCESSES
    70.
    发明申请
    REGISTER FILE STRUCTURES COMBINING VECTOR AND SCALAR DATA WITH GLOBAL AND LOCAL ACCESSES 有权
    寄存器文件结构组合向量和标量数据与全局和本地访问

    公开(公告)号:US20150019836A1

    公开(公告)日:2015-01-15

    申请号:US14327066

    申请日:2014-07-09

    CPC classification number: G06F9/30036 G06F9/30014 G06F9/30094 G06F9/3012

    Abstract: The number of registers required is reduced by overlapping scalar and vector registers. This also allows increased compiler flexibility when mixing scalar and vector instructions. Local register read ports are minimized by restricting read access. Dedicated predicate registers reduces requirements for general registers, and allows reduction of critical timing paths by allowing the predicate registers to be placed next to the predicate unit.

    Abstract translation: 所需的寄存器数量通过重叠标量和向量寄存器来减少。 这也允许在混合标量和向量指令时增加编译器的灵活性。 通过限制读取访问来使本地寄存器读取端口最小化。 专用谓词寄存器减少通用寄存器的要求,并允许通过允许将谓词寄存器放置在谓词单元旁边来减少关键定时路径。

Patent Agency Ranking