HIGH LEVEL SOFTWARE EXECUTION MASK OVERRIDE
    1.
    发明申请
    HIGH LEVEL SOFTWARE EXECUTION MASK OVERRIDE 有权
    高级软件执行掩码

    公开(公告)号:US20140181467A1

    公开(公告)日:2014-06-26

    申请号:US13725063

    申请日:2012-12-21

    CPC classification number: G06F9/3887 G06F9/30036

    Abstract: Methods, and media, and computer systems are provided. The method includes, the media includes control logic for, and the computer system includes a processor with control logic for overriding an execution mask of SIMD hardware to enable at least one of a plurality of lanes of the SIMD hardware. Overriding the execution mask is responsive to a data parallel computation and a diverged control flow of a workgroup.

    Abstract translation: 提供了方法,媒体和计算机系统。 该方法包括:媒体包括用于的控制逻辑,并且计算机系统包括具有用于覆盖SIMD硬件的执行掩码的控制逻辑的处理器,以使能SIMD硬件的多个通道中的至少一个。 覆盖执行掩码响应于数据并行计算和工作组的分散控制流。

    CREATING SIMD EFFICIENT CODE BY TRANSFERRING REGISTER STATE THROUGH COMMON MEMORY
    2.
    发明申请
    CREATING SIMD EFFICIENT CODE BY TRANSFERRING REGISTER STATE THROUGH COMMON MEMORY 有权
    通过通用通信传输寄存器状态创建简单有效的代码

    公开(公告)号:US20140149710A1

    公开(公告)日:2014-05-29

    申请号:US13689421

    申请日:2012-11-29

    CPC classification number: G06F9/3887 G06F9/3851

    Abstract: Methods, media, and computing systems are provided. The method includes, the media are configured for, and the computing system includes a processor with control logic for allocating memory for storing a plurality of local register states for work items to be executed in single instruction multiple data hardware and for repacking wavefronts that include work items associated with a program instruction responsive to a conditional statement. The repacking is configured to create repacked wavefronts that include at least one of a wavefront containing work items that all pass the conditional statement and a wavefront containing work items that all fail the conditional statement.

    Abstract translation: 提供了方法,媒体和计算系统。 该方法包括:媒体被配置用于计算系统,并且计算系统包括具有控制逻辑的处理器,该控制逻辑用于分配存储器,用于存储要在单指令多数据硬件中执行的工作项的多个本地寄存器状态,以及用于重新包装工作的波前 与响应于条件语句的程序指令相关联的项目。 重新配置被配置为创建重新包装的波前,其包括包含工作项的波前中的至少一个,所述工作项全部通过条件语句,以及包含所有未完成条件语句的工作项的波阵面。

    Creating SIMD efficient code by transferring register state through common memory
    3.
    发明授权
    Creating SIMD efficient code by transferring register state through common memory 有权
    通过公共存储器传送寄存器状态来创建SIMD高效代码

    公开(公告)号:US09354892B2

    公开(公告)日:2016-05-31

    申请号:US13689421

    申请日:2012-11-29

    CPC classification number: G06F9/3887 G06F9/3851

    Abstract: Methods, media, and computing systems are provided. The method includes, the media are configured for, and the computing system includes a processor with control logic for allocating memory for storing a plurality of local register states for work items to be executed in single instruction multiple data hardware and for repacking wavefronts that include work items associated with a program instruction responsive to a conditional statement. The repacking is configured to create repacked wavefronts that include at least one of a wavefront containing work items that all pass the conditional statement and a wavefront containing work items that all fail the conditional statement.

    Abstract translation: 提供了方法,媒体和计算系统。 该方法包括:媒体被配置用于计算系统,并且计算系统包括具有控制逻辑的处理器,该控制逻辑用于分配存储器,用于存储要在单指令多数据硬件中执行的工作项的多个本地寄存器状态,以及用于重新包装工作的波前 与响应于条件语句的程序指令相关联的项目。 重新配置被配置为创建重新包装的波前,其包括包含工作项的波前中的至少一个,所述工作项全部通过条件语句,以及包含所有未完成条件语句的工作项的波阵面。

    High level software execution mask override
    4.
    发明授权
    High level software execution mask override 有权
    高级软件执行掩码覆盖

    公开(公告)号:US09317296B2

    公开(公告)日:2016-04-19

    申请号:US13725063

    申请日:2012-12-21

    CPC classification number: G06F9/3887 G06F9/30036

    Abstract: Methods, and media, and computer systems are provided. The method includes, the media includes control logic for, and the computer system includes a processor with control logic for overriding an execution mask of SIMD hardware to enable at least one of a plurality of lanes of the SIMD hardware. Overriding the execution mask is responsive to a data parallel computation and a diverged control flow of a workgroup.

    Abstract translation: 提供了方法,媒体和计算机系统。 该方法包括:媒体包括用于的控制逻辑,并且计算机系统包括具有用于覆盖SIMD硬件的执行掩码的控制逻辑的处理器,以使能SIMD硬件的多个通道中的至少一个。 覆盖执行掩码响应于数据并行计算和工作组的分散控制流。

    DATA PROCESSOR AND METHOD OF LANE REALIGNMENT
    5.
    发明申请
    DATA PROCESSOR AND METHOD OF LANE REALIGNMENT 审中-公开
    数据处理器和LANE实现方法

    公开(公告)号:US20150100758A1

    公开(公告)日:2015-04-09

    申请号:US14045114

    申请日:2013-10-03

    Abstract: A data processor includes a register file divided into at least a first portion and a second portion for storing data. A single instruction, multiple data (SIMD) unit is also divided into at least a first lane and a second lane. The first and second lanes of the SIMD unit correspond respectively to the first and second portions of the register file. Furthermore, each lane of the SIMD unit is capable of data processing. The data processor also includes a realignment element in communication with the register file and the SIMD unit. The realignment element is configured to selectively realign conveyance of data between the first portion of the register file and the first lane of the SIMD unit to the second lane of the SIMD unit.

    Abstract translation: 数据处理器包括被分成至少第一部分的寄存器文件和用于存储数据的第二部分。 单指令,多数据(SIMD)单元也被划分为至少第一通道和第二通道。 SIMD单元的第一和第二通道分别对应于寄存器文件的第一和第二部分。 此外,SIMD单元的每个通道能够进行数据处理。 数据处理器还包括与寄存器文件和SIMD单元通信的重新对准元件。 重新对准元件被配置为选择性地将寄存器文件的第一部分与SIMD单元的第一通道之间的数据传送到SIMD单元的第二通道。

Patent Agency Ranking