HIGH LEVEL SOFTWARE EXECUTION MASK OVERRIDE
    11.
    发明申请
    HIGH LEVEL SOFTWARE EXECUTION MASK OVERRIDE 有权
    高级软件执行掩码

    公开(公告)号:US20140181467A1

    公开(公告)日:2014-06-26

    申请号:US13725063

    申请日:2012-12-21

    CPC classification number: G06F9/3887 G06F9/30036

    Abstract: Methods, and media, and computer systems are provided. The method includes, the media includes control logic for, and the computer system includes a processor with control logic for overriding an execution mask of SIMD hardware to enable at least one of a plurality of lanes of the SIMD hardware. Overriding the execution mask is responsive to a data parallel computation and a diverged control flow of a workgroup.

    Abstract translation: 提供了方法,媒体和计算机系统。 该方法包括:媒体包括用于的控制逻辑,并且计算机系统包括具有用于覆盖SIMD硬件的执行掩码的控制逻辑的处理器,以使能SIMD硬件的多个通道中的至少一个。 覆盖执行掩码响应于数据并行计算和工作组的分散控制流。

    MULTI-CORE PROCESSING DEVICE WITH INVALIDATION CACHE TAGS AND METHODS
    12.
    发明申请
    MULTI-CORE PROCESSING DEVICE WITH INVALIDATION CACHE TAGS AND METHODS 有权
    具有无效缓存标签和方法的多核处理设备

    公开(公告)号:US20140173210A1

    公开(公告)日:2014-06-19

    申请号:US13719730

    申请日:2012-12-19

    CPC classification number: G06F12/0864 G06F12/0815

    Abstract: A data processing device is provided that facilitates cache coherence policies. In one embodiment, a data processing device utilizes invalidation tags in connection with a cache that is associated with a processing engine. In some embodiments, the cache is configured to store a plurality of cache entries where each cache entry includes a cache line configured to store data and a corresponding cache tag configured to store address information associated with data stored in the cache line. Such address information includes invalidation flags with respect to addresses stored in the cache tags. Each cache tag is associated with an invalidation tag configured to store information related to invalidation commands of addresses stored in the cache tag. In such embodiment, the cache is configured to set invalidation flags of cache tags based upon information stored in respective invalidation tags.

    Abstract translation: 提供了一种有助于高速缓存一致性策略的数据处理设备。 在一个实施例中,数据处理设备利用与处理引擎相关联的高速缓存的无效标签。 在一些实施例中,高速缓存被配置为存储多个高速缓存条目,其中每个高速缓存条目包括被配置为存储数据的高速缓存行和被配置为存储与存储在高速缓存行中的数据相关联的地址信息的对应高速缓存标签。 这样的地址信息包括关于存储在高速缓存标签中的地址的无效标志。 每个缓存标签与被配置为存储与存储在高速缓存标签中的地址的无效命令相关的信息的无效标签相关联。 在这种实施例中,高速缓存被配置为基于存储在相应无效标签中的信息来设置高速缓存标签的无效标志。

    CREATING SIMD EFFICIENT CODE BY TRANSFERRING REGISTER STATE THROUGH COMMON MEMORY
    13.
    发明申请
    CREATING SIMD EFFICIENT CODE BY TRANSFERRING REGISTER STATE THROUGH COMMON MEMORY 有权
    通过通用通信传输寄存器状态创建简单有效的代码

    公开(公告)号:US20140149710A1

    公开(公告)日:2014-05-29

    申请号:US13689421

    申请日:2012-11-29

    CPC classification number: G06F9/3887 G06F9/3851

    Abstract: Methods, media, and computing systems are provided. The method includes, the media are configured for, and the computing system includes a processor with control logic for allocating memory for storing a plurality of local register states for work items to be executed in single instruction multiple data hardware and for repacking wavefronts that include work items associated with a program instruction responsive to a conditional statement. The repacking is configured to create repacked wavefronts that include at least one of a wavefront containing work items that all pass the conditional statement and a wavefront containing work items that all fail the conditional statement.

    Abstract translation: 提供了方法,媒体和计算系统。 该方法包括:媒体被配置用于计算系统,并且计算系统包括具有控制逻辑的处理器,该控制逻辑用于分配存储器,用于存储要在单指令多数据硬件中执行的工作项的多个本地寄存器状态,以及用于重新包装工作的波前 与响应于条件语句的程序指令相关联的项目。 重新配置被配置为创建重新包装的波前,其包括包含工作项的波前中的至少一个,所述工作项全部通过条件语句,以及包含所有未完成条件语句的工作项的波阵面。

    Method for memory consistency among heterogeneous computer components
    20.
    发明授权
    Method for memory consistency among heterogeneous computer components 有权
    异构计算机组件之间内存一致性的方法

    公开(公告)号:US09361118B2

    公开(公告)日:2016-06-07

    申请号:US14275271

    申请日:2014-05-12

    Abstract: A method, computer program product, and system is described that determines the correctness of using memory operations in a computing device with heterogeneous computer components. Embodiments include an optimizer based on the characteristics of a Sequential Consistency for Heterogeneous-Race-Free (SC for HRF) model that analyzes a program and determines the correctness of the ordering of events in the program. HRF models include combinations of the properties: scope order, scope inclusion, and scope transitivity. The optimizer can determine when a program is heterogeneous-race-free in accordance with an SC for HRF memory consistency model. For example, the optimizer can analyze a portion of program code, respect the properties of the SC for HRF model, and determine whether a value produced by a store memory event will be a candidate for a value observed by a load memory event. In addition, the optimizer can determine whether reordering of events is possible.

    Abstract translation: 描述了一种方法,计算机程序产品和系统,其确定在具有异构计算机组件的计算设备中使用存储器操作的正确性。 实施例包括基于用于异构无竞争(SC for HRF)的顺序一致性的特性的优化器,该模型分析程序并确定程序中的事件的顺序的正确性。 HRF模型包括属性的组合:范围顺序,范围包含和范围传递性。 优化器可以根据HR对HRF内存一致性模型的SC来确定程序何时是异构无竞争的。 例如,优化器可以分析程序代码的一部分,尊重SC的HRF模型的属性,并且确定由存储器存储器事件产生的值是否将是由加载存储器事件观察到的值的候选。 此外,优化器可以确定是否可能重新排序事件。

Patent Agency Ranking