Hybrid computing device selection analysis

    Publication No.: US11188348B2

    Publication date: 2021-11-30

    Application No.: US16118873

    Filing date: 2018-08-31

    IPC classification: G06F9/445 G06F11/30 G06F11/34

    Abstract: Methods, systems, and computer program products for hardware device selection in a computing environment are provided. Aspects include receiving, by a processor, a request to execute a programming code, wherein the processor is operating in a hybrid computing environment comprising a plurality of hardware devices. A performance model associated with the programming code is obtained by the processor. Runtime data associated with the programming code is obtained by the processor. The runtime data is fed into the performance model to determine an execution cost for executing the programming code on each of the plurality of hardware devices, and a target hardware device is selected from the plurality of hardware devices based on the execution costs.
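The selection step in this abstract can be pictured with a minimal sketch. Everything named here is hypothetical and not taken from the patent: the `PerformanceModel` class, the device names, and the cost formulas are illustrative assumptions, and picking the lowest estimated cost is only one plausible reading of "based on the execution costs".

```python
from dataclasses import dataclass
from typing import Callable, Dict, Mapping, Tuple

# Hypothetical type: a cost function maps runtime data to an estimated cost.
CostFn = Callable[[Mapping[str, float]], float]


@dataclass
class PerformanceModel:
    """Illustrative per-code performance model: one cost function per device."""
    cost_fns: Dict[str, CostFn]

    def execution_costs(self, runtime_data: Mapping[str, float]) -> Dict[str, float]:
        # Feed the runtime data into the model to get one cost per device.
        return {dev: fn(runtime_data) for dev, fn in self.cost_fns.items()}


def select_device(model: PerformanceModel,
                  runtime_data: Mapping[str, float]) -> Tuple[str, Dict[str, float]]:
    """Return the device with the lowest estimated cost, plus all costs."""
    costs = model.execution_costs(runtime_data)
    target = min(costs, key=costs.get)
    return target, costs


# Made-up cost formulas (seconds): the GPU pays a fixed launch cost and a
# per-byte transfer cost but has a lower per-element compute cost.
model = PerformanceModel({
    "cpu": lambda d: d["n_elements"] * 4e-9,
    "gpu": lambda d: 1e-3 + d["bytes"] * 1e-10 + d["n_elements"] * 2e-10,
})

small = {"n_elements": 1_000, "bytes": 8_000}
large = {"n_elements": 10_000_000, "bytes": 80_000_000}
small_target, _ = select_device(model, small)
large_target, _ = select_device(model, large)
```

With these assumed formulas, the fixed GPU overhead dominates for the small input and the per-element CPU cost dominates for the large one, so the selected device changes with the runtime data.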

    Run-length base-delta encoding for high-speed compression

    Publication No.: US10419022B2

    Publication date: 2019-09-17

    Application No.: US15828385

    Filing date: 2017-11-30

    IPC classification: H03M7/34 H03M7/48 H03M7/30

    Abstract: A method, a computer system, and a computer program product for high-speed data compression are provided. The present invention may include receiving an input stream. The present invention may include selecting a header based on the received input stream, wherein the header includes a base, a scheme and a delta count. The present invention may include determining whether there are any remaining values in an uncompressed input stream. The present invention may include reading a first next value from the input stream. The present invention may include determining whether the read first next value is representable with a current base scheme. The present invention may include calculating the delta count based on determining that the read first next value is representable with the current base scheme. The present invention may include writing the calculated delta count to the selected header. The present invention may include incrementing the written delta count.
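The loop described in this abstract (read a value, test whether it is representable with the current base and scheme, and if so extend the current run) can be sketched as follows. This is a simplified illustration under stated assumptions, not the patented format: the "scheme" is assumed to be a fixed signed delta width in bits, headers are plain dictionaries, and a value that is not representable simply starts a new header.

```python
def encode(values, delta_bits=8):
    """Toy base-delta run encoder.

    Each block is (header, deltas), where header holds a base, a scheme
    (assumed here to be the signed delta width in bits) and a delta count.
    """
    lo = -(1 << (delta_bits - 1))
    hi = (1 << (delta_bits - 1)) - 1
    blocks = []
    for v in values:
        if blocks:
            header, deltas = blocks[-1]
            delta = v - header["base"]
            if lo <= delta <= hi:
                # Representable with the current base and scheme:
                # store the delta and increment the header's delta count.
                deltas.append(delta)
                header["count"] += 1
                continue
        # Not representable (or first value): start a new header.
        blocks.append(({"base": v, "scheme": delta_bits, "count": 0}, []))
    return blocks


def decode(blocks):
    """Rebuild the original values from the blocks."""
    out = []
    for header, deltas in blocks:
        out.append(header["base"])
        out.extend(header["base"] + d for d in deltas)
    return out


stream = [100, 101, 103, 99, 5000, 5001]
encoded = encode(stream)
```

In this sketch the first four values share the base 100, while 5000 falls outside the 8-bit delta range and opens a second block.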

    COMPILER OPTIMIZATION BASED ON COLLECTIVITY ANALYSIS
    Type: patent application (legal status: in force)

    Publication No.: US20140196018A1

    Publication date: 2014-07-10

    Application No.: US14217983

    Filing date: 2014-03-18

    IPC classification: G06F9/45

    Abstract: An embodiment is directed to determining, by a compiler, that a call to a named barrier is matched across all of a plurality of threads, and based at least in part on determining that the call to the named barrier is matched across all of the plurality of threads, replacing, by the compiler, the named barrier with an unnamed barrier.
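As a rough illustration of the transformation in this abstract, the toy pass below works on explicit per-thread operation lists instead of a real compiler IR. The representation (`("barrier", name)` tuples, with `None` standing for an unnamed barrier) and the matching rule (every thread calls the barrier the same number of times) are assumptions made for this sketch; the abstract does not say how the compiler establishes that a call is matched.

```python
def barrier_counts(ops):
    """Count calls to each named barrier in one thread's op list."""
    counts = {}
    for kind, arg in ops:
        if kind == "barrier" and arg is not None:
            counts[arg] = counts.get(arg, 0) + 1
    return counts


def replace_matched_barriers(thread_ops):
    """Replace named barriers that every thread calls equally often.

    thread_ops: list of per-thread op lists. Each op is ("barrier", name)
    or ("work", label). Matched named barriers become ("barrier", None),
    which stands for an unnamed barrier in this toy model.
    """
    per_thread = [barrier_counts(ops) for ops in thread_ops]
    names = set().union(*per_thread)
    matched = {
        n for n in names
        if len({c.get(n, 0) for c in per_thread}) == 1
    }
    return [
        [("barrier", None) if kind == "barrier" and arg in matched else (kind, arg)
         for kind, arg in ops]
        for ops in thread_ops
    ]


threads = [
    [("work", "init"), ("barrier", "a"), ("barrier", "b")],
    [("barrier", "a")],
    [("barrier", "a"), ("barrier", "b")],
]
rewritten = replace_matched_barriers(threads)
```

Here barrier `a` is called once by every thread and is rewritten, while barrier `b` is missing from the second thread and is left named.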

    Data Prefetching and Coalescing for Partitioned Global Address Space Languages
    Type: patent application (legal status: in force)

    Publication No.: US20130167130A1

    Publication date: 2013-06-27

    Application No.: US13659048

    Filing date: 2012-10-24

    IPC classification: G06F9/45

    CPC classification: G06F8/4442 G06F9/4494

    Abstract: An illustrative embodiment of a computer-implemented process for shared data prefetching and coalescing optimization versions a loop containing one or more shared references into an optimized loop and an un-optimized loop, transforms the optimized loop into a set of loops, and stores shared access associated information of the loop using a prologue loop in the set of loops. The shared access associated information pertains to remote data; it is collected by the prologue loop in the absence of network communication and is used to build a hash table. An associated data structure is updated each time the hash table is entered, and is sorted to remove duplicate entries and create a reduced data structure. Patterns across entries of the reduced data structure are identified and entries are coalesced. Data associated with a coalesced entry is pre-fetched using a single communication, and a local buffer is populated with the fetched data for reuse.
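The collect, sort, deduplicate, coalesce, and prefetch sequence in this abstract can be sketched in a few functions. The sketch makes several assumptions that the abstract does not state: accesses are identified by an `(owner, index)` pair, the only pattern recognised is contiguous indices on the same owner, and `remote_get` is a hypothetical stand-in for whatever single communication the runtime provides.

```python
def collect_accesses(indices, owner_of):
    """Prologue loop: record remote accesses without any communication."""
    table = {}   # hash table keyed by (owner, index)
    log = []     # associated data structure, updated on every table entry
    for i in indices:
        key = (owner_of(i), i)
        table[key] = table.get(key, 0) + 1
        log.append(key)
    return table, log


def coalesce(log):
    """Sort, drop duplicates, and merge contiguous indices per owner."""
    reduced = sorted(set(log))
    ranges = []
    for owner, idx in reduced:
        if ranges and ranges[-1][0] == owner and ranges[-1][2] + 1 == idx:
            ranges[-1][2] = idx          # extend the current range
        else:
            ranges.append([owner, idx, idx])
    return [tuple(r) for r in ranges]


def prefetch(ranges, remote_get):
    """Fetch each coalesced range with one call and fill a local buffer."""
    buf = {}
    for owner, start, end in ranges:
        data = remote_get(owner, start, end)   # one communication per range
        for off, val in enumerate(data):
            buf[start + off] = val
    return buf


calls = []

def fake_remote_get(owner, start, end):
    # Hypothetical transport: returns squares so results are easy to check.
    calls.append((owner, start, end))
    return [i * i for i in range(start, end + 1)]


table, log = collect_accesses([5, 3, 4, 3, 10, 11, 4], owner_of=lambda i: i // 8)
ranges = coalesce(log)
local = prefetch(ranges, fake_remote_get)
```

In this run seven recorded accesses reduce to two ranges, so the main loop would read all of its shared data from `local` after only two communications.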

    Instrumentation for nested conditional checks

    Publication No.: US11561778B1

    Publication date: 2023-01-24

    Application No.: US17533475

    Filing date: 2021-11-23

    IPC classification: G06F8/41

    Abstract: Aspects include executing a first phase that includes injecting instrumentation into program code in response to identifying an inner conditional check in the program code and running the instrumented program with a representative workload. The injecting includes duplicating the inner conditional check and placing a duplicate of the inner conditional check before a respective original nested conditional check in the program code to create an instrumented program. The instrumented program includes a plurality of basic blocks including original basic blocks and a newly added basic block that includes the duplicate of the inner conditional check. The method also includes executing a second phase that includes collecting execution frequency values from counters associated with the basic blocks to form metadata used to make optimization decisions for the program code.
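The two phases in this abstract can be illustrated by hand-instrumenting a small function. This is only a sketch under assumptions: the duplicate of the inner check is assumed to sit ahead of the outer check, the inner check is assumed to be free of side effects so that duplicating it preserves behaviour, and the counter names and workload are invented for the example.

```python
from collections import Counter


def original(x, y):
    if x > 0:            # outer conditional check
        if y > 0:        # inner (nested) conditional check
            return "both"
    return "other"


def instrumented(x, y, counters):
    # Newly added basic block: a duplicate of the inner check, placed
    # before the original nested check so that it runs on every call.
    if y > 0:
        counters["dup_inner_true"] += 1
    else:
        counters["dup_inner_false"] += 1
    # Original basic blocks, each with its own counter.
    if x > 0:
        counters["outer_true"] += 1
        if y > 0:
            counters["inner_true"] += 1
            return "both"
    return "other"


def profile(workload):
    """Phase 1: run the instrumented code. Phase 2: collect the counters."""
    counters = Counter()
    results = [instrumented(x, y, counters) for x, y in workload]
    metadata = dict(counters)   # execution frequencies per basic block
    return results, metadata


workload = [(1, 1), (1, -1), (-1, 1), (-1, -1), (-1, 5)]
results, metadata = profile(workload)
```

Because the duplicate runs unconditionally, `dup_inner_true` records how often the inner condition holds regardless of the outer one, which the original nesting alone cannot reveal. An optimizer could use such metadata, for example, when deciding whether reordering the checks is worthwhile.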