专利检索 ap:("Shih-wei Liao" OR "Xinmin Tian" OR "Gerolf F. Hoflehner" OR "Hong Wang" OR "Daniel M. Lavery" OR "Perry Wang" OR "Dongkeun Kim" OR "Milind Girkar" OR "John P. Shen") AND inv:"Shih-wei Liao" 第 1 页

1.

发明授权
Methods and apparatuses for thread management of multi-threading 失效
标题翻译：多线程线程管理方法与设备

公开(公告)号：US07398521B2

公开(公告)日：2008-07-08

申请号：US10779193

申请日：2004-02-13

申请人： Gerolf F. Hoflehner , Shih-wei Liao , Xinmin Tian , Hong Wang , Daniel M. Lavery , Perry Wang , Dongkeun Kim , Milind Girkar , John P. Shen

发明人： Gerolf F. Hoflehner , Shih-wei Liao , Xinmin Tian , Hong Wang , Daniel M. Lavery , Perry Wang , Dongkeun Kim , Milind Girkar , John P. Shen

IPC分类号： G06F9/45

CPC分类号： G06F8/441

摘要： Methods and apparatuses for thread management for multi-threading are described herein. In one embodiment, exemplary process includes selecting, during a compilation of code having one or more threads executable in a data processing system, a current thread having a most bottom order, determining resources allocated to one or more child threads spawned from the current thread, and allocating resources for the current thread in consideration of the resources allocated to the current thread's one or more child threads to avoid resource conflicts between the current thread and its one or more child threads. Other methods and apparatuses are also described.

摘要翻译： 本文描述了用于多线程的线程管理的方法和装置。在一个实施例中，示例性过程包括在具有在数据处理系统中可执行的一个或多个线程的代码的编译期间选择具有最低阶的当前线程，确定分配给从当前线程产生的一个或多个子线程的资源，并且考虑分配给当前线程的一个或多个子线程的资源来为当前线程分配资源，以避免当前线程与其一个或多个子线程之间的资源冲突。还描述了其它方法和装置。

2.

发明授权
Methods and apparatuses for compiler-creating helper threads for multi-threading 有权
标题翻译：用于多线程的编译器创建帮助线程的方法和设备

公开(公告)号：US08612949B2

公开(公告)日：2013-12-17

申请号：US12650630

申请日：2009-12-31

申请人： Shih-wei Liao , Xinmin Tian , Gerolf F. Hoflehner , Hong Wang , Daniel M. Lavery , Perry Wang , Dongkeun Kim , Milind Girkar , John P. Shen

发明人： Shih-wei Liao , Xinmin Tian , Gerolf F. Hoflehner , Hong Wang , Daniel M. Lavery , Perry Wang , Dongkeun Kim , Milind Girkar , John P. Shen

IPC分类号： G06F9/45

CPC分类号： G06F9/3842 , G06F8/4442 , G06F9/383 , G06F9/3851

摘要： Methods and apparatuses for compiler-created helper thread for multi-threading are described herein. In one embodiment, exemplary process includes identifying a region of a main thread that likely has one or more delinquent loads, the one or more delinquent loads representing loads which likely suffer cache misses during an execution of the main thread, analyzing the region for one or more helper threads with respect to the main thread, and generating code for the one or more helper threads, the one or more helper threads being speculatively executed in parallel with the main thread to perform one or more tasks for the region of the main thread. Other methods and apparatuses are also described.

摘要翻译： 本文描述了用于多线程的编译器创建的辅助线程的方法和装置。在一个实施例中，示例性过程包括识别可能具有一个或多个拖欠负载的主线程的区域，所述一个或多个违规负载表示在执行主线程期间可能遭受高速缓存未命中的负载，分析该区域中的一个或多个相对于主线程的更多帮助线程，以及为一个或多个辅助线程生成代码，一个或多个辅助线程与主线程并行地被推测地执行，以对主线程的区域执行一个或多个任务。还描述了其它方法和装置。

3.

发明申请
Methods and apparatuses for thread management of multi-threading 失效
标题翻译：多线程线程管理方法与设备

公开(公告)号：US20050081207A1

公开(公告)日：2005-04-14

申请号：US10779193

申请日：2004-02-13

申请人： Gerolf Hoflehner , Shih-wei Liao , Xinmin Tian , Hong Wang , Daniel Lavery , Perry Wang , Dongkeun Kim , Milind Girkar , John Shen

发明人： Gerolf Hoflehner , Shih-wei Liao , Xinmin Tian , Hong Wang , Daniel Lavery , Perry Wang , Dongkeun Kim , Milind Girkar , John Shen

IPC分类号： G06F9/45 , G06F9/46

CPC分类号： G06F8/441

摘要： Methods and apparatuses for thread management for multi-threading are described herein. In one embodiment, exemplary process includes selecting, during a compilation of code having one or more threads executable in a data processing system, a current thread having a most bottom order, determining resources allocated to one or more child threads spawned from the current thread, and allocating resources for the current thread in consideration of the resources allocated to the current thread's one or more child threads to avoid resource conflicts between the current thread and its one or more child threads. Other methods and apparatuses are also described.

摘要翻译： 本文描述了用于多线程的线程管理的方法和装置。在一个实施例中，示例性过程包括在具有在数据处理系统中可执行的一个或多个线程的代码的编译期间选择具有最低阶的当前线程，确定分配给从当前线程产生的一个或多个子线程的资源，并且考虑分配给当前线程的一个或多个子线程的资源来为当前线程分配资源，以避免当前线程与其一个或多个子线程之间的资源冲突。还描述了其它方法和装置。

4.

发明授权
Methods and apparatus for reducing memory latency in a software application 有权
标题翻译：减少软件应用程序内存延迟的方法和装置

公开(公告)号：US07328433B2

公开(公告)日：2008-02-05

申请号：US10677414

申请日：2003-10-02

申请人： Xinmin Tian , Shih-wei Liao , Hong Wang , Milind Girkar , John Shen , Perry Wang , Grant Haab , Gerolf Hoflehner , Daniel Lavery , Hideki Saito , Sanjiv Shah , Dongkeun Kim

发明人： Xinmin Tian , Shih-wei Liao , Hong Wang , Milind Girkar , John Shen , Perry Wang , Grant Haab , Gerolf Hoflehner , Daniel Lavery , Hideki Saito , Sanjiv Shah , Dongkeun Kim

IPC分类号： G06F9/44

CPC分类号： G06F9/3851 , G06F8/4442 , G06F9/383 , G06F9/4843 , G06F9/52

摘要： Methods and apparatus for reducing memory latency in a software application are disclosed. A disclosed system uses one or more helper threads to prefetch variables for a main thread to reduce performance bottlenecks due to memory latency and/or a cache miss. A performance analysis tool is used to profile the software application's resource usage and identifies areas in the software application experiencing performance bottlenecks. Compiler-runtime instructions are generated into the software application to create and manage the helper thread. The helper thread prefetches data in the identified areas of the software application experiencing performance bottlenecks. A counting mechanism is inserted into the helper thread and a counting mechanism is inserted into the main thread to coordinate the execution of the helper thread with the main thread and to help ensure the prefetched data is not removed from the cache before the main thread is able to take advantage of the prefetched data.

摘要翻译： 公开了一种用于减少软件应用中的存储器延迟的方法和装置。所公开的系统使用一个或多个辅助线程来预取主线程的变量，以减少由于存储器延迟和/或高速缓存未命中引起的性能瓶颈。使用性能分析工具来描述软件应用程序的资源使用情况，并识别遇到性能瓶颈的软件应用程序中的区域。编译器运行时指令生成到软件应用程序中以创建和管理辅助线程。辅助线程预取了遇到性能瓶颈的软件应用程序的已识别区域中的数据。计数机制被插入到辅助线程中，并且计数机制被插入到主线程中以协调辅助线程与主线程的执行，并且有助于确保在主线程可用之前预取数据不被从高速缓存中移除以利用预取的数据。

5.

发明授权
Compiler-based scheduling optimization hints for user-level threads 有权
标题翻译：基于编译器的调度优化提示用户级线程

公开(公告)号：US08205200B2

公开(公告)日：2012-06-19

申请号：US11289803

申请日：2005-11-29

申请人： Shih-wei Liao , Ryan N. Rakvic , Richard A. Hankins , Hong Wang , Gansha Wu , Guei-Yuan Lueh , Xinmin Tian , Paul M. Petersen , Sanjiv Shah , Trung Diep , John Shen , Gautham Chinya

发明人： Shih-wei Liao , Ryan N. Rakvic , Richard A. Hankins , Hong Wang , Gansha Wu , Guei-Yuan Lueh , Xinmin Tian , Paul M. Petersen , Sanjiv Shah , Trung Diep , John Shen , Gautham Chinya

IPC分类号： G06F9/44 , G06F9/46

CPC分类号： G06F9/485 , G06F9/4881

摘要： Method, apparatus and system embodiments to schedule user-level OS-independent “shreds” without intervention of an operating system. For at least one embodiment, the shred is scheduled for execution by a scheduler routine rather than the operating system. The scheduler routine may receive compiler-generated hints from a compiler. The compiler hints may be generated by the compiler without user-provided pragmas, and may be passed to the scheduler routine via an API-like interface. The interface may include a scheduling hint data structure that is maintained by the compiler. Other embodiments are also described and claimed.

摘要翻译： 方法，装置和系统实施例来调度用户级别的与OS无关的“碎片”，而不需要操作系统的干预。对于至少一个实施例，碎片被调度为由调度器例程而不是操作系统执行。调度程序例程可以从编译器接收编译器生成的提示。编译器提示可能由编译器生成，而不需要用户提供的编译指示，并且可以通过类API接口传递给调度程序。接口可以包括由编译器维护的调度提示数据结构。还描述和要求保护其他实施例。

6.

发明申请
Generating efficient parallel code using partitioning, coalescing, and degenerative loop and guard removal 有权
标题翻译：使用分区，聚结和退化循环和保护去除生成高效的并行代码

公开(公告)号：US20070079281A1

公开(公告)日：2007-04-05

申请号：US11239818

申请日：2005-09-30

申请人： Shih-wei Liao , Zhao Du , Bu Cheng , Gansha Wu , Guei-Yuan Lueh

发明人： Shih-wei Liao , Zhao Du , Bu Cheng , Gansha Wu , Guei-Yuan Lueh

IPC分类号： G06F9/44

CPC分类号： G06F8/45

摘要： Code is affine partitioned to generate affine partitioning mappings. Parallel code is generated based on the affine partitioning mappings. Generating the parallel code includes coalescing loops in the parallel code generated from the affine partitioning mappings to generate coalesced parallel code and optimizing the coalesced parallel code.

摘要翻译： 代码被仿射分割以生成仿射分割映射。基于仿射分割映射生成并行代码。生成并行代码包括从仿射分割映射生成的并行代码中的合并循环，以生成合并并行代码并优化合并并行代码。

7.

发明申请
System and method for generating object code for map-reduce idioms in multiprocessor systems 审中-公开
标题翻译：用于生成多处理器系统中map-reduce习语的目标代码的系统和方法

公开(公告)号：US20080127146A1

公开(公告)日：2008-05-29

申请号：US11516292

申请日：2006-09-06

申请人： Shih-wei Liao , Bo Huang , Guilin Chen

发明人： Shih-wei Liao , Bo Huang , Guilin Chen

IPC分类号： G06F9/45

CPC分类号： G06F8/456

摘要： Methods and systems are provided for recognizing and processing reduction operations to optimize generated binary code for execution in a multiprocessor computer system. Reduction operations facilitate data parallelism whereby each processing thread contributes a value and the values are reduced using a function to obtain and return a reduced value to each of the threads. Embodiments of an idiom-based interprocedural compiler provide a unified framework for processing both implicit and explicit reductions. The compiler integrates explicit reductions and implicit reductions by providing a uniform intermediate format. The compiler resolves dependencies among processing threads within program code by checking for privatization of dependent threads or parallelizing reduction idioms within the threads, and generates parallelized object code for execution in a multiprocessor computer.

摘要翻译： 提供了用于识别和处理缩减操作以优化用于在多处理器计算机系统中执行的生成的二进制代码的方法和系统。缩减操作有助于数据并行性，从而每个处理线程提供一个值，并且通过使用函数来减小值以减少每个线程的值。基于习语的过程间编译器的实施例提供了用于处理隐式和显式缩减的统一框架。编译器通过提供统一的中间格式来集成显式的减少和隐式的减少。编译器通过检查线程内的依赖线程的私有化或并行化还原成语来解决程序代码内的处理线程之间的依赖关系，并生成用于在多处理器计算机中执行的并行化目标代码。

8.

发明申请
Computation transformations for streaming applications on multiprocessors 有权
标题翻译：多处理器上的流应用的计算转换

公开(公告)号：US20070003161A1

公开(公告)日：2007-01-04

申请号：US11170978

申请日：2005-06-30

申请人： Shih-wei Liao , Zhaohui Du , Gansha Wu , Ken Lueh , Zhiwei Ying , Jinzhan Peng

发明人： Shih-wei Liao , Zhaohui Du , Gansha Wu , Ken Lueh , Zhiwei Ying , Jinzhan Peng

IPC分类号： G06K9/36

CPC分类号： G06F8/456

摘要： A method including providing a stream of content to a processor, transforming kernels within the stream of content through affine modeling, transforming the affine modeled kernels, stream contracting kernel processes, and stream blocking the kernel processes.

摘要翻译： 一种方法，包括向处理器提供内容流，通过仿射建模，改变仿射建模的内核，流收缩内核进程和流封锁内核进程来转换内容流内的内核。

9.

发明授权
Computation transformations for streaming applications on multiprocessors 有权
标题翻译：多处理器上的流应用的计算转换

公开(公告)号：US07953158B2

公开(公告)日：2011-05-31

申请号：US11170978

申请日：2005-06-30

申请人： Shih-wei Liao , Zhaohui Du , Gansha Wu , Ken Lueh , Zhiwei Ying , Jinzhan Peng

发明人： Shih-wei Liao , Zhaohui Du , Gansha Wu , Ken Lueh , Zhiwei Ying , Jinzhan Peng

IPC分类号： H04N7/18

CPC分类号： G06F8/456

摘要： A method including providing a stream of content to a processor, transforming kernels within the stream of content through affine modeling, transforming the affine modeled kernels, stream contracting kernel processes, and stream blocking the kernel processes.

摘要翻译： 一种方法，包括向处理器提供内容流，通过仿射建模，改变仿射建模的内核，流收缩内核进程和流封锁内核进程来转换内容流内的内核。

10.

发明授权
Splitting the computation space to optimize parallel code 失效
标题翻译：拆分计算空间以优化并行代码

公开(公告)号：US07689980B2

公开(公告)日：2010-03-30

申请号：US11241396

申请日：2005-09-30

申请人： Zhao Hui Du , Shih-wei Liao , Gansha Wu , Guei-Yuan Lueh

发明人： Zhao Hui Du , Shih-wei Liao , Gansha Wu , Guei-Yuan Lueh

IPC分类号： G06F9/45

CPC分类号： G06F8/453 , G06F8/456 , G06F11/3604 , G06F11/3624

摘要： Linear transformations of statements in code are performed to generate linear expressions associated with the statements. Parallel code is generated using the linear expressions. Generating the parallel code includes splitting the computation-space of the statements into intervals and generating parallel code for the intervals.

摘要翻译： 执行代码中的语句的线性变换以生成与语句相关联的线性表达式。使用线性表达式生成并行代码。生成并行代码包括将语句的计算空间划分为间隔并为间隔生成并行代码。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类