Patent search: ap:("Michael D. Linderman" OR "Jamison D. Collins" OR "Perry Wang" OR "Hong Wang") AND inv:"Perry Wang", page 1

1.

Granted patent
Compiler and runtime for heterogeneous multiprocessor systems (status: in force)

Publication No.: US08296743B2

Publication date: 2012-10-23

Application No.: US11958307

Filing date: 2007-12-17

Applicants: Michael D. Linderman, Jamison D. Collins, Perry Wang, Hong Wang

Inventors: Michael D. Linderman, Jamison D. Collins, Perry Wang, Hong Wang

IPC classification: G06F9/45

CPC classification: G06F9/505, G06F2209/5017

Abstract: Presented are embodiments of methods and systems for library-based compilation and dispatch to automatically spread computations of a program across heterogeneous cores in a processing system. The source program contains a parallel-programming keyword, such as mapreduce, from a high-level, library-oriented parallel programming language. The compiler inserts one or more calls for a generic function, associated with the parallel-programming keyword, into the compiled code. A runtime library provides a predicate-based library system that includes multiple hardware specific implementations (“variants”) of the generic function. A runtime dispatch engine dynamically selects the best-available (e.g., most specific) variant, from a bundle of hardware-specific variants, for a given input and machine configuration. That is, the dispatch engine may take into account run-time availability of processing elements, choose one of them, and then select for dispatch an appropriate variant to be executed on the selected processing element. Other embodiments are also described and claimed.
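Illustration (not part of the patent record): the dispatch step in the abstract above can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the patented implementation. The names `Variant`, `dispatch`, `SUM_BUNDLE` and `mapreduce_sum`, the integer `specificity` ranking, and the size threshold of 1000 are all hypothetical; the abstract only says that a variant is chosen by predicate, specificity, and run-time availability of processing elements.

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Set


@dataclass(frozen=True)
class Variant:
    """One hardware-specific implementation of a generic function (hypothetical model)."""
    name: str
    target: str                                  # processing element it runs on
    predicate: Callable[[Sequence[int]], bool]   # guard over the input
    specificity: int                             # assumed ranking: higher is more specific
    impl: Callable[[Sequence[int]], int]


def dispatch(bundle: Sequence[Variant], data: Sequence[int], available: Set[str]) -> Variant:
    """Pick the most specific variant whose predicate holds and whose target is available."""
    candidates = [v for v in bundle if v.target in available and v.predicate(data)]
    if not candidates:
        raise LookupError("no applicable variant for this input and machine configuration")
    return max(candidates, key=lambda v: v.specificity)


# A bundle of variants for one generic function. Every variant uses the
# built-in sum here, so only the selection logic differs between them.
SUM_BUNDLE = [
    Variant("sum_generic_cpu", "cpu", lambda d: True, 0, sum),
    Variant("sum_small_cpu", "cpu", lambda d: len(d) < 1000, 1, sum),
    Variant("sum_large_gpu", "gpu", lambda d: len(d) >= 1000, 2, sum),
]


def mapreduce_sum(data: Sequence[int], available: Set[str]):
    """Stand-in for the call a compiler might insert for a mapreduce keyword."""
    variant = dispatch(SUM_BUNDLE, data, available)
    return variant.name, variant.impl(data)
```

With both `cpu` and `gpu` available, a 2000-element input selects `sum_large_gpu`; if only `cpu` is available, the same input falls back to `sum_generic_cpu`, which mirrors the abstract's point that availability is considered before a variant is dispatched.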

2.

Patent application
Compiler and Runtime for Heterogeneous Multiprocessor Systems (status: in force)

Publication No.: US20090158248A1

Publication date: 2009-06-18

Application No.: US11958307

Filing date: 2007-12-17

Applicants: Michael D. Linderman, Jamison D. Collins, Perry Wang, Hong Wang

Inventors: Michael D. Linderman, Jamison D. Collins, Perry Wang, Hong Wang

IPC classification: G06F9/44

CPC classification: G06F9/505, G06F2209/5017

Abstract: Presented are embodiments of methods and systems for library-based compilation and dispatch to automatically spread computations of a program across heterogeneous cores in a processing system. The source program contains a parallel-programming keyword, such as mapreduce, from a high-level, library-oriented parallel programming language. The compiler inserts one or more calls for a generic function, associated with the parallel-programming keyword, into the compiled code. A runtime library provides a predicate-based library system that includes multiple hardware specific implementations (“variants”) of the generic function. A runtime dispatch engine dynamically selects the best-available (e.g., most specific) variant, from a bundle of hardware-specific variants, for a given input and machine configuration. That is, the dispatch engine may take into account run-time availability of processing elements, choose one of them, and then select for dispatch an appropriate variant to be executed on the selected processing element. Other embodiments are also described and claimed.

3.

Granted patent
Mechanism to exploit synchronization overhead to improve multithreaded performance (status: in force)

Publication No.: US07587584B2

Publication date: 2009-09-08

Application No.: US11070991

Filing date: 2005-03-02

Applicants: Natalie D. Enright, Jamison D. Collins, Perry Wang, Hong Wang, Xinmin Tran, John Shen, Gad Sheaffer, Per Hammarlund

Inventors: Natalie D. Enright, Jamison D. Collins, Perry Wang, Hong Wang, Xinmin Tran, John Shen, Gad Sheaffer, Per Hammarlund

IPC classification: G06F9/00

CPC classification: G06F9/3851, G06F9/3009, G06F9/4843, G06F11/3419, G06F11/348, G06F2201/86, G06F2201/88, G06F2201/885

Abstract: Method, apparatus, and program means for a programmable event driven yield mechanism that may activate other threads. In one embodiment, an apparatus includes execution resources to execute a plurality of instructions and an event detector to detect a long latency event associated with a synchronization object. The event detector can cause a first thread switch in response to the long latency event associated with the synchronization object. The apparatus may also include a spin detector to detect that the synchronization object is a contended synchronization object. The spin detector can cause a second thread switch in response to the detection of the contended synchronization object to enable a spin detect response.

4.

Patent application
Providing A Dedicated Communication Path Separate From A Second Path To Enable Communication Between Complaint Sequencers Of A Processor Using An Assertion Signal (status: in force)

Publication No.: US20130080746A1

Publication date: 2013-03-28

Application No.: US13682111

Filing date: 2012-11-20

Applicants: Perry Wang, Jamison Collins, Hong Wang

Inventors: Perry Wang, Jamison Collins, Hong Wang

IPC classification: G06F9/30

CPC classification: G06F9/30, G06F9/3877, G06F9/3879, G06F9/3881, G06F15/17368, G06F15/7832

Abstract: In one embodiment, the present invention includes a method for communicating an assertion signal from a first instruction sequencer to a plurality of accelerators coupled to the first instruction sequencer, detecting the assertion signal in the accelerators and communicating a request for a lock, and registering an accelerator that achieves the lock by communication of a registration message for the accelerator to the first instruction sequencer. Other embodiments are described and claimed.

5.

Granted patent
Programming environment for heterogeneous processor resource integration (status: in force)

Publication No.: US07941791B2

Publication date: 2011-05-10

Application No.: US11786920

Filing date: 2007-04-13

Applicants: Perry Wang, Jamison Collins, Gautham Chinya, Hong Jiang, Hong Wang, Xinmin Tian, Guei-Yuan Lueh

Inventors: Perry Wang, Jamison Collins, Gautham Chinya, Hong Jiang, Hong Wang, Xinmin Tian, Guei-Yuan Lueh

IPC classification: G06F9/45

CPC classification: G06F9/3879, G06F8/447, G06F8/45, G06F8/47, G06F9/3851

Abstract: Compiling a source code program for a heterogeneous multi-core processor having a first instruction sequencer, having a first instruction set architecture, an accelerator to the first instruction sequencer, wherein the accelerator comprises a heterogeneous resource with respect to the first instruction sequencer having a second instruction set architecture, the source code program having specified therein a region of source code for the first instruction set architecture of the processor and a region of source code for the second instruction set architecture of the processor.

6.

Patent application
Mechanism to exploit synchronization overhead to improve multithreaded performance (status: in force)

Publication No.: US20050149697A1

Publication date: 2005-07-07

Application No.: US11070991

Filing date: 2005-03-02

Applicants: Natalie Enright, Jamison Collins, Perry Wang, Hong Wang, Xinmin Tran, John Shen, Gad Sheaffer, Per Hammarlund

Inventors: Natalie Enright, Jamison Collins, Perry Wang, Hong Wang, Xinmin Tran, John Shen, Gad Sheaffer, Per Hammarlund

IPC classification: G06F9/30, G06F9/38, G06F9/48, G06F11/34

CPC classification: G06F9/3851, G06F9/3009, G06F9/4843, G06F11/3419, G06F11/348, G06F2201/86, G06F2201/88, G06F2201/885

Abstract: Method, apparatus, and program means for a programmable event driven yield mechanism that may activate other threads. In one embodiment, an apparatus includes execution resources to execute a plurality of instructions and an event detector to detect a long latency event associated with a synchronization object. The event detector can cause a first thread switch in response to the long latency event associated with the synchronization object. The apparatus may also include a spin detector to detect that the synchronization object is a contended synchronization object. The spin detector can cause a second thread switch in response to the detection of the contended synchronization object to enable a spin detect response.

7.

Patent application
Providing a dedicated communication path for compliant sequencers (status: in force)

Publication No.: US20090077348A1

Publication date: 2009-03-19

Application No.: US11901178

Filing date: 2007-09-14

Applicants: Perry Wang, Jamison Collins, Hong Wang

Inventors: Perry Wang, Jamison Collins, Hong Wang

IPC classification: G06F15/76, G06F9/02

CPC classification: G06F9/30, G06F9/3877, G06F9/3879, G06F9/3881, G06F15/17368, G06F15/7832

Abstract: In one embodiment, the present invention includes a method for communicating an assertion signal from a first instruction sequencer to a plurality of accelerators coupled to the first instruction sequencer via a dedicated interconnect, detecting the assertion signal in the accelerators and communicating a request for a lock on a second interconnect coupled to the first instruction sequencer and the accelerators, and registering an accelerator that achieves the lock by communication of a registration message for the accelerator to the first instruction sequencer via the second interconnect. Other embodiments are described and claimed.
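Illustration (not part of the patent record): the three steps named in the abstract above (assertion signal, lock request, registration message) can be loosely modeled in software. This is only an analogy under stated assumptions; the patent describes hardware interconnects, not threads. The function name `discover` and the accelerator identifiers are hypothetical, and the sketch assumes that every accelerator eventually obtains the lock in turn and registers, which the abstract does not state.

```python
import queue
import threading


def discover(accelerator_ids):
    """Broadcast an assertion and collect registration messages from accelerators."""
    assertion = threading.Event()    # stands in for the dedicated interconnect
    bus_lock = threading.Lock()      # stands in for the lock on the second interconnect
    registrations = queue.Queue()    # registration messages sent to the sequencer

    def accelerator(acc_id):
        assertion.wait()                 # detect the assertion signal
        with bus_lock:                   # request and obtain the lock
            registrations.put(acc_id)    # register with the first sequencer

    workers = [threading.Thread(target=accelerator, args=(a,)) for a in accelerator_ids]
    for w in workers:
        w.start()
    assertion.set()                      # the sequencer asserts the signal
    for w in workers:
        w.join()

    # All workers have finished, so the queue can be drained without blocking.
    registered = []
    while not registrations.empty():
        registered.append(registrations.get())
    return registered
```

The registration order depends on which thread wins the lock first, so callers of this sketch should not rely on it.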

8.

Patent application
System, method and apparatus for dependency chain processing (status: in force)

Publication No.: US20060070047A1

Publication date: 2006-03-30

Application No.: US10950693

Filing date: 2004-09-28

Applicants: Satish Narayanasamy, Hong Wang, John Shen, Roni Rosner, Yoav Almog, Naftali Schwartz, Gerolf Hoflehner, Daniel LaVery, Wei Li, Xinmin Tian, Milind Girkar, Perry Wang

Inventors: Satish Narayanasamy, Hong Wang, John Shen, Roni Rosner, Yoav Almog, Naftali Schwartz, Gerolf Hoflehner, Daniel LaVery, Wei Li, Xinmin Tian, Milind Girkar, Perry Wang

IPC classification: G06F9/45

CPC classification: G06F8/443, G06F8/433, G06F8/451

Abstract: Embodiments of the present invention provide a method, apparatus and system which may include splitting a dependency chain into a set of reduced-width dependency chains; mapping one or more dependency chains onto one or more clustered dependency chain processors, wherein an issue-width of one or more of the clusters is adapted to accommodate a size of the dependency chains; and/or processing in parallel a plurality of dependency chains of a trace. Other embodiments are described and claimed.

9.

Patent application
Method and system to provide user-level multithreading (status: in force)

Publication No.: US20050223199A1

Publication date: 2005-10-06

Application No.: US10816103

Filing date: 2004-03-31

Applicants: Edward Grochowski, Hong Wang, John Shen, Perry Wang, Jamison Collins, James Held, Partha Kundu, Raya Leviathan, Tin-Fook Ngai

Inventors: Edward Grochowski, Hong Wang, John Shen, Perry Wang, Jamison Collins, James Held, Partha Kundu, Raya Leviathan, Tin-Fook Ngai

IPC classification: G06F9/30, G06F9/38, G06F9/48, G06F9/44

CPC classification: G06F9/30003, G06F9/30087, G06F9/3009, G06F9/30101, G06F9/3013, G06F9/384, G06F9/3851

Abstract: A method and system to provide user-level multithreading are disclosed. The method according to the present techniques comprises receiving programming instructions to execute one or more shared resource threads (shreds) via an instruction set architecture (ISA). One or more instruction pointers are configured via the ISA; and the one or more shreds are executed simultaneously with a microprocessor, wherein the microprocessor includes multiple instruction sequencers.

10.

Patent application
Methods and apparatuses for thread management of mult-threading (status: under examination, published)

Publication No.: US20050071841A1

Publication date: 2005-03-31

Application No.: US10676581

Filing date: 2003-09-30

Applicants: Gerolf Hoflehner, Shih-Wei Liao, Xinmin Tian, Hong Wang, Daniel Lavery, Perry Wang, Dongkeun Kim, Milind Girkar, John Shen

Inventors: Gerolf Hoflehner, Shih-Wei Liao, Xinmin Tian, Hong Wang, Daniel Lavery, Perry Wang, Dongkeun Kim, Milind Girkar, John Shen

IPC classification: G06F9/45, G06F9/46

CPC classification: G06F8/441

Abstract: Methods and apparatuses for thread management for multi-threading are described herein. In one embodiment, exemplary process includes selecting, during a compilation of code having one or more threads executable in a data processing system, a current thread having a most bottom order, determining resources allocated to one or more child threads spawned from the current thread, and allocating resources for the current thread in consideration of the resources allocated to the current thread's one or more child threads to avoid resource conflicts between the current thread and its one or more child threads. Other methods and apparatuses are also described.
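Illustration (not part of the patent record): the bottom-up allocation order in the abstract above can be sketched as a post-order walk over a thread spawn tree. This is a minimal sketch under stated assumptions, not the patented method. The function name `allocate_bottom_up`, the modeling of resources as numbered registers from a fixed pool, and the lowest-free-register policy are all hypothetical. The sketch only avoids conflicts between a thread and its direct children, as the abstract describes, and lets sibling threads reuse the same registers.

```python
def allocate_bottom_up(spawns, needs, pool_size, root):
    """Assign registers so that no thread shares one with its direct children.

    spawns:    dict mapping a thread name to the list of threads it spawns
    needs:     dict mapping a thread name to the number of registers it needs
    pool_size: total number of registers, numbered 0 .. pool_size - 1
    root:      the top-level thread
    """
    assigned = {}

    def visit(thread):
        children = spawns.get(thread, [])
        # Handle the bottom-most threads first.
        for child in children:
            visit(child)
        # Collect what the children already hold.
        taken = set()
        for child in children:
            taken |= assigned[child]
        # Give the current thread the lowest registers its children do not use.
        free = [r for r in range(pool_size) if r not in taken]
        if len(free) < needs[thread]:
            raise RuntimeError(f"not enough registers for thread {thread!r}")
        assigned[thread] = set(free[:needs[thread]])

    visit(root)
    return assigned
```

For a tree where `main` spawns `a` and `b`, and `a` spawns `a1`, the leaf `a1` is allocated first, then `a` avoids the register held by `a1`, and `main` finally avoids the registers held by `a` and `b`.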
