Patent search ap:"Tin Fook NGAI" Page 3

21.

Granted patent
Method and system to provide concurrent user-level, non-privileged shared resource thread creation and execution
Legal status: Active

Publication number: US09189230B2

Publication date: 2015-11-17

Application number: US10816103

Filing date: 2004-03-31

Applicant: Edward T. Grochowski, Hong Wang, John P. Shen, Perry H. Wang, Jamison D. Collins, James P. Held, Partha Kundu, Raya Leviathan, Tin-Fook Ngai

Inventor: Edward T. Grochowski, Hong Wang, John P. Shen, Perry H. Wang, Jamison D. Collins, James P. Held, Partha Kundu, Raya Leviathan, Tin-Fook Ngai

IPC: G06F9/30, G06F9/38

CPC classification number: G06F9/30003, G06F9/30087, G06F9/3009, G06F9/30101, G06F9/3013, G06F9/384, G06F9/3851

Abstract: A method and system to provide user-level multithreading are disclosed. The method according to the present techniques comprises receiving programming instructions to execute one or more shared resource threads (shreds) via an instruction set architecture (ISA). One or more instruction pointers are configured via the ISA; and the one or more shreds are executed simultaneously with a microprocessor, wherein the microprocessor includes multiple instruction sequencers.

22.

Patent application
SYSTEMS, APPARATUSES, AND METHODS FOR SETTING AN OUTPUT MASK IN A DESTINATION WRITEMASK REGISTER FROM A SOURCE WRITE MASK REGISTER USING AN INPUT WRITEMASK AND IMMEDIATE
Legal status: Active

Publication number: US20140223139A1

Publication date: 2014-08-07

Application number: US13991877

Filing date: 2011-12-23

Applicant: Victor W. Lee, Daehyun Kim, Tin-Fook Ngai, Jayashankar Bharadwaj, Albert Hartono, Sara Baghsorkhi, Nalini Vasudevan

Inventor: Victor W. Lee, Daehyun Kim, Tin-Fook Ngai, Jayashankar Bharadwaj, Albert Hartono, Sara Baghsorkhi, Nalini Vasudevan

IPC: G06F9/30

CPC classification number: G06F9/30036, G06F9/30018, G06F9/30021, G06F9/30025, G06F9/30072, G06F15/8007, G06F15/8053, G06F15/8084, G06F15/8092

Abstract: Embodiments of systems, apparatuses, and methods for performing in a computer processor generation of a predicate mask based on vector comparison in response to a single instruction are described.

23.

Patent application
Method and System for Intermediate Representation of Source Code
Legal status: Active

Publication number: US20100023931A1

Publication date: 2010-01-28

Application number: US12178842

Filing date: 2008-07-24

Applicant: Buqi Cheng, Tin-Fook Ngai, Zhaohui Du, PeiNan Zhang

Inventor: Buqi Cheng, Tin-Fook Ngai, Zhaohui Du, PeiNan Zhang

IPC: G06F9/45

CPC classification number: G06F8/433

Abstract: A method to provide effective control and data flow information in an Intermediate Representation (IR) form. A Path Sensitive single Assignment (PSA) IR form with effective and explicit control and data path information supports control flow sensitive optimizations such as path sensitive symbolic substitution, array privatization and speculative multi threading. In the definition of PSA form, besides defining new versioned variables, the gamma functions keep control path information. The gamma function in PSA form keeps the basic attribute of SSA IR form and only one definition exists for each use. Therefore, all existing Single Static Assignment (SSA) IR form based analysis can be applied in PSA form. The gamma function in PSA form keeps all essential control flow information and eliminates unnecessary predicates at the same time.
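
As a rough illustration of the gamma function described in the abstract above, the following Python sketch models a merge node that records the branch predicate alongside the versioned values it selects from. The names `Var`, `Gamma`, and `evaluate` are hypothetical and are not taken from the patent; the sketch only illustrates that each use still sees a single definition while the controlling predicate stays explicit.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Union

Env = Dict[str, int]  # maps versioned variable names to values


@dataclass(frozen=True)
class Var:
    """A versioned variable such as x1 or x2 (hypothetical helper)."""
    name: str


@dataclass(frozen=True)
class Gamma:
    """Merge node that keeps the control predicate, unlike a plain phi."""
    predicate: Callable[[Env], bool]
    if_true: "Node"
    if_false: "Node"


Node = Union[Var, Gamma]


def evaluate(node: Node, env: Env) -> int:
    """Resolve a node to a value by following the recorded predicates."""
    if isinstance(node, Var):
        return env[node.name]
    chosen = node.if_true if node.predicate(env) else node.if_false
    return evaluate(chosen, env)


# Source:  if (c > 0) x = a + 1; else x = a - 1;  use(x)
# Sketch:  x1 = a + 1; x2 = a - 1; x3 = gamma(c > 0, x1, x2)
x3 = Gamma(lambda env: env["c"] > 0, Var("x1"), Var("x2"))
```

Evaluating `x3` with `{"c": 1, "x1": 11, "x2": 9}` selects the value of `x1`; a path-sensitive analysis could inspect `predicate` directly instead of reconstructing it from the control-flow graph.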

24.

Granted patent
Computation of all-pairs reaching probabilities in software systems
Legal status: Active

Publication number: US07379858B2

Publication date: 2008-05-27

Application number: US10778545

Filing date: 2004-02-17

Applicant: Chu-Cheow Lim, Zhao Hui Du, Tin-Fook Ngai

Inventor: Chu-Cheow Lim, Zhao Hui Du, Tin-Fook Ngai

IPC: G06F13/22

CPC classification number: G06F9/44


Abstract: A Markov chain model of a software system may be used to compute all-pairs reaching probabilities to provide guidance in performing speculative operations with respect to the software system.
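
To make the idea of all-pairs reaching probabilities concrete, here is a minimal Python sketch, assuming the software system is given as a row-stochastic transition matrix over basic blocks. The function name, the fixed-point iteration, and the example matrix are illustrative assumptions; the abstract does not say how the patent computes these values.

```python
def reaching_probabilities(P, max_iterations=1000, tol=1e-12):
    """Return reach[i][j]: probability that a walk starting at state i
    visits state j after one or more steps of the Markov chain P.

    Assumption: P is a square list of lists whose rows sum to 1.
    For each target j we iterate x_i = P[i][j] + sum_{k != j} P[i][k] * x_k
    starting from zero, which converges to the reaching probability.
    """
    n = len(P)
    reach = [[0.0] * n for _ in range(n)]
    for j in range(n):
        x = [0.0] * n
        for _ in range(max_iterations):
            new_x = [
                P[i][j] + sum(P[i][k] * x[k] for k in range(n) if k != j)
                for i in range(n)
            ]
            delta = max(abs(a - b) for a, b in zip(new_x, x))
            x = new_x
            if delta < tol:
                break
        for i in range(n):
            reach[i][j] = x[i]
    return reach


# Hypothetical control-flow graph: block 0 branches to 1 (70%) or 2 (30%),
# block 2 branches to 3 or 4 evenly, and block 4 is the exit (self-loop).
P = [
    [0.0, 0.7, 0.3, 0.0, 0.0],
    [0.0, 0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 0.5, 0.5],
    [0.0, 0.0, 0.0, 0.0, 1.0],
    [0.0, 0.0, 0.0, 0.0, 1.0],
]
reach = reaching_probabilities(P)
```

In this example the probability of reaching block 3 from block 0 is 0.7 * 1.0 + 0.3 * 0.5 = 0.85, which is the kind of figure a compiler might consult before speculating on block 3.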

25.

Granted patent
Run-ahead program execution with value prediction
Legal status: Lapsed

Publication number: US07188234B2

Publication date: 2007-03-06

Application number: US10017793

Filing date: 2001-12-12

Applicant: Youfeng Wu, Tin-Fook Ngai

Inventor: Youfeng Wu, Tin-Fook Ngai

IPC: G06F9/312

CPC classification number: G06F9/383, G06F9/3832, G06F9/3842, G06F9/3861

Abstract: A data processing apparatus, a computer, an article including a machine-accessible medium, and a method of processing data are disclosed. The data processing apparatus may include a pair of pipelines sharing an instruction cache, data cache, and a branch predictor with the second pipeline running ahead of the first pipeline using a data value prediction module. The pipelines may be included in one or more processors and coupled to a memory to form a computer. The method includes executing a plurality of instructions using the pipeline pair, such that when a cache miss is encountered by the second pipeline during execution of a LOAD instruction, the data value prediction module supplies a predicted load value in lieu of a cached value, enabling continued execution of the plurality of instructions by the second pipeline without waiting for the return of the cached value.
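
The load-miss behaviour described in the abstract above can be sketched in a few lines of Python. This is a simplified software model, not the patented hardware: the last-value prediction scheme, the dictionary used as a cache, and the names `LastValuePredictor` and `run_ahead_load` are all assumptions made for illustration.

```python
class LastValuePredictor:
    """Predicts that a LOAD returns the value it returned last time.

    Assumption: the abstract names a data value prediction module but not
    its algorithm; last-value prediction is used here only as an example.
    """

    def __init__(self):
        self._last = {}  # load instruction address -> last observed value

    def predict(self, pc):
        return self._last.get(pc, 0)

    def train(self, pc, actual):
        self._last[pc] = actual


def run_ahead_load(pc, address, cache, predictor):
    """Model a LOAD in the run-ahead pipeline.

    Returns (value, speculative). On a cache hit the real value is used;
    on a miss the predicted value is supplied so execution can continue
    without waiting for memory.
    """
    if address in cache:
        return cache[address], False
    return predictor.predict(pc), True


cache = {0x100: 7}  # hypothetical cache contents: address -> value
predictor = LastValuePredictor()
predictor.train(pc=0x40, actual=42)

hit = run_ahead_load(0x40, 0x100, cache, predictor)
miss = run_ahead_load(0x40, 0x200, cache, predictor)
```

Here `hit` carries the cached value and is not speculative, while `miss` carries the predicted value 42 and is flagged speculative so that a trailing pipeline could later verify it.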

26.

Granted patent
Method and apparatus for prediction for fork and join instructions in speculative execution
Legal status: Active

Publication number: US07181601B2

Publication date: 2007-02-20

Application number: US10731567

Filing date: 2003-12-08

Applicant: Kevin W. Rudd, Tin-Fook Ngai

Inventor: Kevin W. Rudd, Tin-Fook Ngai

IPC: G06F9/38

CPC classification number: G06F9/3009, G06F9/30072, G06F9/3842, G06F9/3844, G06F9/3851

Abstract: A method and apparatus for enabling the speculative forking of a speculative thread is disclosed. In one embodiment, a speculative fork instruction is conditioned by the results of a fork predictor. The fork predictor may issue predictions as to whether or not a speculative thread would execute desirably. The fork predictor may be implemented as a modified branch predictor circuit, and may have execution history updates entered by a determination of whether or not the execution of a speculative thread was or would have been desirable.
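
Since the abstract says the fork predictor may be built as a modified branch predictor, one way to picture it is a table of two-bit saturating counters indexed by the address of the fork instruction. The Python sketch below is only that picture: the counter width, table size, indexing, and the names `ForkPredictor`, `should_fork`, and `update` are assumptions, not details from the patent.

```python
class ForkPredictor:
    """Two-bit saturating-counter predictor keyed by fork instruction address.

    Counter values 0-1 mean "do not fork"; values 2-3 mean "fork".
    """

    def __init__(self, table_size=1024, initial=2):
        self._table = [initial] * table_size

    def _index(self, fork_pc):
        return fork_pc % len(self._table)

    def should_fork(self, fork_pc):
        """Predict whether spawning the speculative thread is desirable."""
        return self._table[self._index(fork_pc)] >= 2

    def update(self, fork_pc, was_desirable):
        """Record whether forking was, or would have been, desirable."""
        i = self._index(fork_pc)
        if was_desirable:
            self._table[i] = min(3, self._table[i] + 1)
        else:
            self._table[i] = max(0, self._table[i] - 1)


predictor = ForkPredictor()
first = predictor.should_fork(0x400)      # initial state is weakly "fork"
predictor.update(0x400, was_desirable=False)
second = predictor.should_fork(0x400)     # one bad outcome flips the prediction
```

Because `update` also accepts outcomes for forks that were suppressed, the history can be trained on whether a speculative thread "would have been" desirable, as the abstract describes.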

27.

Patent application
METHOD AND SYSTEM TO PROVIDE USER-LEVEL MULTITHREADING
Legal status: Under examination (published)

Publication number: US20180225117A1

Publication date: 2018-08-09

Application number: US15943611

Filing date: 2018-04-02

Applicant: Ed Grochowski, Hong Wang, John P. Shen, Perry H. Wang, Jamison D. Collins, James Held, Partha Kundu, Raya Leviathan, Tin-Fook Ngai

Inventor: Ed Grochowski, Hong Wang, John P. Shen, Perry H. Wang, Jamison D. Collins, James Held, Partha Kundu, Raya Leviathan, Tin-Fook Ngai

IPC: G06F9/30, G06F9/38

CPC classification number: G06F9/30003, G06F9/30087, G06F9/3009, G06F9/30101, G06F9/3013, G06F9/384, G06F9/3851

Abstract: A method and system to provide user-level multithreading are disclosed. The method according to the present techniques comprises receiving programming instructions to execute one or more shared resource threads (shreds) via an instruction set architecture (ISA). One or more instruction pointers are configured via the ISA; and the one or more shreds are executed simultaneously with a microprocessor, wherein the microprocessor includes multiple instruction sequencers.