Patent search ap:("INTEL CORPORATION") AND inv:"TIAN Page Xinmin"

1.

发明申请
TECHNOLOGIES FOR INDIRECTLY CALLING VECTOR FUNCTIONS 审中-公开
Title translation: 间接呼叫矢量功能的技术

公开(公告)号：WO2017153796A1

公开(公告)日：2017-09-14

申请号：PCT/IB2016/000404

申请日：2016-03-11

Applicant: INTEL CORPORATION

Inventor： IDO, Hideki, Saito , PREIS, Serge V. , KOZHUKHOV, Sergey S. , TIAN, Xinmin , MASLOV, Sergey V. , NELSON, Clark , YU, Jianfei

IPC: G06F9/45

Abstract: Technologies for indirectly calling vector functions include a compute device that includes a memory device to store source code and a compiler module. The compiler module is to identify a set of declarations of vector variants for scalar functions in the source code, generate a vector variant address map for each set of vector variants, generate an offset map for each scalar function, and identify, in the source code, an indirect call to the scalar functions, wherein the indirect call is to be vectorized. The compiler module is also to determine, based on a context of the indirect call, a vector variant to be called and store, in object code and in association with the indirect call, an offset into one of the vector variant address maps based on (i) the determined vector variant to be called and (ii) the offset map that corresponds to each scalar function.

Abstract translation: 用于间接调用向量函数的技术包括包含存储源代码的存储器设备和编译器模块的计算设备。编译器模块将为源代码中的标量函数标识矢量变体的一组声明，为每组矢量变体生成矢量变体地址映射，为每个标量函数生成偏移映射，并在源代码中标识，间接调用标量函数，其中间接调用将被矢量化。编译器模块还基于间接调用的上下文来确定待调用的矢量变体，并且以目标代码并且与间接调用相关联地将偏移量存储到矢量变体地址映射之一中，基于（ i）确定的要调用的矢量变体和（ii）与每个标量函数相对应的偏移映射。

2.

发明申请
SPECULATIVE COMPILATION TO GENERATE ADVICE MESSAGES 审中-公开
Title translation: 用于产生建议消息的抽样编译

公开(公告)号：WO2012064690A2

公开(公告)日：2012-05-18

申请号：PCT/US2011/059701

申请日：2011-11-08

Applicant: INTEL CORPORATION , KRISHNAIYER, Rakesh , IDO, Hideki Saito , SU, Ernesto , NG, John L. , LIN, Jin , TIAN, Xinmin , GEVA, Robert Y.

Inventor： KRISHNAIYER, Rakesh , IDO, Hideki Saito , SU, Ernesto , NG, John L. , LIN, Jin , TIAN, Xinmin , GEVA, Robert Y.

IPC: G06F9/45

CPC classification number: G06F8/4441 , G06F8/41 , G06F11/3664

Abstract: Methods to improve optimization of compilation are presented. In one embodiment, a method includes identifying one or more optimization speculations with respect to a code region and speculatively performing transformation on an intermediate representation of the code region in accordance with an optimization speculation. The method includes generating an advice message corresponding to the optimization speculation and displaying the advice message if the optimization speculation results in an improved compilation result.

Abstract translation: 介绍了改进编译优化的方法。在一个实施例中，一种方法包括根据优化推测识别关于代码区域的一个或多个优化推测并且推测性地对代码区域的中间表示进行变换。该方法包括：如果优化推测导致改进的编译结果，则生成对应于优化推测的建议消息并显示建议消息。

3.

发明申请
LOOP PARALLELIZATION BASED ON LOOP SPLITTING OR INDEX ARRAY 审中-公开
Title translation: 基于循环分割或索引阵列的并行化

公开(公告)号：WO2012087988A2

公开(公告)日：2012-06-28

申请号：PCT/US2011/065948

申请日：2011-12-19

Applicant: INTEL CORPORATION , LIN, Jin , RAVI, Nishkam , TIAN, Xinmin , NG, John L. , VALIULLIN, Renat V.

Inventor： LIN, Jin , RAVI, Nishkam , TIAN, Xinmin , NG, John L. , VALIULLIN, Renat V.

IPC: G06F9/38 , G06F9/45

CPC classification number: G06F8/456 , G06F8/4441

Abstract: Methods and apparatus to provide loop parallelization based on loop splitting and/or index array are described. In one embodiment, one or more split loops, corresponding to an original loop, are generated based on the mis-speculation information. In another embodiment, a plurality of subloops are generated from an original loop based on an index array. Other embodiments are also described.

Abstract translation: 描述了基于环路分离和/或索引阵列来提供环路并行化的方法和设备。在一个实施例中，基于错误推测信息来生成对应于原始循环的一个或多个分离环。在另一个实施例中，基于索引阵列从原始循环生成多个子环路。还描述了其他实施例。

4.

发明申请
COMPILER-BASED SCHEDULING OPTIMIZATIONS FOR USER-LEVEL THREADS 审中-公开
Title translation: 基于编译器的用户级线程调度优化

公开(公告)号：WO2007064490A1

公开(公告)日：2007-06-07

申请号：PCT/US2006/044587

申请日：2006-11-16

Applicant: INTEL CORPORATION , LIAO, Shih-Wei , RAKVIC, Ryan, N. , HANKINS, Richard, A. , WANG, Hong , WU, Gansha , LUEH, Guei-yuan , TIAN, Xinmin , PETERSEN, Paul, M. , SHAH, Sanjiv , DIEP, Trung , SHEN, John , CHINYA, Gautham

Inventor： LIAO, Shih-Wei , RAKVIC, Ryan, N. , HANKINS, Richard, A. , WANG, Hong , WU, Gansha , LUEH, Guei-yuan , TIAN, Xinmin , PETERSEN, Paul, M. , SHAH, Sanjiv , DIEP, Trung , SHEN, John , CHINYA, Gautham

IPC: G06F9/48

CPC classification number: G06F9/485 , G06F9/4881

Abstract: Method, apparatus and system embodiments to schedule user-level OS- independent "shreds" without intervention of an operating system. For at least one embodiment, the shred is scheduled for execution by a scheduler routine rather than the operating system. The scheduler routine may receive compiler-generated hints from a compiler. The compiler hints may be generated by the compiler without user-provided pragmas, and may be passed to the scheduler routine via an API-like interface. The interface may include a scheduling hint data structure that is maintained by the compiler. Other embodiments are also described and claimed.

Abstract translation: 方法，装置和系统实施例，以在不介入操作系统的情况下调度用户级独立于OS的“碎片”。对于至少一个实施例，碎片被调度为由调度器例程而不是操作系统执行。调度程序例程可以从编译器接收编译器生成的提示。编译器提示可能由编译器生成，而不需要用户提供的编译指示，并且可以通过类API接口传递给调度程序。接口可以包括由编译器维护的调度提示数据结构。还描述和要求保护其他实施例。

5.

发明申请
GENERATING VECTOR BASED SELECTION CONTROL STATEMENTS 审中-公开

公开(公告)号：WO2018125409A1

公开(公告)日：2018-07-05

申请号：PCT/US2017/061713

申请日：2017-11-15

Applicant: INTEL CORPORATION

Inventor： IDO, Hideki Saito , GARCIA, Eric N. , TIAN, Xinmin , GIRKAR, Milind B. , BRODMAN, James

IPC: G06F9/30

CPC classification number: G06F9/3844 , G06F9/30058 , G06F9/3806 , G06F15/76

Abstract: In one example, a system for generating vector based selection control statements can include a processor to determine a vector cost of the selection control statement is below a scalar cost and determine the selection control statement is to be executed in a sorted order based on dependencies between branch instructions of the selection control statement. The processor can also determine a program ordering of labels of the selection control statement does not match a mathematical ordering of the labels and execute the selection control statement with a vector of values, wherein the selection control statement is to be executed based on a jump table and a sorted unique value technique, wherein the sorted unique value technique comprises selecting at least one of the plurality of branch instructions from the jump table.

6.

发明申请
METHODS AND APPARATUSES FOR THREAD MANAGEMENT OF MULTI-THREADING 审中-公开
Title translation: 多线程螺纹管理的方法和设备

公开(公告)号：WO2005033936A1

公开(公告)日：2005-04-14

申请号：PCT/US2004/032075

申请日：2004-09-29

Applicant: INTEL CORPORATION , HOFLEHNER, Gerolf , LIAO, Shih-Wei , TIAN, Xinmin , WANG, Hong , LAVERY, Daniel , WANG, Perry , KIM, Dongkeun , GIRKAR, Milind , SHEN, John

Inventor： HOFLEHNER, Gerolf , LIAO, Shih-Wei , TIAN, Xinmin , WANG, Hong , LAVERY, Daniel , WANG, Perry , KIM, Dongkeun , GIRKAR, Milind , SHEN, John

IPC: G06F9/45

CPC classification number: G06F8/441

Abstract: Methods and apparatuses for thread management for multi-threading are described herein. In one embodiment, exemplary process includes selecting, during a compilation of code having one or more threads executable in a data processing system, a current thread having a most bottom order, determining resources allocated to one or more child threads spawned from the current thread, and allocating resources for the current thread in consideration of the resources allocated to the current thread's one or more child threads to avoid resource conflicts between the current thread and its one or more child threads. Other methods and apparatuses are also described.

Abstract translation: 本文描述了用于多线程的线程管理的方法和装置。在一个实施例中，示例性过程包括在具有在数据处理系统中可执行的一个或多个线程的代码的编译期间选择具有最低阶的当前线程，确定分配给从当前线程产生的一个或多个子线程的资源，并且考虑分配给当前线程的一个或多个子线程的资源来为当前线程分配资源，以避免当前线程与其一个或多个子线程之间的资源冲突。还描述了其它方法和装置。

7.

发明申请
THREAD-DATA AFFINITY OPTIMIZATION USING COMPILER 审中-公开
Title translation: 使用编译器的线程优化优化

公开(公告)号：WO2007041122A1

公开(公告)日：2007-04-12

申请号：PCT/US2006/037576

申请日：2006-08-26

Applicant: INTEL CORPORATION , TIAN, Xinmin , GIRKAR, Milind , SHER, David , GROVE, Richard , LI, Wei , WANG, Hong , NEWBURN, Chris , WANG, Perry , SHEN, John

Inventor： TIAN, Xinmin , GIRKAR, Milind , SHER, David , GROVE, Richard , LI, Wei , WANG, Hong , NEWBURN, Chris , WANG, Perry , SHEN, John

IPC: G06F9/45

CPC classification number: G06F8/45

Abstract: Thread-data affinity optimization can be performed by a compiler during the compiling of a computer program to be executed on a cache coherent non-uniform memory access (cc-NUMA) platform. In one embodiment, the present invention includes receiving a program to be compiled. The received program is then compiled in a first pass and executed. During execution, the compiler collects profiling data using a profiling tool. Then, in a second pass, the compiler performs thread-data affinity optimization on the program using the collected profiling data.

Abstract translation: 线程数据亲和度优化可以在编译要在高速缓存相干非均匀内存访问（cc-NUMA）平台上执行的计算机程序时由编译器执行。在一个实施例中，本发明包括接收要编译的程序。接收的程序然后被编译成第一遍并被执行。在执行期间，编译器使用分析工具收集分析数据。然后，在第二遍，编译器使用收集的分析数据对程序执行线程数据关联优化。

8.

发明申请
METHODS AND APPARATUSES FOR COMPILER-CREATING HELPER THREADS FOR MULTI-THREADING 审中-公开
Title translation: 编译器用于多线程的辅助线程的方法和设备

公开(公告)号：WO2005033931A2

公开(公告)日：2005-04-14

申请号：PCT/US2004/032461

申请日：2004-09-30

Applicant: INTEL CORPORATION , LIAO, Shih-wei , TIAN, Xinmin , HOFLEHNER, Gerolf, F. , WANG, Hong , LAVERY, Daniel, M. , WANG, Perry , KIM, Dongkeun , GIRKAR, Milind , SHEN, John, P.

Inventor： LIAO, Shih-wei , TIAN, Xinmin , HOFLEHNER, Gerolf, F. , WANG, Hong , LAVERY, Daniel, M. , WANG, Perry , KIM, Dongkeun , GIRKAR, Milind , SHEN, John, P.

IPC: G06F9/40

CPC classification number: G06F9/3842 , G06F8/4442 , G06F9/383 , G06F9/3851

Abstract: Methods and apparatuses for compiler- created helper thread for multithreading are described herein. In one embodiment, exemplary process includes identifying a region of a main thread that likely has one or more delinquent loads, the one or more delinquent loads representing loads which likely suffer cache misses during an execution of the main thread, analyzing the region for one or more helper threads with respect to the main thread, and generating code for the one or more helper threads, the one or more helper threads being speculatively executed in parallel with the main thread to perform one or more tasks for the region of the main thread. Other methods and apparatuses are also described.

Abstract translation: 本文描述了用于多线程的编译器创建的辅助线程的方法和装置。在一个实施例中，示例性过程包括识别可能具有一个或多个拖欠负载的主线程的区域，所述一个或多个违规负载表示在执行主线程期间可能遭受高速缓存未命中的负载，分析该区域中的一个或多个相对于主线程的更多帮助线程以及为一个或多个辅助线程生成代码，所述一个或多个辅助线程与主线程并行地推测地执行以对主线程的区域执行一个或多个任务。还描述了其它方法和装置。

9.

发明申请
MULTI-ENTRY THREADING METHOD AND APPARATUS FOR AUTOMATIC AND DIRECTIVE-GUIDED PARALLELIZATION OF A SOURCE PROGRAM 审中-公开

公开(公告)号：WO2002003194A3

公开(公告)日：2002-01-10

申请号：PCT/US2001/018614

申请日：2001-06-08

Applicant: INTEL CORPORATION , KIRKEGAARD, Knud , GIRKAR, Milind , GREY, Paul , TIAN, Xinmin

Inventor： KIRKEGAARD, Knud , GIRKAR, Milind , GREY, Paul , TIAN, Xinmin

IPC: G06F9/45

Abstract: A method and apparatus for compiling a source program are described. Multiple predetermined sequences within the source program are located. A start code is inserted in the source program prior to a first instruction of each predetermined sequence. An invocation code is inserted in the source program prior to the start code, the invocation code addressing the start code and transferring each sequence to a system for execution. Finally, a stop code is inserted in the source program after a last instruction of each sequence, the stop code signaling to the system to step execution of the sequence.

10.

发明申请
SYSTEMS AND METHODS FOR CACHE OPTIMIZATION 审中-公开

公开(公告)号：WO2020190796A1

公开(公告)日：2020-09-24

申请号：PCT/US2020/022833

申请日：2020-03-14

Applicant: INTEL CORPORATION

Inventor： KOKER, Altug , RAY, Joydeep , OULD-AHMED-VALL, Elmoustapha , APPU, Abhishek , ANANTARAMAN, Aravindh , ANDREI, Valentin , BILAGI, Durgaprasad , GEORGE, Varghese , INSKO, Brent , JAHAGIRDAR, Sanjeev , JANUS, Scott , K, Pattabhiraman , KIM, SungYe , MAIYURAN, Subramaniam , RANGANATHAN, Vasanth , STRIRAMASSARMA, Lakshminarayanan , TIAN, Xinmin

IPC: G06F9/38 , G06F12/0862 , G06F9/30 , G06F12/123 , G06F12/126

Abstract: Systems and methods for improving cache efficiency and utilization are disclosed. In one embodiment, a graphics processor includes processing resources to perform graphics operations and a cache controller of a cache memory that is coupled to the processing resources. The cache controller is configured to set an initial aging policy using an aging field based on age of cache lines within the cache memory and to determine whether a hint or an instruction to indicate a level of aging has been received.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification