Patent search ap:("QUALCOMM Incorporated") AND inv:"Lin Chen" Page 1

1.

发明授权
Per-shader preamble for graphics processing 有权

公开(公告)号：US09799089B1

公开(公告)日：2017-10-24

申请号：US15162272

申请日：2016-05-23

Applicant: QUALCOMM Incorporated

Inventor： Lin Chen , Yun Du , Andrew Evan Gruber , Guofang Jiao , Chun Yu , David Rigel Garcia Garcia

IPC: G06T1/20 , G06T15/80 , G06T1/60

CPC classification number: G06T1/20 , G06T1/60 , G06T15/80

Abstract: A method for processing data in a graphics processing unit including receiving a code block of instructions common to a plurality of groups of threads of a shader, executing the code block of instructions common to the plurality of groups of threads of the shader creating a result by a first group of threads of the plurality of groups of threads, storing the result of the code block of instructions common to the plurality of groups of threads of the shader in on-chip random access memory (RAM), the on-chip RAM accessible by each of the plurality of groups of threads, and upon a determination that storing the result of the code block of instructions common to the plurality of groups of threads of the shader has completed, returning the result of the code block of instructions common to the plurality of groups of threads of the shader from on-chip RAM.

2.

发明授权
General purpose register allocation in streaming processor 有权

公开(公告)号：US10558460B2

公开(公告)日：2020-02-11

申请号：US15379195

申请日：2016-12-14

Applicant: QUALCOMM Incorporated

Inventor： Yun Du , Liang Han , Lin Chen , Chihong Zhang , Hongjiang Shang , Jing Wu , Zilin Ying , Chun Yu , Guofang Jiao , Andrew Gruber , Eric Demers

IPC: G06F9/30 , G06F9/38 , G06T1/20 , G06F8/41

Abstract: Systems and techniques are disclosed for general purpose register dynamic allocation based on latency associated with of instructions in processor threads. A streaming processor can include a general purpose registers configured to stored data associated with threads, and a thread scheduler configured to receive allocation information for the general purpose registers, the information describing general purpose registers that are to be assigned as persistent general purpose registers (pGPRs) and volatile general purpose registers (vGPRs). The plurality of general purpose registers can be allocated according to the received information. The streaming processor can include the general purpose registers allocated according to the received information, the allocated based on execution latencies of instructions included in the threads.

3.

发明申请
CONSTANT MULTIPLICATION WITH TEXTURE UNIT OF GRAPHICS PROCESSING UNIT 审中-公开

公开(公告)号：US20170316540A1

公开(公告)日：2017-11-02

申请号：US15141519

申请日：2016-04-28

Applicant: QUALCOMM Incorporated

Inventor： Andrew Evan Gruber , Lin Chen , Liang Li , Chunhui Mei

IPC: G06T1/20 , G06T15/80 , G06T1/60

CPC classification number: G06T1/20 , G06T1/60 , G06T15/005 , G06T15/04 , G06T15/80 , G06T2200/28

Abstract: A texture unit of a graphics processing unit (GPU) may receive a texture data. The texture unit may receive the texture data from the memory. The texture unit may also multiply, by a multiplier circuit of the texture unit, the texture data by at least one constant, where the constant is not associated with a filtering operation, and where the texture data comprises at least one texel. The texture unit may also output, by the texture unit, a result of multiplying the texture data by the at least one constant.

4.

发明授权
Utilizing pipeline registers as intermediate storage 有权

公开(公告)号：US09747104B2

公开(公告)日：2017-08-29

申请号：US14275047

申请日：2014-05-12

Applicant: QUALCOMM Incorporated

Inventor： Lin Chen , Yun Du , Sumesh Udayakumaran , Chihong Zhang , Andrew Evan Gruber

IPC: G06F9/30 , G06F9/38

CPC classification number: G06F9/3012 , G06F9/30032 , G06F9/3017 , G06F9/3869 , G06F9/3875

Abstract: In one example, a method includes responsive to receiving, by a processing unit, one or more instructions requesting that a first value be moved from a first general purpose register (GPR) to a third GPR and that a second value be moved from a second GPR to a fourth GPR, copying, by an initial logic unit and during a first clock cycle, the first value to an initial pipeline register, copying, by the initial logic and during a second clock cycle, the second value to the initial pipeline register, copying, by a final logic unit and during a third clock cycle, the first value from a final pipeline register to the third GPR, and copying, by the final logic unit and during a fourth clock cycle, the second value from the final pipeline register to the fourth GPR.

5.

发明授权
GPU divergence barrier 有权

公开(公告)号：US09652284B2

公开(公告)日：2017-05-16

申请号：US14043562

申请日：2013-10-01

Applicant: QUALCOMM Incorporated

Inventor： Chunhui Mei , Alexei Vladimirovich Bourd , Lin Chen

IPC: G06F9/46 , G06F9/48 , G06T1/20 , G06F9/52 , G06F9/38

CPC classification number: G06F9/4843 , G06F9/3887 , G06F9/522 , G06T1/20

Abstract: A device includes a memory, and at least one programmable processor configured to determine, for each warp of a plurality of warps, whether a Boolean expression is true for a corresponding thread of each warp, pause execution of each warp having a corresponding thread for which the expression is true, determine a number of active threads for each of the plurality of warps for which the expression is true, sort the plurality of warps for which the expression is true based on the number of active threads in each of the plurality of warps, swap thread data of an active thread of a first warp of the plurality of warps with thread data of an inactive thread of a second warp of the plurality of warps, and resume execution of the at least one of the plurality of warps for which the expression is true.

6.

发明授权
Load scheme for shared register in GPU 有权

公开(公告)号：US09633411B2

公开(公告)日：2017-04-25

申请号：US14316391

申请日：2014-06-26

Applicant: QUALCOMM Incorporated

Inventor： Yun Du , Andrew Evan Gruber , Lin Chen , Guofang Jiao , Chun Yu

IPC: G06T1/60 , G06T15/80 , G09G5/36

CPC classification number: G06T1/60 , G06T15/80 , G09G5/363 , G09G2352/00 , G09G2360/06

Abstract: Techniques are described for determining whether data of a variable for each of a plurality of graphics items is same. If determined that the data is the same, the techniques store the data in a storage location of a specialized shared general purpose register that is associated with the variable.

7.

发明申请
SKIPPING OF DATA STORAGE 有权
Title translation: 数据存储的移动

公开(公告)号：US20160054998A1

公开(公告)日：2016-02-25

申请号：US14462932

申请日：2014-08-19

Applicant: QUALCOMM Incorporated

Inventor： Yun Du , Lin Chen , Andrew Evan Gruber , Chihong Zhang , Chun Yu

IPC: G06F9/30 , G06T1/20

CPC classification number: G06F9/30098 , G06F8/441 , G06F9/30145 , G06F9/30181 , G06F9/3828 , G06F9/3859 , G06T1/20 , G06T2200/28

Abstract: Techniques are described in which an indication is included to indicate a last use of an intermediate value generated as part of determining a final value is not be stored in a general purpose register (GPR). A processing unit avoids storing the intermediate value in the GPR based on the indication because the intermediate value is no longer needed for determining the final value.

Abstract translation: 描述了其中包括指示以指示作为确定最终值的一部分而生成的中间值的最后使用的指示不被存储在通用寄存器（GPR）中的技术。处理单元基于指示，避免将中间值存储在GPR中，因为不再需要中间值来确定最终值。

8.

发明授权
Per-instance preamble for graphics processing 有权

公开(公告)号：US09799094B1

公开(公告)日：2017-10-24

申请号：US15162198

申请日：2016-05-23

Applicant: QUALCOMM Incorporated

Inventor： Lin Chen , Richard Hammerstone , Jiaji Liu , Chihong Zhang , Andrew Evan Gruber , Yun Du

IPC: G06T1/60 , G06T1/20 , G06T15/00 , G06T9/00

CPC classification number: G06T1/60 , G06F8/4442 , G06F9/383 , G06F12/06 , G06F12/0862 , G06T1/20 , G06T9/00 , G06T15/005

Abstract: A method for processing data in a graphics processing unit (GPU) including receiving an instance identifier for an instance and a shader program comprising a preamble code block and a main shader code block, assigning, the instance identifier to a general purpose register at wave creation, allocating address space within the constant memory for instance uniforms, and determining the preamble code block has not been executed and the wave is a first wave of the instance to be executed, based on determining the preamble code block has not been executed and the wave is the first wave to be executed, executing the preamble code block to store the plurality of instance uniforms in the constant memory and based, at least in part, on executing the preamble code block, executing the wave of the plurality of waves using at least one of the plurality of instance constants stored inconstant memory.

9.

发明申请
EMULATION OF FUSED MULTIPLY-ADD OPERATIONS 有权
Title translation: 融合多媒体操作的仿真

公开(公告)号：US20160048374A1

公开(公告)日：2016-02-18

申请号：US14461890

申请日：2014-08-18

Applicant: QUALCOMM Incorporated

Inventor： Pramod Vasant Argade , Andrew Evan Gruber , Chiente Ho , Stewart Griffin Hall , Lin Chen

IPC: G06F7/57 , G06F5/01

CPC classification number: G06F7/5443 , G06F5/01 , G06F7/483 , G06F7/57

Abstract: At least one processor may emulate a fused multiply-add operation for a first operand, a second operand, and a third operand. The at least one processor may determine an intermediate value based at least in part on multiplying the first operand with the second operand, determine at least one of an upper intermediate value or a lower intermediate value, wherein determining the upper intermediate value comprises rounding, towards zero, the intermediate value by a specified number of bits, and wherein determining the lower intermediate value comprises subtracting the intermediate value by the upper intermediate value, determine an upper value and a lower value based at least in part on adding or subtracting the third operand to one of the upper intermediate value or the lower intermediate value, and determine an emulated fused multiply-add result by adding the upper value and the lower value.

Abstract translation: 至少一个处理器可以模拟第一操作数，第二操作数和第三操作数的融合乘法运算。至少一个处理器可以至少部分地基于将第一操作数与第二操作数相乘来确定中间值，确定上中间值或下中间值中的至少一个，其中确定上中间值包括四舍五入零，中间值乘以指定位数，并且其中确定较低中间值包括通过上述中间值减去中间值，至少部分地基于加上或减去第三操作数来确定上限值和较低值到较高中间值或较低中间值之一，并通过加上上限值和下限值来确定仿真融合乘法运算结果。

10.

发明申请
VECTOR SCALING INSTRUCTIONS FOR USE IN AN ARITHMETIC LOGIC UNIT 审中-公开
Title translation: 在算术逻辑单元中使用的矢量放大指令

公开(公告)号：US20160019027A1

公开(公告)日：2016-01-21

申请号：US14331991

申请日：2014-07-15

Applicant: QUALCOMM Incorporated

Inventor： Lin Chen , Andrew Evan Gruber , Guofang Jiao , Chiente Ho , Pramod Vasant Argade

IPC: G06F5/01

CPC classification number: G06F7/49936 , G06F7/552 , G06F2207/5525 , G09C1/00 , H04L2209/12

Abstract: At least one processor may receive components of a vector, wherein each of the components of the vector comprises at least an exponent. The at least one processor may further determine a maximum exponent out of respective exponents of the components of the vector, and may determine a scaling value based at least in part on the maximum exponent. An arithmetic logic unit of the at least one processor may scale the vector, by subtracting the scaling value from each of the respective exponents of the components of the vector.

Abstract translation: 至少一个处理器可以接收向量的分量，其中矢量的每个分量包括至少一个指数。所述至少一个处理器可以进一步确定向量的分量的相应指数中的最大指数，并且可以至少部分地基于最大指数来确定缩放值。所述至少一个处理器的算术逻辑单元可以通过从所述矢量的各个成分的各指数中减去所述缩放值来缩放所述向量。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification