-
公开(公告)号:US11204765B1
公开(公告)日:2021-12-21
申请号:US17003600
申请日:2020-08-26
Applicant: QUALCOMM Incorporated
Inventor: Yun Du , Fei Wei , Gang Zhong , Minjie Huang , Jian Jiang , Zilin Ying , Baoguang Yang , Yang Xia , Jing Han , Liangxiao Hu , Chihong Zhang , Chun Yu , Andrew Evan Gruber , Eric Demers
Abstract: A graphics processing unit (GPU) utilizes block general purpose registers (bGPRs) to load multiple waves of samples for an instruction group into a processing pipeline and receive processed samples from the pipeline. The GPU acquires a credit for the bGPR for execution of the instruction group for a first wave using a persistent GPR and the bGPR. The GPU refunds the credit upon loading the first wave into the pipeline. The GPU executes a subsequent wave for the instruction group to load samples to the pipeline when at least one credit is available and the pipeline is processing the first wave. The GPU stores an indication of each wave that has been loaded into the pipeline in a queue. The GPU returns samples for a next wave in the queue from the pipeline to the bGPR for further processing when the physical slot of the bGPR is available.
-
公开(公告)号:US11132760B2
公开(公告)日:2021-09-28
申请号:US16714052
申请日:2019-12-13
Applicant: QUALCOMM Incorporated
Inventor: Yun Du , Andrew Evan Gruber , Chihong Zhang , Gang Zhong , Jian Jiang , Fei Wei , Minjie Huang , Zilin Ying , Yang Xia , Jing Han , Chun Yu , Eric Demers
Abstract: Methods, systems, and devices for graphic processing are described. The methods, systems, and devices may include or be associated with identifying a graphics instruction, determining that the graphics instruction is alias enabled for the device, partitioning an alias lookup table into one or more slots, allocating a slot of the alias lookup table based on the partitioning and determining that the graphics instruction is alias enabled, generating an alias instruction based on allocating the slot of the alias lookup table and determining that the graphics instruction is alias enabled, and processing the alias instruction.
-
公开(公告)号:US11094103B2
公开(公告)日:2021-08-17
申请号:US16364829
申请日:2019-03-26
Applicant: QUALCOMM Incorporated
Inventor: Yun Du , Andrew Evan Gruber , Chun Yu , Chihong Zhang , Hongjiang Shang , Zilin Ying , Fei Wei
Abstract: Example techniques are described for generating graphics content by obtaining texture operation instructions corresponding to a texture operation, in response to determining at least one of insufficient general purpose register space is available for the texture operation or insufficient wave slots are available for the texture operation, generating an indication that the texture operation corresponds to a deferred wave, executing the texture operation, sending, to a texture processor, initial texture sample instructions corresponding to the texture operation that was executed, and receiving texture mapped data corresponding to the initial texture sample instructions.
-
公开(公告)号:US09665370B2
公开(公告)日:2017-05-30
申请号:US14462932
申请日:2014-08-19
Applicant: QUALCOMM Incorporated
Inventor: Yun Du , Lin Chen , Andrew Evan Gruber , Chihong Zhang , Chun Yu
CPC classification number: G06F9/30098 , G06F8/441 , G06F9/30145 , G06F9/30181 , G06F9/3828 , G06F9/3859 , G06T1/20 , G06T2200/28
Abstract: Techniques are described in which an indication is included to indicate a last use of an intermediate value generated as part of determining a final value is not be stored in a general purpose register (GPR). A processing unit avoids storing the intermediate value in the GPR based on the indication because the intermediate value is no longer needed for determining the final value.
-
公开(公告)号:US09799094B1
公开(公告)日:2017-10-24
申请号:US15162198
申请日:2016-05-23
Applicant: QUALCOMM Incorporated
Inventor: Lin Chen , Richard Hammerstone , Jiaji Liu , Chihong Zhang , Andrew Evan Gruber , Yun Du
CPC classification number: G06T1/60 , G06F8/4442 , G06F9/383 , G06F12/06 , G06F12/0862 , G06T1/20 , G06T9/00 , G06T15/005
Abstract: A method for processing data in a graphics processing unit (GPU) including receiving an instance identifier for an instance and a shader program comprising a preamble code block and a main shader code block, assigning, the instance identifier to a general purpose register at wave creation, allocating address space within the constant memory for instance uniforms, and determining the preamble code block has not been executed and the wave is a first wave of the instance to be executed, based on determining the preamble code block has not been executed and the wave is the first wave to be executed, executing the preamble code block to store the plurality of instance uniforms in the constant memory and based, at least in part, on executing the preamble code block, executing the wave of the plurality of waves using at least one of the plurality of instance constants stored inconstant memory.
-
公开(公告)号:US09626234B2
公开(公告)日:2017-04-18
申请号:US14570974
申请日:2014-12-15
Applicant: QUALCOMM Incorporated
Inventor: Alexei Vladimirovich Bourd , Colin Christopher Sharp , David Rigel Garcia Garcia , Chihong Zhang
Abstract: This disclosure describes communication techniques that may be used within a multiple-processor computing platform. The techniques may, in some examples, provide software interfaces that may be used to support message passing within a multiple-processor computing platform that initiates tasks using command queues. The techniques may, in additional examples, provide software interfaces that may be used for shared memory inter-processor communication within a multiple-processor computing platform. In further examples, the techniques may provide a graphics processing unit (GPU) that includes hardware for supporting message passing and/or shared memory communication between the GPU and a host CPU.
-
公开(公告)号:US20150324196A1
公开(公告)日:2015-11-12
申请号:US14275047
申请日:2014-05-12
Applicant: QUALCOMM Incorporated
Inventor: Lin Chen , Yun Du , Sumesh Udayakumaran , Chihong Zhang , Andrew Evan Gruber
CPC classification number: G06F9/3012 , G06F9/30032 , G06F9/3017 , G06F9/3869 , G06F9/3875
Abstract: In one example, a method includes responsive to receiving, by a processing unit, one or more instructions requesting that a first value be moved from a first general purpose register (GPR) to a third GPR and that a second value be moved from a second GPR to a fourth GPR, copying, by an initial logic unit and during a first clock cycle, the first value to an initial pipeline register, copying, by the initial logic and during a second clock cycle, the second value to the initial pipeline register, copying, by a final logic unit and during a third clock cycle, the first value from a final pipeline register to the third GPR, and copying, by the final logic unit and during a fourth clock cycle, the second value from the final pipeline register to the fourth GPR.
Abstract translation: 在一个示例中,一种方法包括响应于由处理单元接收一个或多个请求将第一值从第一通用寄存器(GPR)移动到第三GPR的指令,并且第二值从第二个 GPR到第四个GPR,由初始逻辑单元和在第一时钟周期期间将第一个值复制到初始流水线寄存器,通过初始逻辑复制第二个时钟周期,将第二个值复制到初始流水线寄存器 ,由最终逻辑单元和在第三时钟周期期间将第一值从最终流水线寄存器复制到第三GPR,并且由最终逻辑单元复制并在第四时钟周期期间从最终管道复制第二值 注册到第四个GPR。
-
-
-
-
-
-