-
公开(公告)号:US20170277567A1
公开(公告)日:2017-09-28
申请号:US15285472
申请日:2016-10-04
Applicant: MediaTek Inc.
Inventor: Shou-Jen Lai , Pei-Kuei Tsung , Po-Chun Fan , Sung-Fang Tsai
CPC classification number: G06F9/3887 , G06F9/30036 , G06F9/3012 , G06F9/3824 , G06F9/3851
Abstract: A computing device performs parallel computations using a set of thread processing units and a memory shuffle engine. The memory shuffle engine includes a register array to store an array of data elements retrieved from a memory buffer, and an array of input selectors. According to a first control signal, each input selector transfers at least a first data element from a corresponding subset of the register array, which is coupled to the input selector via input lines, to one or more corresponding thread processing units. According to a second control signal, each input selector transfers at least a second data element from another subset of the register array, which is coupled to another input selector via other input lines, to the one or more corresponding thread processing units.
-
公开(公告)号:US20160267621A1
公开(公告)日:2016-09-15
申请号:US14641449
申请日:2015-03-09
Applicant: MEDIATEK INC.
Inventor: Ming-Hao Liao , Shou-Jen Lai , Chia-Hsien Chou , Po-Chun Fan , Yan-Hong Lu , Chih-Chung Cheng , Hung-Yau Lin
IPC: G06T1/20
CPC classification number: G06T1/20
Abstract: A graphic processing system and a method of graphic processing are provided. The graphic processing system has a collector, a plurality of slots, a scheduler, an arbiter and at least an arithmetic logic unit (ALU). The collector is configured to group a plurality of workitems into elementary wavefronts. Each of the elementary wavefronts comprises workitems configured to execute the same kernel code. The scheduler is configured to allocate the elementary wavefronts to the slots. Two or more of the elementary wavefronts exist at one slot to form one of a plurality of macro wavefronts. The arbiter is configured to select one of the macro wavefronts. The ALU is configured to execute workitems of at least an elementary wavefront of the selected macro wavefront and output results of execution of the workitems.
Abstract translation: 提供图形处理系统和图形处理方法。 图形处理系统具有收集器,多个时隙,调度器,仲裁器和至少一个算术逻辑单元(ALU)。 收集器被配置为将多个工作项组合成基本波阵面。 每个基本波前都包括配置为执行相同内核代码的工作项。 调度器被配置为将基本波前分配给时隙。 在一个时隙上存在两个以上的基本波前,形成多个宏波前的一个。 仲裁器被配置为选择一个宏波阵面。 ALU被配置为执行至少所选宏波阵面的基本波阵面的工作项,并输出工作项目的执行结果。
-