专利检索 ap:("Adam James Muff" OR "Matthew Ray Tubbs") AND inv:"Matthew Ray Tubbs" 第 2 页

11.

发明授权
Execution unit with inline pseudorandom number generator 失效
标题翻译：具有内联伪随机数发生器的执行单元

公开(公告)号：US08255443B2

公开(公告)日：2012-08-28

申请号：US12132115

申请日：2008-06-03

申请人： Adam James Muff , Matthew Ray Tubbs

发明人： Adam James Muff , Matthew Ray Tubbs

IPC分类号： G06F7/58

CPC分类号： G06F9/3851 , G06F9/30014 , G06F9/30181

摘要： A circuit arrangement and method couple a hardware-based pseudorandom number generator (PRNG) to an execution unit in such a manner that pseudorandom numbers generated by the PRNG may be selectively output to the execution unit for use as an operand during the execution of instructions by the execution unit. A PRNG may be coupled to an input of an operand multiplexer that outputs to an operand input of an execution unit so that operands provided by instructions supplied to the execution unit are selectively overridden with pseudorandom numbers generated by the PRNG. Furthermore, overridden operands provided by instructions supplied to the execution unit may be used as seed values for the PRNG. In many instances, an instruction executed by an execution unit may be able to perform an arithmetic operation using both an operand specified by the instruction and a pseudorandom number generated by the PRNG during the execution of the instruction, so that the generation of the pseudorandom number and the performance of the arithmetic operation occur during a single pass of an execution unit.

摘要翻译： 电路布置和方法将基于硬件的伪随机数生成器（PRNG）耦合到执行单元，使得由PRNG生成的伪随机数可以被选择性地输出到执行单元，以在执行指令期间用作操作数，执行单元。 PRNG可以耦合到操作数多路复用器的输入，该输入输出到执行单元的操作数输入，使得由提供给执行单元的指令提供的操作数被PRNG生成的伪随机数选择性地覆盖。此外，提供给执行单元的指令提供的覆盖操作数可以用作PRNG的种子值。在许多情况下，执行单元执行的指令可以在执行指令期间使用由指令指定的操作数和由PRNG生成的伪随机数来执行算术运算，从而生成伪随机数并且算术运算的执行在执行单元的单次通过期间发生。

12.

发明授权
Designating operands with fewer bits in instruction code by indexing into destination register history table for each thread 失效
标题翻译：通过索引到每个线程的目标寄存器历史记录表来指定指令代码中较少位的操作数

公开(公告)号：US07814299B2

公开(公告)日：2010-10-12

申请号：US12274560

申请日：2008-11-20

申请人： Mark Joseph Hickey , Adam James Muff , Matthew Ray Tubbs , Charles David Wait

发明人： Mark Joseph Hickey , Adam James Muff , Matthew Ray Tubbs , Charles David Wait

IPC分类号： G06F9/30

CPC分类号： G06F9/30098 , G06F9/3016 , G06F9/3832

摘要： A circuit arrangement and method support instruction target history based register address indexing, whereby register addresses to be used by an instruction are decoded using a target history table of previous target register addresses, and an index into the target history table supplied by an index value in the instruction. An instruction may include at least one index value that identifies a previously used register address. During execution of the instruction, the index is retrieved from the instruction, and then a register address is retrieved from the target history table using the index.

摘要翻译： 一种电路布置和方法支持指令目标历史的寄存器地址索引，由此由指令使用的寄存器地址使用先前目标寄存器地址的目标历史表和由目标历史表中的索引值提供的索引进行解码指示。指令可以包括标识先前使用的寄存器地址的至少一个索引值。在执行指令期间，从指令中检索索引，然后使用索引从目标历史表中检索一个寄存器地址。

13.

发明申请
Floating Point Execution Unit for Calculating a One Minus Dot Product Value in a Single Pass 失效
标题翻译：浮点执行单元，用于计算单次通过中的一个减号点产品值

公开(公告)号：US20100031009A1

公开(公告)日：2010-02-04

申请号：US12184324

申请日：2008-08-01

申请人： Adam James Muff , Matthew Ray Tubbs

发明人： Adam James Muff , Matthew Ray Tubbs

IPC分类号： G06F9/302

CPC分类号： G06F9/30014 , G06F9/3802 , G06F9/3851 , G06F9/3877 , G06F9/3885 , G06T15/50 , G06T2200/28

摘要： A floating point execution unit calculates a one minus dot product value in a single pass. As such, the dependency that otherwise would be required to perform the calculations is eliminated, resulting in a substantially faster performance of such calculations. The floating point execution unit may be used, for example, to accelerate pixel shading algorithms such as Fresnel and electron microscope effects.

摘要翻译： 浮点执行单元在单程中计算一个减去点积积值。因此，消除了否则将执行计算所需的相关性，导致这种计算的性能显着更快。例如，浮点执行单元可以用于加速诸如菲涅耳和电子显微镜效应的像素着色算法。

14.

发明申请
Method and Apparatus for an Area Efficient Transcendental Estimate Algorithm 失效
标题翻译：用于区域有效超验估计算法的方法和装置

公开(公告)号：US20090070398A1

公开(公告)日：2009-03-12

申请号：US11851658

申请日：2007-09-07

申请人： Eric Oliver Mejdrich , Adam James Muff , Matthew Ray Tubbs

发明人： Eric Oliver Mejdrich , Adam James Muff , Matthew Ray Tubbs

IPC分类号： G06F7/38

CPC分类号： G06F7/548

摘要： A method, computer-readable medium, and an apparatus for generating a transcendental value. The method includes receiving an input containing an input value and an opcode and determining whether the opcode corresponds to a trigonometric operation or a power-of-two operation. The method also includes calculating a fractional value and an integer value from the input value, generating the transcendental value based on the fractional value by adding at least a portion of the fractional value with at least one of a shifted fractional value produced by shifting the portion of the fractional value and a constant value, and providing the transcendental value in response to the request. In this fashion, the same circuit area may be used to carry out both trigonometric and power-of-two calculations, leading to greater circuit area savings and performance advantages while not sacrificing significant accuracy.

摘要翻译： 一种用于产生超验值的方法，计算机可读介质和装置。该方法包括接收包含输入值和操作码的输入，并确定操作码是否对应于三角运算或二进制运算。该方法还包括从输入值计算分数值和整数值，通过将分数值的至少一部分与通过移动部分产生的移位分数值中的至少一个相加而基于分数值生成超越值的分数值和恒定值，并且响应于该请求提供超验值。以这种方式，可以使用相同的电路面积来执行三角和二次幂计算，导致更大的电路面积节省和性能优点，而不牺牲显着的精度。

15.

发明申请
Operand Multiplexor Control Modifier Instruction in a Fine Grain Multithreaded Vector Microprocessor 失效
标题翻译：精细多线程向量微处理器中的操作数多路复用器控制修改器指令

公开(公告)号：US20080122854A1

公开(公告)日：2008-05-29

申请号：US11564072

申请日：2006-11-28

申请人： Eric Oliver Mejdrich , Adam James Muff , Matthew Ray Tubbs

发明人： Eric Oliver Mejdrich , Adam James Muff , Matthew Ray Tubbs

IPC分类号： G06T1/00

CPC分类号： G06T1/20

摘要： The present invention is generally related to the field of image processing, and more specifically to an instruction set for processing images. Vector processing may involve rearranging vector operands in one or more source registers prior to performing vector operations. Typically, rearranging of operands in source registers is done by issuing a plurality of permute instructions that require excessive usage of temporary registers. Furthermore, the permute instructions may cause dependencies between instructions executing in a pipeline, thereby adversely affecting performance. Embodiments of the invention provide a level of muxing between a register file and a vector unit that allow for rearrangement of vector operands in source registers prior to providing the operands to the vector unit, thereby obviating the need for permute instructions.

摘要翻译： 本发明通常涉及图像处理领域，更具体地涉及用于处理图像的指令集。矢量处理可以包括在执行向量操作之前在一个或多个源寄存器中重新排列向量操作数。通常，通过发出需要临时寄存器过度使用的多个置换指令来完成源寄存器中操作数的重新排列。此外，置换指令可能导致在流水线中执行的指令之间的相关性，从而不利地影响性能。本发明的实施例提供了一种在寄存器文件和向量单元之间的复用水平，其允许在将操作数提供给向量单元之前重新排列源寄存器中的向量操作数，从而避免了对置换指令的需要。

16.

发明授权
Method and apparatus for implementing a multiple operand vector floating point summation to scalar function 失效
标题翻译：用于实现多重操作数向量浮点求和的标量函数的方法和装置

公开(公告)号：US08239438B2

公开(公告)日：2012-08-07

申请号：US11840277

申请日：2007-08-17

申请人： Adam James Muff , Matthew Ray Tubbs

发明人： Adam James Muff , Matthew Ray Tubbs

IPC分类号： G06F7/38

CPC分类号： G06F7/5443 , G06F7/483 , G06F9/30014 , G06F9/30036 , G06F9/3893

摘要： Embodiments of the invention provide methods and apparatus for executing a multiple operand instruction. Executing the multiple operand instruction comprises computing an arithmetic result of a pair of operands in each processing lane of a vector unit. The arithmetic results generated in each processing lane of the vector unit may be transferred to a dot product unit. The dot product unit may compute an arithmetic result using the arithmetic result computed by each processing lane of the vector unit to generate an arithmetic result of more than two operands.

摘要翻译： 本发明的实施例提供了用于执行多操作数指令的方法和装置。执行多操作数指令包括计算向量单元的每个处理通道中的一对操作数的算术结果。在矢量单元的每个处理车道中产生的算术结果可以被转移到点积单位。点积单位可以使用由向量单位的每个处理车道计算的算术结果来计算算术结果，以生成超过两个操作数的算术结果。

17.

发明授权
Floating point execution unit for calculating a one minus dot product value in a single pass 失效
标题翻译：用于在单程中计算一个减去积积积的浮点执行单元

公开(公告)号：US08139061B2

公开(公告)日：2012-03-20

申请号：US12184324

申请日：2008-08-01

申请人： Adam James Muff , Matthew Ray Tubbs

发明人： Adam James Muff , Matthew Ray Tubbs

IPC分类号： G06T15/00

CPC分类号： G06F9/30014 , G06F9/3802 , G06F9/3851 , G06F9/3877 , G06F9/3885 , G06T15/50 , G06T2200/28

摘要： A floating point execution unit calculates a one minus dot product value in a single pass. As such, the dependency that otherwise would be required to perform the calculations is eliminated, resulting in a substantially faster performance of such calculations. The floating point execution unit may be used, for example, to accelerate pixel shading algorithms such as Fresnel and electron microscope effects.

摘要翻译： 浮点执行单元在单程中计算一个减去点积积值。因此，消除了否则将执行计算所需的相关性，导致这种计算的性能显着更快。例如，浮点执行单元可以用于加速诸如菲涅耳和电子显微镜效应的像素着色算法。

18.

发明授权
Early exit processing of iterative refinement algorithm using register dependency disable 失效
标题翻译：使用寄存器依赖关系禁用的迭代细化算法的早期退出处理

公开(公告)号：US07921278B2

公开(公告)日：2011-04-05

申请号：US12045313

申请日：2008-03-10

申请人： Adam James Muff , Matthew Ray Tubbs

发明人： Adam James Muff , Matthew Ray Tubbs

IPC分类号： G06F9/30 , G06F9/40 , G06F7/38 , G06F9/00 , G06F9/44

CPC分类号： G06F9/30014 , G06F7/4873 , G06F7/535 , G06F9/30065 , G06F9/3838 , G06F9/3851 , G06F9/3857 , G06F9/3859 , G06F2207/5355 , G06F2207/5356

摘要： An “early exit” of an iterative refinement algorithm is implemented by effectively disabling read after write dependency stalls of newer instructions, as well as disabling the register write enable of these instructions, for the remainder of the algorithm, in addition to disabling the register write enable of these instructions. By doing so, the latency of the algorithm is reduced and the performance is increased without the complexity and potential poor performance of compare and branch instructions that might otherwise be required.

摘要翻译： 迭代细化算法的“提前退出”除了禁用寄存器写入之外，还通过有效禁用更新指令的写依赖性停止之后的读取以及禁止这些指令的寄存器写使能，对于算法的其余部分启用这些指令。通过这样做，降低了算法的等待时间，并且性能得到提高，而没有另外需要的比较和分支指令的复杂性和潜在的差的性能。

19.

发明授权
Processing unit incorporating instruction-based persistent vector multiplexer control 失效
标题翻译：包含基于指令的持久矢量多路复用器控制的处理单元

公开(公告)号：US07904699B2

公开(公告)日：2011-03-08

申请号：US12045221

申请日：2008-03-10

申请人： Eric Oliver Mejdrich , Adam James Muff , Robert Allen Shearer , Matthew Ray Tubbs

发明人： Eric Oliver Mejdrich , Adam James Muff , Robert Allen Shearer , Matthew Ray Tubbs

IPC分类号： G06F9/00

CPC分类号： G06F9/30032 , G06F9/30036 , G06F9/30109 , G06F9/30123

摘要： Persistent vector multiplexer control is used in a vector-based execution unit to control the shuffling of words in operand vectors processed by the execution unit. In addition, a persistent swizzle instruction is defined in an instruction set for the vector-based execution unit and is used to cause state information to be persisted such that the operand vectors processed by subsequent vector instructions executed by the vector-based execution unit will be selectively shuffled using the persisted state information. As a result, when multiple vector instructions require a common custom word ordering for one or more operand vectors, a single persistent swizzle instruction may be used to select the desired custom word ordering for all of the vector instructions.

摘要翻译： 持续矢量复用器控制在基于矢量的执行单元中用于控制由执行单元处理的操作数向量中的字的混洗。此外，在用于基于向量的执行单元的指令集中定义持续转换指令，并且用于使状态信息被持久化，使得由基于向量的执行单元执行的后续向量指令处理的操作数向量将被使用持久状态信息选择性地进行混洗。因此，当多个向量指令需要一个或多个操作数向量的公共自定义单词排序时，可以使用单个持续旋转指令来选择所有向量指令的期望的定制单词排序。

20.

发明授权
Operand multiplexor control modifier instruction in a fine grain multithreaded vector microprocessor 失效
标题翻译：精细多线程向量微处理器中的操作数多路复用器控制修改器指令

公开(公告)号：US07868894B2

公开(公告)日：2011-01-11

申请号：US11564072

申请日：2006-11-28

申请人： Eric Oliver Mejdrich , Adam James Muff , Matthew Ray Tubbs

发明人： Eric Oliver Mejdrich , Adam James Muff , Matthew Ray Tubbs

IPC分类号： G06T1/00

CPC分类号： G06T1/20

摘要： The present invention is generally related to the field of image processing, and more specifically to an instruction set for processing images. Vector processing may involve rearranging vector operands in one or more source registers prior to performing vector operations. Typically, rearranging of operands in source registers is done by issuing a plurality of permute instructions that require excessive usage of temporary registers. Furthermore, the permute instructions may cause dependencies between instructions executing in a pipeline, thereby adversely affecting performance. Embodiments of the invention provide a level of muxing between a register file and a vector unit that allow for rearrangement of vector operands in source registers prior to providing the operands to the vector unit, thereby obviating the need for permute instructions.

摘要翻译： 本发明通常涉及图像处理领域，更具体地涉及用于处理图像的指令集。矢量处理可以包括在执行向量操作之前在一个或多个源寄存器中重新排列向量操作数。通常，通过发出需要临时寄存器过度使用的多个置换指令来完成源寄存器中操作数的重新排列。此外，置换指令可能导致在流水线中执行的指令之间的相关性，从而不利地影响性能。本发明的实施例提供了一种在寄存器文件和向量单元之间的复用水平，其允许在将操作数提供给向量单元之前重新排列源寄存器中的向量操作数，从而避免了对置换指令的需要。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类