专利检索 ap:("Intel Corporation") AND inv:"Bret L. Toll" 第 9 页

81.

发明授权
Instructions and logic to vectorize conditional loops 有权

公开(公告)号：US09696993B2

公开(公告)日：2017-07-04

申请号：US15344836

申请日：2016-11-07

申请人： Intel Corporation

发明人： Tal Uliel , Elmoustapha Ould-Ahmed-Vall , Bret L. Toll

IPC分类号： G06F15/76 , G06F9/30 , G06F15/80

CPC分类号： G06F9/30036 , G06F9/30018 , G06F9/30025 , G06F9/30043 , G06F9/3013 , G06F9/30145 , G06F9/3016 , G06F15/8007

摘要： A processing device to provide vectorization of conditional loops includes vector physical registers to store a source vector having a first plurality of n data fields, and a destination vector comprising a second plurality of data fields corresponding to the first plurality of data fields, wherein each of the second plurality of data fields corresponds to a mask value in a vector conditions mask. The processing device includes a decode stage to decode a first processor instruction specifying a vector expand operation and a data partition size, and execution units to set elements of the source vector to n count values, obtain a decisions vector, generate the vector conditions mask according to the decisions vector, and copy data from consecutive vector elements in the source vector, into unmasked vector elements of the destination vector, without copying data from the source vector into masked vector elements of the destination vector.

82.

发明授权
Vector friendly instruction format and execution thereof 有权
标题翻译：向量友好的指令格式及其执行

公开(公告)号：US09513917B2

公开(公告)日：2016-12-06

申请号：US14170397

申请日：2014-01-31

申请人： Intel Corporation

发明人： Robert C. Valentine , Jesus Corbal San Adrian , Roger Espasa Sans , Robert D. Cavin , Bret L. Toll , Santiago Galan Duran , Jeffrey G. Wiedemeier , Sridhar Samudrala , Milind Baburao Girkar , Edward Thomas Grochowski , Jonathan Cannon Hall , Dennis R. Bradford , Elmoustapha Ould-Ahmed-Vall , James C. Abel , Mark Charney , Seth Abraham , Suleyman Sair , Andrew Thomas Forsyth , Lisa Wu , Charles Yount

IPC分类号： G06F9/305 , G06F9/315 , G06F9/30 , G06F9/34

CPC分类号： G06F9/30181 , G06F9/3001 , G06F9/30014 , G06F9/30018 , G06F9/30032 , G06F9/30036 , G06F9/30047 , G06F9/30145 , G06F9/30149 , G06F9/30185 , G06F9/30192 , G06F9/34

摘要： A vector friendly instruction format and execution thereof. According to one embodiment of the invention, a processor is configured to execute an instruction set. The instruction set includes a vector friendly instruction format. The vector friendly instruction format has a plurality of fields including a base operation field, a modifier field, an augmentation operation field, and a data element width field, wherein the first instruction format supports different versions of base operations and different augmentation operations through placement of different values in the base operation field, the modifier field, the alpha field, the beta field, and the data element width field, and wherein only one of the different values may be placed in each of the base operation field, the modifier field, the alpha field, the beta field, and the data element width field on each occurrence of an instruction in the first instruction format in instruction streams.

摘要翻译： 一种向量友好的指令格式及其执行。根据本发明的一个实施例，处理器被配置为执行指令集。指令集包括向量友好指令格式。向量友好指令格式具有多个字段，包括基本操作字段，修改字段，增加操作字段和数据元素宽度字段，其中第一指令格式支持不同版本的基本操作和不同的扩充操作，基本操作字段，修饰符字段，α字段，β字段和数据元素宽度字段中的不同值，并且其中只有一个不同的值可以被放置在基本操作字段，修饰符字段，在指令流中的第一指令格式的指令的每次出现时的alpha字段，β字段和数据元素宽度字段。

83.

发明授权
Packed data operation mask comparison processors, methods, systems, and instructions 有权

公开(公告)号：US09442733B2

公开(公告)日：2016-09-13

申请号：US14966206

申请日：2015-12-11

申请人： Intel Corporation

发明人： Bret L. Toll , Robert Valentine , Jesus Corbal , Elmoustapha Ould-Ahmed-Vall , Mark J. Charney

IPC分类号： G06F9/30 , G06F9/38

CPC分类号： G06F9/30189 , G06F9/30018 , G06F9/30021 , G06F9/30029 , G06F9/30036 , G06F9/30094 , G06F9/30101 , G06F9/30145

摘要： Receive packed data operation mask comparison instruction indicating first packed data operation mask having first packed data operation mask bits and second packed data operation mask having second packed data operation mask bits. Each packed data operation mask bit of first mask corresponds to a packed data operation mask bit of second mask in corresponding position. Modify first flag to first value if bitwise AND of each packed data operation mask bit of first mask with each corresponding packed data operation mask bit of second mask is zero. Otherwise modify first flag to second value. Modify second flag to third value if bitwise AND of each packed data operation mask bit of first mask with bitwise NOT of each corresponding packed data operation mask bit of second mask is zero. Otherwise modify second flag to fourth value.

84.

发明申请
ROTATE INSTRUCTIONS THAT COMPLETE EXECUTION EITHER WITHOUT WRITING OR READING FLAGS 审中-公开
标题翻译：完整的执行操作，无需书写或阅读标志

公开(公告)号：US20150089201A1

公开(公告)日：2015-03-26

申请号：US14562310

申请日：2014-12-05

申请人： Intel Corporation

发明人： Vinodh Gopal , James D. Guilford , Gilbert M. Wolrich , Wajdi K. Feghali , Erdinc Ozturk , Martin G. Dixon , Sean Mirkes , Bret L. Toll , Maxim Loktyukhin , Mark C. Davis , Alexandre J. Farcy

IPC分类号： G06F9/30

CPC分类号： G06F9/30032 , G06F9/30094 , G06F9/30098

摘要： A method of one aspect may include receiving a rotate instruction. The rotate instruction may indicate a source operand and a rotate amount. A result may be stored in a destination operand indicated by the rotate instruction. The result may have the source operand rotated by the rotate amount. Execution of the rotate instruction may complete without reading a carry flag.

摘要翻译： 一个方面的方法可以包括接收旋转指令。旋转指令可以指示源操作数和旋转量。结果可以存储在由旋转指令指示的目标操作数中。结果可能使源操作数旋转了旋转量。旋转指令的执行可以在不读取进位标志的情况下完成。

85.

发明授权
Systems, methods, and apparatuses for matrix operations 有权

公开(公告)号：US12106100B2

公开(公告)日：2024-10-01

申请号：US16487421

申请日：2017-07-01

申请人： Intel Corporation

发明人： Robert Valentine , Mark J. Charney , Elmoustapha Ould-Ahmed-Vall , Dan Baum , Zeev Sperber , Jesus Corbal , Bret L. Toll , Raanan Sade , Igor Yanover , Yuri Gebil , Rinat Rappoport , Stanislav Shwartsman , Menachem Adelman , Simon Rubanovich

IPC分类号： G06F9/30 , G06F7/485 , G06F7/487 , G06F7/76 , G06F9/38 , G06F17/16

CPC分类号： G06F9/30036 , G06F7/485 , G06F7/4876 , G06F7/762 , G06F9/3001 , G06F9/30032 , G06F9/30043 , G06F9/30109 , G06F9/30112 , G06F9/30134 , G06F9/30145 , G06F9/30149 , G06F9/3016 , G06F9/30185 , G06F9/30196 , G06F9/3818 , G06F9/3836 , G06F17/16 , G06F2212/454

摘要： Embodiments detailed herein relate to matrix (tile) operations. For example, decode circuitry to decode an instruction having fields for an opcode and a memory address; and execution circuitry to execute the decoded instruction to set a tile configuration for the processor to utilize tiles in matrix operations based on a description retrieved from the memory address, wherein a tile a set of 2-dimensional registers are discussed.

86.

发明授权
Systems, methods, and apparatuses for tile load 有权

公开(公告)号：US11567765B2

公开(公告)日：2023-01-31

申请号：US16487766

申请日：2017-07-01

申请人： Intel Corporation

发明人： Robert Valentine , Menachem Adelman , Milind B. Girkar , Zeev Sperber , Mark J. Charney , Bret L. Toll , Rinat Rappoport , Jesus Corbal , Stanislav Shwartsman , Dan Baum , Igor Yanover , Alexander F. Heinecke , Barukh Ziv , Elmoustapha Ould-Ahmed-Vall , Yuri Gebil

IPC分类号： G06F9/38 , G06F9/30 , G06F7/485 , G06F7/487 , G06F17/16 , G06F7/76

摘要： Embodiments detailed herein relate to matrix operations. In particular, the loading of a matrix (tile) from memory. For example, support for a loading instruction is described in the form of decode circuitry to decode an instruction having fields for an opcode, a destination matrix operand identifier, and source memory information, and execution circuitry to execute the decoded instruction to load groups of strided data elements from memory into configured rows of the identified destination matrix operand to memory.

87.

发明授权
Instruction execution that broadcasts and masks data values at different levels of granularity 有权

公开(公告)号：US11301581B2

公开(公告)日：2022-04-12

申请号：US16730844

申请日：2019-12-30

申请人： Intel Corporation

发明人： Elmoustapha Ould-Ahmed-Vall , Robert Valentine , Jesus Corbal , Bret L. Toll , Mark J. Charney

IPC分类号： G06F21/62 , G06F16/27 , G06F21/70 , G06F9/30 , G06F9/38

摘要： An apparatus is described that includes an execution unit to execute a first instruction and a second instruction. The execution unit includes input register space to store a first data structure to be replicated when executing the first instruction and to store a second data structure to be replicated when executing the second instruction. The first and second data structures are both packed data structures. Data values of the first packed data structure are twice as large as data values of the second packed data structure. The execution unit also includes replication logic circuitry to replicate the first data structure when executing the first instruction to create a first replication data structure, and, to replicate the second data structure when executing the second data instruction to create a second replication data structure. The execution unit also includes masking logic circuitry to mask the first replication data structure at a first granularity and mask the second replication data structure at a second granularity. The second granularity is twice as fine as the first granularity.

88.

发明授权
Apparatuses and methods for a processor architecture 有权

公开(公告)号：US11294809B2

公开(公告)日：2022-04-05

申请号：US16115067

申请日：2018-08-28

申请人： Intel Corporation

发明人： Jason W. Brandt , Robert S. Chappell , Jesus Corbal , Edward T. Grochowski , Stephen H. Gunther , Buford M. Guy , Thomas R. Huff , Christopher J. Hughes , Elmoustapha Ould-Ahmed-Vall , Ronak Singhal , Seyed Yahya Sotoudeh , Bret L. Toll , Lihu Rappoport , David Papworth , James D. Allen

IPC分类号： G06F12/0831 , G06F12/1027 , G06F12/1009 , G06F9/30

摘要： Embodiments of an invention a processor architecture are disclosed. In an embodiment, a processor includes a decoder, an execution unit, a coherent cache, and an interconnect. The decoder is to decode an instruction to zero a cache line. The execution unit is to issue a write command to initiate a cache line sized write of zeros. The coherent cache is to receive the write command, to determine whether there is a hit in the coherent cache and whether a cache coherency protocol state of the hit cache line is a modified state or an exclusive state, to configure a cache line to indicate all zeros, and to issue the write command toward the interconnect. The interconnect is to, responsive to receipt of the write command, issue a snoop to each of a plurality of other coherent caches for which it must be determined if there is a hit.

89.

发明授权
Rotate instructions that complete execution either without writing or reading flags 有权

公开(公告)号：US11106461B2

公开(公告)日：2021-08-31

申请号：US15939693

申请日：2018-03-29

申请人： Intel Corporation

发明人： Vinodh Gopal , James D. Guilford , Gilbert M. Wolrich , Wajdi K. Feghali , Erdinc Ozturk , Martin G. Dixon , Sean P. Mirkes , Bret L. Toll , Maxim Loktyukhin , Mark C. Davis , Alexandre J. Farcy

IPC分类号： G06F9/30

摘要： A method of one aspect may include receiving a rotate instruction. The rotate instruction may indicate a source operand and a rotate amount. A result may be stored in a destination operand indicated by the rotate instruction. The result may have the source operand rotated by the rotate amount. Execution of the rotate instruction may complete without reading a carry flag.

90.

发明授权
Packed data element predication processors, methods, systems, and instructions 有权

公开(公告)号：US10963257B2

公开(公告)日：2021-03-30

申请号：US16586977

申请日：2019-09-28

申请人： Intel Corporation

发明人： Bret L. Toll , Buford M. Guy , Ronak Singhal , Mishali Naik

IPC分类号： G06F9/30

摘要： A processor includes a first mode where the processor is not to use packed data operation masking, and a second mode where the processor is to use packed data operation masking. A decode unit to decode an unmasked packed data instruction for a given packed data operation in the first mode, and to decode a masked packed data instruction for a masked version of the given packed data operation in the second mode. The instructions have a same instruction length. The masked instruction has bit(s) to specify a mask. Execution unit(s) are coupled with the decode unit. The execution unit(s), in response to the decode unit decoding the unmasked instruction in the first mode, to perform the given packed data operation. The execution unit(s), in response to the decode unit decoding the masked instruction in the second mode, to perform the masked version of the given packed data operation.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类