专利检索 ap:("Elmoustapha Ould-Ahmed-Vall" OR "Suleyman Sair" OR "Joonmoo Huh") AND inv:"Suleyman Sair" 第 1 页

1.

发明授权
Systems, apparatuses, and method for strided access 有权

公开(公告)号：US09946541B2

公开(公告)日：2018-04-17

申请号：US14975612

申请日：2015-12-18

申请人： Elmoustapha Ould-Ahmed-Vall , Suleyman Sair , Joonmoo Huh

发明人： Elmoustapha Ould-Ahmed-Vall , Suleyman Sair , Joonmoo Huh

IPC分类号： G06F9/30

CPC分类号： G06F9/30036 , G06F9/30014 , G06F9/30018 , G06F9/30032 , G06F9/30043 , G06F9/30181 , G06F9/30192 , G06F9/3455

摘要： Systems, methods, and apparatuses for strided access are described. In some embodiments, a plurality of registers are loaded with data from an array of structures. Then data elements that that are not needed in a permute operation are overwritten with index values with a write mask. The register now contains a mix of data and index values. When this same write mask is passed to the permute instruction which overwrites the index register as destination, the data values are preserved and index values are overwritten with data coming from the other two source registers as controlled by the index values.

2.

发明申请
Systems, Apparatuses, and Method for Strided Access 有权

公开(公告)号：US20170177356A1

公开(公告)日：2017-06-22

申请号：US14975612

申请日：2015-12-18

申请人： Elmoustapha Ould-Ahmed-Vall , Suleyman Sair , Joonmoo Huh

发明人： Elmoustapha Ould-Ahmed-Vall , Suleyman Sair , Joonmoo Huh

IPC分类号： G06F9/30

CPC分类号： G06F9/30036 , G06F9/30014 , G06F9/30018 , G06F9/30032 , G06F9/30043 , G06F9/30181 , G06F9/30192 , G06F9/3455

摘要： Systems, methods, and apparatuses for strided access are described. In some embodiments, a plurality of registers are loaded with data from an array of structures. Then data elements that that are not needed in a permute operation are overwritten with index values with a write mask. The register now contains a mix of data and index values. When this same write mask is passed to the permute instruction which overwrites the index register as destination, the data values are preserved and index values are overwritten with data coming from the other two source registers as controlled by the index values.

3.

发明授权
Instruction and logic to provide vector horizontal majority voting functionality 有权
标题翻译：提供向量横向多数投票功能的指令和逻辑

公开(公告)号：US09448794B2

公开(公告)日：2016-09-20

申请号：US13977735

申请日：2011-11-30

申请人： Elmoustapha Ould-Ahmed-Vall , Kshitij A. Doshi , Suleyman Sair , Charles R. Yount

发明人： Elmoustapha Ould-Ahmed-Vall , Kshitij A. Doshi , Suleyman Sair , Charles R. Yount

IPC分类号： G06F11/00 , G06F9/30 , G06F11/14 , G06F11/10

CPC分类号： G06F9/30036 , G06F7/22 , G06F7/544 , G06F9/30018 , G06F9/30021 , G06F9/30101 , G06F9/30145 , G06F9/3016 , G06F11/1048 , G06F11/1479

摘要： Instructions and logic provide vector horizontal majority voting functionality. Some embodiments, responsive to an instruction specifying: a destination operand, a size of the vector elements, a source operand, and a mask corresponding to a portion of the vector element data fields in the source operand; read a number of values from data fields of the specified size in the source operand, corresponding to the mask specified by the instruction and store a result value to that number of corresponding data fields in the destination operand, the result value computed from the majority of values read from the number of data fields of the source operand.

摘要翻译： 指令和逻辑提供向量横向多数投票功能。一些实施例，响应于指定目的地操作数，向量元素的大小，源操作数和对应于源操作数中的向量元素数据字段的一部分的掩码的指令; 从源操作数中的指定大小的数据字段读取一些数值，对应于指令指定的掩码，并将结果值存储到目标操作数中的相应数据字段数，从大多数从源操作数的数据字段数读取的值。

4.

发明申请
INSTRUCTION AND LOGIC TO PROVIDE VECTOR LOAD-OP/STORE-OP WITH STRIDE FUNCTIONALITY 有权
标题翻译：指令和逻辑提供向量负载/存储 - 具有强大的功能

公开(公告)号：US20140195778A1

公开(公告)日：2014-07-10

申请号：US13977728

申请日：2011-09-26

申请人： Elmoustapha Ould-Ahmed-Vall , Kshitij A. Doshi , Suleyman Sair , Charles R. Yount

发明人： Elmoustapha Ould-Ahmed-Vall , Kshitij A. Doshi , Suleyman Sair , Charles R. Yount

IPC分类号： G06F9/38 , G06F9/30

CPC分类号： G06F9/30043 , G06F9/30018 , G06F9/30036 , G06F9/3004 , G06F9/30101 , G06F9/3016 , G06F9/30185 , G06F9/3455 , G06F9/3808 , G06F9/3877 , G06F9/3887 , G06F15/8061

摘要： Instructions and logic provide vector load-op and/or store-op with stride functionality. Some embodiments, responsive to an instruction specifying: a set of loads, a second operation, destination register, operand register, memory address, and stride length; execution units read values in a mask register, wherein fields in the mask register correspond to stride-length multiples from the memory address to data elements in memory. A first mask value indicates the element has not been loaded from memory and a second value indicates that the element does not need to be, or has already been loaded. For each having the first value, the data element is loaded from memory into the corresponding destination register location, and the corresponding value in the mask register is changed to the second value. Then the second operation is performed using corresponding data in the destination and operand registers to generate results. The instruction may be restarted after faults.

摘要翻译： 指令和逻辑提供矢量加载操作和/或存储操作与步幅功能。一些实施例，响应于指令：一组负载，第二操作，目的地寄存器，操作数寄存器，存储器地址和步幅长度; 执行单元读取掩码寄存器中的值，其中掩码寄存器中的字段对应于从存储器地址到存储器中的数据元素的跨距长度倍数。第一个掩码值表示元素尚未从内存中加载，第二个值表示元素不需要或已经被加载。对于具有第一个值的每一个，数据元素从存储器加载到相应的目标寄存器位置，并且掩码寄存器中的对应值被改变为第二值。然后使用目的地和操作数寄存器中的相应数据执行第二个操作，以生成结果。指令可能在故障后重新启动。

5.

发明申请
Systems, Methods, and Apparatuses for Fault Tolerance and Detection 审中-公开

公开(公告)号：US20170185465A1

公开(公告)日：2017-06-29

申请号：US14983026

申请日：2015-12-29

申请人： Elmoustapha Ould-Ahmed-Vall , Suleyman Sair , Kshitij A. Doshi , Charles R. Yount

发明人： Elmoustapha Ould-Ahmed-Vall , Suleyman Sair , Kshitij A. Doshi , Charles R. Yount

IPC分类号： G06F11/07 , G06F9/30

CPC分类号： G06F11/079 , G06F9/3001 , G06F9/30036 , G06F9/30189 , G06F9/30196 , G06F9/3861 , G06F9/3889 , G06F9/455 , G06F11/0745 , G06F11/0751 , G06F11/0772 , G06F11/1608 , G06F11/1629 , G06F11/1641 , G06F15/8007

摘要： Systems, methods, and apparatuses for fault tolerance and detection are described. For example, an apparatus including circuitry to replicate input sources of an instruction; arithmetic logic unit (ALU) circuitry to execute the instruction with replicated input sources using single instruction, multiple data (SIMD) hardware to produce a packed data result; and comparison circuitry coupled to the ALU circuitry to evaluate the packed data result and output a singular data result into a destination of the instruction is described.

6.

发明授权
Instruction and logic to provide vector loads with strides and masking functionality 有权

公开(公告)号：US09672036B2

公开(公告)日：2017-06-06

申请号：US13977730

申请日：2011-09-26

申请人： Elmoustapha Ould-Ahmed-Vall , Kshitij A. Doshi , Suleyman Sair , Charles R. Yount

发明人： Elmoustapha Ould-Ahmed-Vall , Kshitij A. Doshi , Suleyman Sair , Charles R. Yount

IPC分类号： G06F9/312 , G06F9/345 , G06F9/30

CPC分类号： G06F9/30043 , G06F9/30018 , G06F9/30036 , G06F9/30101 , G06F9/3455

摘要： Instructions and logic provide vector loads and/or stores with stride and mask functionality. Some embodiments, responsive to an instruction specifying: a set of loads, destination register, mask register, memory address, and stride length; execution units read values in the mask register, wherein fields in the mask register correspond to stride-length multiples from the memory address to data elements in memory. A first mask value indicates the element has not been loaded from memory and a second value indicates that the element does not need to be, or has already been loaded. For each having the first value, the corresponding multiple of said stride length is generated according to the data field's position in the mask register to load the data element from memory into the corresponding destination register location, and the corresponding value in the mask register is changed to the second value. These instructions can restart after faults.

7.

发明授权
Apparatus and method of mask permute instructions 有权

公开(公告)号：US09632980B2

公开(公告)日：2017-04-25

申请号：US13976435

申请日：2011-12-23

申请人： Elmoustapha Ould-Ahmed-Vall , Robert Valentine , Jesus Corbal , Suleyman Sair

发明人： Elmoustapha Ould-Ahmed-Vall , Robert Valentine , Jesus Corbal , Suleyman Sair

IPC分类号： G06F9/30 , G06F15/80

CPC分类号： G06F9/30032 , G06F9/30036 , G06F9/30145 , G06F15/8092

摘要： An apparatus is described having instruction execution logic circuitry. The instruction execution logic circuitry has input vector element routing circuitry to perform the following for each of three different instructions: for each of a plurality of output vector element locations, route into an output vector element location an input vector element from one of a plurality of input vector element locations that are available to source the output vector element. The output vector element and each of the input vector element locations are one of three available bit widths for the three different instructions. The apparatus further includes masking layer circuitry coupled to the input vector element routing circuitry to mask a data structure created by the input vector routing element circuitry. The masking layer circuitry is designed to mask at three different levels of granularity that correspond to the three available bit widths.

8.

发明授权
Vector frequency compress instruction 有权
标题翻译：矢量频率压缩指令

公开(公告)号：US09459866B2

公开(公告)日：2016-10-04

申请号：US13993058

申请日：2011-12-30

申请人： Elmoustapha Ould-Ahmed-Vall , Suleyman Sair , Kshitij A. Doshi , Charles R. Yount , Bret L. Toll

发明人： Elmoustapha Ould-Ahmed-Vall , Suleyman Sair , Kshitij A. Doshi , Charles R. Yount , Bret L. Toll

IPC分类号： G06F9/30 , H03M7/46 , H03M7/30

CPC分类号： G06F9/30036 , G06F9/30018 , G06F9/30025 , G06F9/30032 , G06F9/3016 , H03M7/46 , H03M7/6005

摘要： A processor core that includes a hardware decode unit to decode a vector frequency compress instruction that includes a source operand and a destination operand. The source operand specifying a source vector register that includes a plurality of source data elements including one or more runs of identical data elements that are each to be compressed in a destination vector register as a value and run length pair. The destination operand identifies the destination vector register. The processor core also includes an execution engine unit to execute the decoded vector frequency compress instruction which causes, for each source data element, a value to be copied into the destination vector register to indicate that source data element's value. One or more runs of the source data elements equal are encoded in the destination vector register as the predetermined compression value followed by a run length for that run.

摘要翻译： 一种处理器核心，其包括用于解码包括源操作数和目的地操作数的向量频率压缩指令的硬件解码单元。源操作数指定源向量寄存器，其包括多个源数据元素，其包括在目的地向量寄存器中各自被压缩的相同数据元素的一个或多个游程作为值和游程长度对。目标操作数标识目标向量寄存器。处理器核心还包括执行引擎单元，用于执行解码的向量频率压缩指令，其对于每个源数据元素，其将被复制到目的地向量寄存器中的值指示源数据元素的值。源数据元素相等的一个或多个运行在目标向量寄存器中被编码为预定压缩值，后跟该运行的运行长度。

9.

发明授权
System, apparatus and method for generating a loop alignment count or a loop alignment mask 有权

公开(公告)号：US10083032B2

公开(公告)日：2018-09-25

申请号：US13993321

申请日：2011-12-14

申请人： Suleyman Sair , Elmoustapha Ould-Ahmed-Vall

发明人： Suleyman Sair , Elmoustapha Ould-Ahmed-Vall

IPC分类号： G06F9/30 , G06F9/38 , G06F9/345 , G06F9/32

CPC分类号： G06F9/30065 , G06F9/30018 , G06F9/30036 , G06F9/30072 , G06F9/325 , G06F9/345 , G06F9/3824

摘要： A loop alignment instruction indicates a base address of an array as a first operand, an iteration limit of a loop as a second operand, and a destination. The loop contains iterations and each iteration includes a data element of the array. A processor receives the loop alignment instruction, decodes the instruction for execution, and stores a result of the execution in the destination. The result indicates the number of data elements at a beginning of the array that are to be handled separately from a remaining portion of the array, such that the base address of the remaining portion of the array aligns with an alignment width.

10.

发明申请
SYSTEM, APPARATUS AND METHOD FOR GENERATING A LOOP ALIGNMENT COUNT OR A LOOP ALIGNMENT MASK 审中-公开
标题翻译：用于生成环路对齐计数或循环对准掩模的系统，装置和方法

公开(公告)号：US20140201510A1

公开(公告)日：2014-07-17

申请号：US13993321

申请日：2011-12-14

申请人： Suleyman Sair , Elmoustapha Ould-Ahmed-Vall

发明人： Suleyman Sair , Elmoustapha Ould-Ahmed-Vall

IPC分类号： G06F9/38

CPC分类号： G06F9/30065 , G06F9/30018 , G06F9/30036 , G06F9/30072 , G06F9/325 , G06F9/345 , G06F9/3824

摘要： A loop alignment instruction indicates a base address of an array as a first operand, an iteration limit of a loop as a second operand, and a destination. The loop contains iterations and each iteration includes a data element of the array. A processor receives the loop alignment instruction, decodes the instruction for execution, and stores a result of the execution in the destination. The result indicates the number of data elements at a beginning of the array that are to be handled separately from a remaining portion of the array, such that the base address of the remaining portion of the array aligns with an alignment width.

摘要翻译： 循环对齐指令表示阵列的基地址作为第一操作数，作为第二操作数的循环的迭代限制和目的地。循环包含迭代，每次迭代都包含数组的数据元素。处理器接收循环对准指令，解码执行指令，并将执行结果存储在目的地。结果表示数组开头的数组元素的数量，该数组元素将与数组的剩余部分分开处理，以使阵列剩余部分的基址与对齐宽度对齐。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类