Patent search ap:("INTEL CORPORATION") AND inv:"Elmoustapha OULD-AHMED-VALL" Page 4

31.

发明申请
SYSTEMS AND METHODS FOR PERFORMING HORIZONTAL TILE OPERATIONS 审中-公开

公开(公告)号：US20190042261A1

公开(公告)日：2019-02-07

申请号：US16131382

申请日：2018-09-14

Applicant: Intel Corporation

Inventor： Christopher J. HUGHES , Bret TOLL , Dan BAUM , Elmoustapha OULD-AHMED-VALL , Raanan SADE , Robert VALENTINE , Mark J. CHARNEY , Alexander F. HEINECKE

IPC: G06F9/38 , G06F9/30

Abstract: Disclosed embodiments relate to systems and methods for performing instructions specifying horizontal tile operations. In one example, a processor includes fetch circuitry to fetch an instruction specifying a horizontal tile operation, a location of a M by N source matrix comprising K groups of elements, and locations of K destinations, wherein each of the K groups of elements comprises the same number of elements, decode circuitry to decode the fetched instruction, and execution circuitry to respond to the decoded instruction by generating K results, each result being generated by performing the specified horizontal tile operation across every element of a corresponding group of the K groups, and writing each generated result to a corresponding location of the K specified destination locations.

32.

发明申请
APPARATUS AND METHOD TO REVERSE AND PERMUTE BITS IN A MASK REGISTER 审中-公开

公开(公告)号：US20180039497A1

公开(公告)日：2018-02-08

申请号：US15785016

申请日：2017-10-16

Applicant: Intel Corporation

Inventor： Elmoustapha OULD-AHMED-VALL , Robert VALENTINE

IPC: G06F9/30

CPC classification number: G06F9/30018 , G06F9/30032 , G06F9/30036 , G06F9/30098

Abstract: An apparatus and method are described for performing a bit reversal and permutation on mask values. For example, a processor is described to execute an instruction to perform the operations of: reading a plurality of mask bits stored in a source mask register, the mask bits associated with vector data elements of a vector register; and performing a bit reversal operation to copy each mask bit from a source mask register to a destination mask register, wherein the bit reversal operation causes bits from the source mask register to be reversed within the destination mask register resulting in a symmetric, mirror image of the original bit arrangement.

33.

发明申请
Machine Level Instructions to Compute a 3D Z-Curve Index from 3D Coordinates 审中-公开
Title translation: 从3D坐标计算3D Z曲线索引的机器级别说明

公开(公告)号：US20160139919A1

公开(公告)日：2016-05-19

申请号：US14542499

申请日：2014-11-14

Applicant: Intel Corporation

Inventor： Arnold Kerry EVANS , Elmoustapha OULD-AHMED-VALL

IPC: G06F9/30 , G06F9/355 , G06F9/38

CPC classification number: G06F9/30018 , G06F9/30025 , G06F9/30032 , G06F9/30036 , G06F9/30109 , G06F9/355 , G06F9/3802 , G06F9/3895

Abstract: In one embodiment, a processor includes 32-bit and 64-bit machine level instructions to compute a 3D Z-curve Index. A processor decode unit is configured to decode a z-curve ordering instruction having three source operands, each operand associated with one of a first, second, or third coordinate and a processor execution unit is configured to execute the decoded instruction before outputting the 3D Z-curve index to a location specified by a destination operand.

Abstract translation: 在一个实施例中，处理器包括用于计算3D Z曲线索引的32位和64位机器级指令。处理器解码单元被配置为解码具有三个源操作数的z曲线排序指令，每个操作数与第一，第二或第三坐标之一相关联，并且处理器执行单元被配置为在输出3D Z之前执行解码指令 -curve索引到目标操作数指定的位置。

34.

发明申请
SCALAR CORE INTEGRATION 有权

公开(公告)号：US20250068588A1

公开(公告)日：2025-02-27

申请号：US18822815

申请日：2024-09-03

Applicant: Intel Corporation

Inventor： Joydeep RAY , Aravindh ANANTARAMAN , Abhishek R. APPU , Altug KOKER , Elmoustapha OULD-AHMED-VALL , Valentin ANDREI , Subramaniam MAIYURAN , Nicolas GALOPPO VON BORRIES , Varghese GEORGE , Mike MACPHERSON , Ben ASHBAUGH , Murali RAMADOSS , Vikranth VEMULAPALLI , William SADLER , Jonathan PEARCE , Sungye KIM

IPC: G06F15/80 , G06F9/30 , G06F9/38 , G06T15/00

Abstract: Methods and apparatus relating to scalar core integration in a graphics processor. In an example, an apparatus comprises a processor to receive a set of workload instructions for a graphics workload from a host complex, determine a first subset of operations in the set of operations that is suitable for execution by a scalar processor complex of the graphics processing device and a second subset of operations in the set of operations that is suitable for execution by a vector processor complex of the graphics processing device, assign the first subset of operations to the scalar processor complex for execution to generate a first set of outputs, assign the second subset of operations to the vector processor complex for execution to generate a second set of outputs. Other embodiments are also disclosed and claimed.

35.

发明公开
DYNAMIC MEMORY RECONFIGURATION 审中-公开

公开(公告)号：US20240184739A1

公开(公告)日：2024-06-06

申请号：US18432859

申请日：2024-02-05

Applicant: INTEL CORPORATION

Inventor： Joydeep RAY , Niranjan COORAY , Subramaniam MAIYURAN , Altug KOKER , Prasoonkumar SURTI , Varghese GEORGE , Valentin ANDREI , Abhishek APPU , Guadalupe GARCIA , Pattabhiraman K , Sungye KIM , Sanjay KUMAR , Pratik MAROLIA , Elmoustapha OULD-AHMED-VALL , Vasanth RANGANATHAN , William SADLER , Lakshminarayanan STRIRAMASSARMA

IPC: G06F15/78 , G06F7/544 , G06F7/575 , G06F7/58 , G06F9/30 , G06F9/38 , G06F9/50 , G06F12/02 , G06F12/06 , G06F12/0802 , G06F12/0804 , G06F12/0811 , G06F12/0862 , G06F12/0866 , G06F12/0871 , G06F12/0875 , G06F12/0882 , G06F12/0888 , G06F12/0891 , G06F12/0893 , G06F12/0895 , G06F12/0897 , G06F12/1009 , G06F12/128 , G06F15/80 , G06F17/16 , G06F17/18 , G06N3/08 , G06T1/20 , G06T1/60 , G06T15/06 , H03M7/46

CPC classification number: G06F15/7839 , G06F7/5443 , G06F7/575 , G06F7/588 , G06F9/3001 , G06F9/30014 , G06F9/30036 , G06F9/3004 , G06F9/30043 , G06F9/30047 , G06F9/30065 , G06F9/30079 , G06F9/3887 , G06F9/5011 , G06F9/5077 , G06F12/0215 , G06F12/0238 , G06F12/0246 , G06F12/0607 , G06F12/0802 , G06F12/0804 , G06F12/0811 , G06F12/0862 , G06F12/0866 , G06F12/0871 , G06F12/0875 , G06F12/0882 , G06F12/0888 , G06F12/0891 , G06F12/0893 , G06F12/0895 , G06F12/0897 , G06F12/1009 , G06F12/128 , G06F15/8046 , G06F17/16 , G06F17/18 , G06T1/20 , G06T1/60 , H03M7/46 , G06F9/3802 , G06F9/3818 , G06F9/3867 , G06F2212/1008 , G06F2212/1021 , G06F2212/1044 , G06F2212/302 , G06F2212/401 , G06F2212/455 , G06F2212/60 , G06N3/08 , G06T15/06

Abstract: Embodiments described herein provide techniques to enable the dynamic reconfiguration of memory on a general-purpose graphics processing unit. One embodiment described herein enables dynamic reconfiguration of cache memory bank assignments based on hardware statistics. One embodiment enables for virtual memory address translation using mixed four kilobyte and sixty-four kilobyte pages within the same page table hierarchy and under the same page directory. One embodiment provides for a graphics processor and associated heterogenous processing system having near and far regions of the same level of a cache hierarchy.

36.

发明申请
SYSTEMS, METHODS, AND APPARATUSES FOR ZEROING A MATRIX 有权

公开(公告)号：US20220300286A1

公开(公告)日：2022-09-22

申请号：US17833643

申请日：2022-06-06

Applicant: Intel Corporation

Inventor： Robert VALENTINE , Menachem ADELMAN , Zeev SPERBER , Mark J. CHARNEY , Bret L. TOLL , Jesus CORBAL , Alexander F. HEINECKE , Barukh ZIV , Elmoustapha OULD-AHMED-VALL , Stanislav SHWARTSMAN

IPC: G06F9/30 , G06F7/485 , G06F7/487 , G06F17/16 , G06F7/76 , G06F9/38

Abstract: Embodiments detailed herein relate to matrix operations. In particular, performing a matrix operation of zeroing a matrix in response to a single instruction. For example, a processor detailed which includes decode circuitry to decode an instruction having fields for an opcode and a source/destination matrix operand identifier; and execution circuitry to execute the decoded instruction to zero each data element of the identified source/destination matrix.

37.

发明申请
SYSTEMS, METHODS, AND APPARATUSES FOR TILE STORE 有权

公开(公告)号：US20220291926A1

公开(公告)日：2022-09-15

申请号：US17706413

申请日：2022-03-28

Applicant: Intel Corporation

Inventor： Robert VALENTINE , Menachem ADELMAN , Elmoustapha OULD-AHMED-VALL , Bret L. TOLL , Milind B. GIRKAR , Zeev SPERBER , Mark J. CHARNEY , Rinat RAPPOPORT , Jesus CORBAL , Stanislav SHWARTSMAN , Igor YANOVER , Alexander F. HEINECKE , Barukh ZIV , Dan BAUM , Yuri GEBIL

IPC: G06F9/30 , G06F7/485 , G06F7/487 , G06F17/16 , G06F7/76 , G06F9/38

Abstract: Embodiments detailed herein relate to matrix operations. In particular, the loading of a matrix (tile) from memory. For example, support for a loading instruction is described in at least a form of decode circuitry to decode an instruction having fields for an opcode, a source matrix operand identifier, and destination memory information, and execution circuitry to execute the decoded instruction to store each data element of configured rows of the identified source matrix operand to memory based on the destination memory information

38.

发明申请
SYSTEMS, METHODS, AND APPARATUS FOR MATRIX MOVE 有权

公开(公告)号：US20220236989A1

公开(公告)日：2022-07-28

申请号：US17587637

申请日：2022-01-28

Applicant: Intel Corporation

Inventor： Robert VALENTINE , Zeev SPERBER , Mark J. CHARNEY , Bret L. TOLL , Jesus CORBAL , Dan BAUM , Alexander HEINECKE , Elmoustapha OULD-AHMED-VALL

IPC: G06F9/30 , G06F7/485 , G06F7/487 , G06F17/16 , G06F7/76 , G06F9/38

Abstract: Detailed herein are embodiment systems, processors, and methods for matrix move. For example, a processor comprising decode circuitry to decode an instruction having fields for an opcode, a source matrix operand identifier, and a destination matrix operand identifier; and execution circuitry to execute the decoded instruction to move each data element of the identified source matrix operand to corresponding data element position of the identified destination matrix operand is described.

39.

发明申请
EFFICIENT DIVIDE AND ACCUMULATE INSTRUCTION WHEN AN OPERAND IS EQUAL TO OR NEAR A POWER OF TWO 有权

公开(公告)号：US20220197634A1

公开(公告)日：2022-06-23

申请号：US17129693

申请日：2020-12-21

Applicant: Intel Corporation

Inventor： Elmoustapha OULD-AHMED-VALL

IPC: G06F9/30

Abstract: Techniques and apparatuses for performing a near multiply and accumulate instruction are described. An apparatus includes decoder circuitry to decode an instruction, the instruction to include a field for an identifier of a first source operand, a field for an identifier of a second source operand, and a field for an identifier of a third source operand. The apparatus also includes execution circuitry to execute the decoded instruction to perform a division on a pair of data elements from the first and second source operands to produce a quotient data element via a shift operation when at least one data element in the pair of data elements is equal to a power of two or near a power of two or via a division operation on the pair of data elements when the pair of data elements is neither equal to a power of two or near a power of two.

40.

发明申请
SYSTEMS FOR PERFORMING INSTRUCTIONS TO QUICKLY CONVERT AND USE TILES AS 1D VECTORS 有权

公开(公告)号：US20210318874A1

公开(公告)日：2021-10-14

申请号：US17240882

申请日：2021-04-26

Applicant: INTEL CORPORATION

Inventor： Bret TOLL , Christopher J. HUGHES , Dan BAUM , Elmoustapha OULD-AHMED-VALL , Raanan SADE , Robert VALENTINE , Mark J. CHARNEY , Alexander F. HEINECKE

IPC: G06F9/30

Abstract: Disclosed embodiments relate to systems for performing instructions to quickly convert and use matrices (tiles) as one-dimensional vectors. In one example, a processor includes fetch circuitry to fetch an instruction having fields to specify an opcode, locations of a two-dimensional (2D) matrix and a one-dimensional (1D) vector, and a group of elements comprising one of a row, part of a row, multiple rows, a column, part of a column, multiple columns, and a rectangular sub-tile of the specified 2D matrix, and wherein the opcode is to indicate a move of the specified group between the 2D matrix and the 1D vector, decode circuitry to decode the fetched instruction; and execution circuitry, responsive to the decoded instruction, when the opcode specifies a move from 1D, to move contents of the specified 1D vector to the specified group of elements.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification