Patent search ap:("Intel Corporation") AND inv:"Joseph Williams" Page 1

1.

发明授权
Vector processor utilizing massively fused operations 有权

公开(公告)号：US12282774B2

公开(公告)日：2025-04-22

申请号：US17358231

申请日：2021-06-25

Applicant: Intel Corporation

Inventor： Joseph Williams , Zoran Zivkovic , Jian-Guo Chen , Hong Wan , David Dougherty , Jay O'neill

IPC: G06F9/30

Abstract: Techniques are disclosed for the use of fused vector processor instructions by a vector processor architecture. Each fused vector processor instruction may include a set of fields associated with individual vector processing instructions. The vector processor architecture may implement local buffers facilitating a single vector processor instruction to be used to execute each of the individual vector processing instructions without re-accessing vector registers between each executed individual vector processing instruction. The vector processor architecture enables less communication across the interconnection network, thereby increasing interconnection network bandwidth and the speed of computations, and decreasing power usage.

2.

发明申请
PROCESSOR EMBEDDED STREAMING BUFFER 有权

公开(公告)号：US20250103331A1

公开(公告)日：2025-03-27

申请号：US18971447

申请日：2024-12-06

Applicant: Intel Corporation

Inventor： Joseph Williams

IPC: G06F9/30 , G06F9/355 , G06F9/38 , G06F15/78

Abstract: Techniques are disclosed for the use of local buffers integrated into the execution units of an array processor architecture. The use of local buffers results in less communication across the interconnection network implemented by processors, and increases interconnection network bandwidth, increases the speed of computations, and decreases power usage.

3.

发明公开
FLEXIBLE VECTORIZED PROCESSING ARCHITECTURE 审中-公开

公开(公告)号：US20240220249A1

公开(公告)日：2024-07-04

申请号：US18147099

申请日：2022-12-28

Applicant: Intel Corporation

Inventor： Jian-Guo Chen , David Dougherty , Madihally Narasimha , Joseph Othmer , Hong Wan , Joseph Williams , Zoran Zivkovic

IPC: G06F9/30 , G06F30/343

CPC classification number: G06F9/30036 , G06F9/3001 , G06F30/343

Abstract: Techniques are disclosed for the implementation of a programmable processing array architecture that realizes vectorized processing operations for a variety of applications. Such vectorized processing operations may include digital front end (DFE) processing operations, which include finite impulse response (FIR) filter processing operations. The programmable processing array architecture provides a front-end interconnection network that generates specific data sliding time window patterns in accordance with the particular DFE processing operation to be executed. The architecture enables the processed data generated in accordance with these sliding time window patterns to be fed to a set of multipliers and adders to generate output data. The architecture supports a wide range of processing operations to be performed via a single programmable processing array platform by leveraging the programmable nature of the array and the use of instruction sets.

4.

发明申请
VECTOR PROCESSOR SUPPORTING LINEAR INTERPOLATION ON MULTIPLE DIMENSIONS 有权

公开(公告)号：US20220197640A1

公开(公告)日：2022-06-23

申请号：US17131939

申请日：2020-12-23

Applicant: Intel Corporation

Inventor： Kameran Azadet , Joseph Williams , Zoran Zivkovic

IPC: G06F9/30 , G06F9/38 , G06F16/901

Abstract: Techniques are disclosed for a vector processor architecture that enables data interpolation in accordance with multiple dimensions, such as one-, two-, and three-dimensional linear interpolation. The vector processor architecture includes a vector processor and accompanying vector addressable memory that enable a simultaneous retrieval of multiple entries in the vector addressable memory to facilitate linear interpolation calculations. The vector processor architecture vastly increases the speed in which such calculations may occur compared to conventional processing architectures. Example implementations include the calculation of digital pre-distortion (DPD) coefficients for use with radio frequency (RF) transmitter chains to support multi-band applications.

5.

发明授权
Apparatuses, methods, and systems for a user defined formatting instruction to configure multicast Benes network circuitry 有权

公开(公告)号：US11334356B2

公开(公告)日：2022-05-17

申请号：US16457994

申请日：2019-06-29

Applicant: Intel Corporation

Inventor： Jian-Guo Chen , David T. Dougherty , Steven Pinault , Parakalan Venkataraghavan , Joseph Williams , Meng-Lin Yu , Kamran Azadet

IPC: G06F9/30 , H04L49/25 , H04L49/201 , G06F9/54

Abstract: Systems, methods, and apparatuses relating to a user defined formatting instruction to configure multicast Benes network circuitry are described. In one embodiment, a processor includes a decoder to decode a single instruction into a decoded single instruction, the single instruction having fields that identify packed input data, packed control data, and a packed data destination; and an execution unit to execute the decoded single instruction to: send the packed control data to respective control inputs of a circuit that comprises an inverse butterfly circuit coupled in series to a butterfly circuit, wherein the inverse butterfly circuit comprises a first plurality of stages of multicast switches and the butterfly circuit comprises a second plurality of stages of multicast switches, read, once from storage separate from the circuit, each element of the packed input data as respective inputs of the circuit, route the packed input data through the circuit according to the packed control data, and store resultant packed data from the circuit into the packed data destination.

6.

发明授权
Digital processor having instruction set with complex exponential non-linear function 有权
Title translation: 具有指令集的数字处理器具有复数指数非线性函数

公开(公告)号：US09529567B2

公开(公告)日：2016-12-27

申请号：US13701397

申请日：2012-10-26

Applicant: Intel Corporation

Inventor： Kameran Azadet , Albert Molina , Joseph H. Othmer , Parakalan Venkataraghavan , Meng-Lin Yu , Joseph Williams

IPC: G06F9/30 , G06F5/01 , H04B1/04 , H04L1/00 , H04L27/233 , H04B1/62 , H04L25/02 , H04L25/03 , H03M3/00 , H03F1/02 , H03F1/32 , H03F3/189 , H03F3/24 , H04B1/00

CPC classification number: G06F9/3001 , G06F5/01 , G06F9/30036 , G06F17/15 , H03F1/0288 , H03F1/3241 , H03F1/3258 , H03F3/189 , H03F3/24 , H03F2200/336 , H03F2201/3209 , H03F2201/3212 , H03F2201/3224 , H03F2201/3233 , H03H17/06 , H03M3/30 , H04B1/0003 , H04B1/0475 , H04B1/62 , H04B2001/0408 , H04L1/0054 , H04L25/02 , H04L25/03 , H04L25/03178 , H04L25/03216 , H04L25/4917 , H04L27/2334

Abstract: A digital processor is provided having an instruction set with a complex exponential function. The digital processor evaluates a complex exponential function for an input value, x, by obtaining a complex exponential software instruction having the input value, x, as an input; and in response to the complex exponential software instruction: invoking at least one complex exponential functional unit that implements complex exponential software instructions to apply the complex exponential function to the input value, x; and generating an output corresponding to the complex exponential of the input value, x. A complex exponential function for an input value, x, can be evaluated by wrapping the input value to maintain a given range; computing a coarse approximation angle using a look-up table; scaling the coarse approximation angle to obtain an angle from 0 to θ; and computing a fine corrective value using a polynomial approximation.

Abstract translation: 提供了具有具有复指数函数的指令集的数字处理器。数字处理器通过获得具有输入值x的复数指数软件指令作为输入来评估输入值x的复指数函数; 并且响应于复杂的指数软件指令：调用至少一个复数指数函数单元，其执行复指数软件指令以将复指数函数应用于输入值x; 并产生与输入值x的复指数相对应的输出。输入值x的复数指数函数可以通过包装输入值来保持给定范围来评估; 使用查找表计算粗近似角度; 缩放粗近似角度以获得从0到θ的角度; 以及使用多项式近似计算精细校正值。

7.

发明公开
PROGRAMMABLE PROCESSING ARRAY SUPPORTING MULTI-DIMENSIONAL INTERPOLATION COMPUTATIONS 审中-公开

公开(公告)号：US20240134818A1

公开(公告)日：2024-04-25

申请号：US18533369

申请日：2023-12-08

Applicant: Intel Corporation

Inventor： Zoran Zivkovic , Jian-Guo Chen , Jay ONeill , Joseph Williams

IPC: G06F15/80 , G06F1/03

CPC classification number: G06F15/8007 , G06F1/03

Abstract: Techniques are disclosed for a programmable processor architecture that enables data interpolation using an architecture that iteratively processes portions of a look-up table (LUT) in accordance with a fused single instruction stream, multiple data streams (SIMD) instruction. The LUT may contain segment entries that correspond to a result of evaluating a function using a corresponding index values, which represent an independent variable of the function. The index values are used to map data sample values in a data array that is to be interpolated to the segment entries. By using an iterative process of mapping data samples to valid segment entries contained in each LUT portion, the architecture advantageously facilitates scaling to support larger LUTs and thus may be expanded to enable linear interpolation on multiple dimensions.

8.

发明授权
Apparatuses, methods, and systems for vector processor architecture having an array of identical circuit blocks 有权

公开(公告)号：US11074213B2

公开(公告)日：2021-07-27

申请号：US16457993

申请日：2019-06-29

Applicant: Intel Corporation

Inventor： Joseph Williams , Jay O'Neill , Jeroen Leijten , Harm Peters , Eugene Scuteri

IPC: G06F15/80 , G06F15/78

Abstract: Systems, methods, and apparatuses relating to vector processor architecture having an array of identical circuit blocks are described. In one embodiment, a processor includes a single centralized circuit comprising an instruction decoder and a controller; and a plurality of circuit slices that each comprise an arithmetic logic unit, a multiplier, a register file, a local memory, and a same plurality of logic circuits and a packed data datapath in between, wherein each circuit slice includes a physical port that provides a unique identification value that identifies a circuit slice from the other circuit slices, and the controller is to broadcast a same configuration value to the plurality of circuit slices to cause a first circuit slice to enable a first logic circuit and enable a second logic circuit of the first circuit slice based on its unique identification value and the configuration value, and cause a second circuit slice to enable a same, first logic circuit and disable a same, second logic circuit of the second circuit slice based on its unique identification value and the configuration value.

9.

发明授权
Vector processor having instruction set with vector convolution function for fir filtering 有权
Title translation: 矢量处理器具有用于冷杉滤波的矢量卷积函数的指令集

公开(公告)号：US09280315B2

公开(公告)日：2016-03-08

申请号：US13701376

申请日：2012-10-26

Applicant: Intel Corporation

Inventor： Kameran Azadet , Meng-Lin Yu , Joseph Othmer , Joseph Williams , Albert Molina

IPC: G06F17/15 , G06F5/01 , H04B1/04 , H04L1/00 , H04L27/233 , H04B1/62 , G06F9/30 , H04L25/02 , H04L25/03 , H03M3/00 , H03F1/02 , H03F1/32 , H03F3/189 , H03F3/24 , H04B1/00

CPC classification number: G06F9/3001 , G06F5/01 , G06F9/30036 , G06F17/15 , H03F1/0288 , H03F1/3241 , H03F1/3258 , H03F3/189 , H03F3/24 , H03F2200/336 , H03F2201/3209 , H03F2201/3212 , H03F2201/3224 , H03F2201/3233 , H03H17/06 , H03M3/30 , H04B1/0003 , H04B1/0475 , H04B1/62 , H04B2001/0408 , H04L1/0054 , H04L25/02 , H04L25/03 , H04L25/03178 , H04L25/03216 , H04L25/4917 , H04L27/2334

Abstract: A vector processor is provided having an instruction set with a vector convolution function. The disclosed vector processor performs a convolution function between an input signal and a filter impulse response by obtaining a vector comprised of at least N1+N2-1 input samples; obtaining N2 time shifted versions of the vector (including a zero shifted version), wherein each time shifted version comprises N1 samples; and performing a weighted sum of the time shifted versions of the vector by a vector of N1 coefficients; and producing an output vector comprising one output value for each of the weighted sums. The vector processor performs the method, for example, in response to one or more vector convolution software instructions having a vector input. The vector can comprise a plurality of real or complex input samples and the filter impulse response can be expressed using a plurality of coefficients that are real or complex.

Abstract translation: 提供了具有具有向量卷积函数的指令集的向量处理器。所公开的向量处理器通过获得包括至少N1 + N2-1个输入样本的向量来执行输入信号和滤波器脉冲响应之间的卷积函数; 获得矢量的N2时移版本（包括零位版本），其中每个时移版本包括N1个样本; 以及通过N1系数的向量来执行向量的时移版本的加权和; 以及产生包括每个加权和的一个输出值的输出向量。矢量处理器例如响应于具有矢量输入的一个或多个向量卷积软件指令来执行该方法。矢量可以包括多个实数或复数输入样本，并且可以使用实数或复数的多个系数来表示滤波器脉冲响应。

10.

发明公开
PROCESSING PIPELINE WITH ZERO LOOP OVERHEAD 审中-公开

公开(公告)号：US20240345839A1

公开(公告)日：2024-10-17

申请号：US18647891

申请日：2024-04-26

Applicant: Intel Corporation

Inventor： Kameran Azadet , Jeroen Leijten , Joseph Williams

IPC: G06F9/30 , G06F9/38 , G06F9/46 , G06F9/54

CPC classification number: G06F9/30036 , G06F9/30079 , G06F9/3877 , G06F9/463 , G06F9/546

Abstract: Techniques are disclosed for reducing or eliminating loop overhead caused by function calls in processors that form part of a pipeline architecture. The processors in the pipeline process data blocks in an iterative fashion, with each processor in the pipeline completing one of several iterations associated with a processing loop for a commonly-executed function. The described techniques leverage the use of message passing for pipelined processors to enable an upstream processor to signal to a downstream processor when processing has been completed, and thus a data block is ready for further processing in accordance with the next loop processing iteration. The described techniques facilitate a zero loop overhead architecture, enable continuous data block processing, and allow the processing pipeline to function indefinitely within the main body of the processing loop associated with the commonly-executed function where efficiency is greatest.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification