MULTI-DIE DOT-PRODUCT ENGINE TO PROVISION LARGE SCALE MACHINE LEARNING INFERENCE APPLICATIONS

    Publication (Announcement) Number: US20240211212A1

    Publication (Announcement) Date: 2024-06-27

    Application Number: US18601259

    Application Date: 2024-03-11

    CPC classification number: G06F7/5443 G06F9/3867 G06F9/522 G06F40/20 G06N3/063

    Abstract: Systems and methods are provided for a multi-die dot-product engine (DPE) to provision large-scale machine learning inference applications. The multi-die DPE leverages a multi-chip architecture. For example, a multi-chip interface can include a plurality of DPE chips, where each DPE chip performs inference computations for performing deep learning operations. A hardware interface between a memory of a host computer and the plurality of DPE chips communicatively connects the plurality of DPE chips to the memory of the host computer system during an inference operation such that the deep learning operations are spanned across the plurality of DPE chips. Because of the multi-die architecture, multiple silicon devices can be used for inference, enabling power-efficient inference for large-scale machine learning applications and complex deep neural networks. The multi-die DPE can be used to build a multi-device DNN inference system that performs specific applications, such as object recognition, with high accuracy.
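    To make the "spanning" idea concrete, below is a minimal Python sketch of how one layer's dot products might be partitioned across several dies and accumulated on the host. The die count, the column-wise weight split, and the dpe_partial_dot interface are illustrative assumptions for this listing, not the architecture claimed in the application.

# Illustrative sketch only: partitions a layer's matrix-vector product across
# several hypothetical DPE dies and sums the partial results on the host.
# NUM_DIES, dpe_partial_dot, and the column-wise split are assumptions.
import numpy as np

NUM_DIES = 4  # assumed number of DPE chips on the multi-chip interface

def dpe_partial_dot(weight_slice: np.ndarray, x_slice: np.ndarray) -> np.ndarray:
    """Stand-in for one die's dot-product computation on its weight slice."""
    return weight_slice @ x_slice

def multi_die_layer(weights: np.ndarray, x: np.ndarray) -> np.ndarray:
    """Span one layer's dot products across NUM_DIES dies.

    The weight matrix is split column-wise so each die holds a slice of the
    input dimension; the host accumulates the partial outputs.
    """
    col_splits = np.array_split(np.arange(weights.shape[1]), NUM_DIES)
    partials = [
        dpe_partial_dot(weights[:, cols], x[cols])  # work dispatched to one die
        for cols in col_splits
    ]
    return np.sum(partials, axis=0)  # host-side accumulation over dies

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    W = rng.standard_normal((128, 512))
    x = rng.standard_normal(512)
    # Spanned result matches a single-device matrix-vector product.
    assert np.allclose(multi_die_layer(W, x), W @ x, atol=1e-6)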

    Adjustable Precision for Multi-Stage Compute Processes

    Publication (Announcement) Number: US20200042287A1

    Publication (Announcement) Date: 2020-02-06

    Application Number: US16052218

    Application Date: 2018-08-01

    Abstract: Disclosed techniques provide for dynamically changing precision of a multi-stage compute process. For example, the precision of neural network (NN) parameters may be changed on a per-layer basis depending on properties of incoming data streams and per-layer performance of the NN, among other considerations. NNs include multiple layers that may each be calculated with a different degree of accuracy and, therefore, a different compute resource overhead (e.g., memory, processor resources, etc.). NNs are usually trained with 32-bit or 16-bit floating-point numbers. Once trained, an NN may be deployed in production. One approach to reduce compute overhead is to reduce the parameter precision of NNs to 16-bit or 8-bit representations for deployment. The conversion to an acceptable lower precision is usually determined manually before deployment, and precision levels are fixed while deployed. Disclosed techniques and implementations address automatic rather than manual determination of precision levels for different stages, and dynamically adjust precision for each stage at run-time.
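    Below is a minimal Python sketch of per-stage precision chosen at run time: each layer picks a bit width from a simple heuristic and applies uniform quantization before its matrix multiply. The bit-width menu, the selection heuristic, and the quantization scheme are assumptions for illustration, not the adjustment policy disclosed in the application.

# Illustrative sketch only: per-layer bit width chosen at run time, followed by
# symmetric uniform quantization. pick_bits and quantize are assumed helpers.
import numpy as np

def quantize(x: np.ndarray, bits: int) -> np.ndarray:
    """Symmetric uniform quantization of x to the given bit width."""
    qmax = 2 ** (bits - 1) - 1
    scale = np.max(np.abs(x)) / qmax if np.any(x) else 1.0
    return np.round(x / scale).clip(-qmax, qmax) * scale

def pick_bits(activations: np.ndarray) -> int:
    """Toy run-time policy: wider activation dynamic range -> more bits."""
    spread = np.max(np.abs(activations))
    if spread > 10.0:
        return 16
    if spread > 1.0:
        return 8
    return 4

def layer_forward(weights: np.ndarray, x: np.ndarray) -> np.ndarray:
    bits = pick_bits(x)                      # per-layer precision chosen at run time
    wq, xq = quantize(weights, bits), quantize(x, bits)
    return np.maximum(wq @ xq, 0.0)          # quantized matmul + ReLU

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    x = rng.standard_normal(256)
    W1 = rng.standard_normal((256, 256))
    W2 = rng.standard_normal((128, 256))
    h = layer_forward(W1, x)   # first stage, precision picked from x
    y = layer_forward(W2, h)   # second stage, precision picked from h
    print("output norm:", float(np.linalg.norm(y)))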
