-
公开(公告)号:US11644882B2
公开(公告)日:2023-05-09
申请号:US17337107
申请日:2021-06-02
Applicant: Hewlett Packard Enterprise Development LP
Inventor: Harumi Kuno , Alan Davis , Torsten Wilde , Daniel William Dauwe , Duncan Roweth , Ryan Dean Menhusen , Sergey Serebryakov , John L. Byrne , Vipin Kumar Kukkala , Sai Rahul Chalamalasetti
IPC: G06F1/3206 , G06F1/30 , H02J3/00 , G06F1/18
CPC classification number: G06F1/305 , G06F1/188 , G06F1/3206 , H02J3/003
Abstract: One embodiment provides a system and method for predicting network power usage associated with workloads. During operation, the system configures a simulator to simulate operations of a plurality of network components, which comprises embedding one or more event counters in each simulated network component. A respective event counter is configured to count a number of network-power-related events. The system collects, based on values of the event counters, network-power-related performance data associated with one or more sample workloads applied to the simulator; and trains a machine-learning model with the collected network-power-related performance data and characteristics of the sample workloads as training data 1, thereby facilitating prediction of network-power-related performance associated with a to-be-evaluated workload.
-
公开(公告)号:US20210034742A1
公开(公告)日:2021-02-04
申请号:US16526388
申请日:2019-07-30
Applicant: HEWLETT PACKARD ENTERPRISE DEVELOPMENT LP
Inventor: Naysen Robertson , Sai Rahul Chalamalasetti , William James Walker
Abstract: In some examples, an apparatus includes a management controller for use in a computer system having a processing resource for executing an operating system (OS) of the computer system, the management controller being separate from the processing resource and to perform, based on operation of the management controller within a cryptographic boundary, management of components of the computer system, the management of components comprising power control of the computer system. The management controller is to receive sensor data, perform facial recognition based on the sensor data, and determine whether to initiate a security action responsive to the facial recognition.
-
公开(公告)号:US10725940B2
公开(公告)日:2020-07-28
申请号:US16167494
申请日:2018-10-22
Applicant: Hewlett Packard Enterprise Development LP
Inventor: Qiong Cai , Paolo Faraboschi , Cong Xu , Ping Chi , Sai Rahul Chalamalasetti , Andrew C. Walton
Abstract: Techniques for reallocating a memory pending queue based on stalls are provided. In one aspect, it may be determined at a memory stop of a memory fabric that at least one class of memory access is stalled. It may also be determined at the memory stop of the memory fabric that there is at least one class of memory access that is not stalled. At least a portion of a memory pending queue may be reallocated from the class of memory access that is not stalled to the class of memory access that is stalled.
-
公开(公告)号:US20190056872A1
公开(公告)日:2019-02-21
申请号:US16167494
申请日:2018-10-22
Applicant: Hewlett Packard Enterprise Development LP
Inventor: Qiong Cai , Paolo Faraboschi , Cong Xu , Ping Ping , Sai Rahul Chalamalasetti , Andrew C. Walton
Abstract: Techniques for reallocating a memory pending queue based on stalls are provided. In one aspect, it may be determined at a memory stop of a memory fabric that at least one class of memory access is stalled. It may also be determined at the memory stop of the memory fabric that there is at least one class of memory access that is not stalled. At least a portion of a memory pending queue may be reallocated from the class of memory access that is not stalled to the class of memory access that is stalled.
-
公开(公告)号:US10108351B2
公开(公告)日:2018-10-23
申请号:US15190276
申请日:2016-06-23
Applicant: Hewlett Packard Enterprise Development LP
Inventor: Qiong Cai , Paolo Faraboschi , Cong Xu , Ping Chi , Sai Rahul Chalamalasetti , Andrew C. Walton
IPC: G06F3/06
Abstract: Techniques for reallocating a memory pending queue based on stalls are provided. In one aspect, it may be determined at a memory stop of a memory fabric that at least one class of memory access is stalled. It may also be determined at the memory stop of the memory fabric that there is at least one class of memory access that is not stalled. At least a portion of a memory pending queue may be reallocated from the class of memory access that is not stalled to the class of memory access that is stalled.
-
公开(公告)号:US12242966B2
公开(公告)日:2025-03-04
申请号:US18528935
申请日:2023-12-05
Applicant: Hewlett Packard Enterprise Development LP
Inventor: Sai Rahul Chalamalasetti , Paolo Faraboschi , Martin Foltin , Catherine Graves , Dejan S. Milojicic , John Paul Strachan , Sergey Serebryakov
Abstract: A crossbar array includes a number of memory elements. An analog-to-digital converter (ADC) is electronically coupled to the vector output register. A digital-to-analog converter (DAC) is electronically coupled to the vector input register. A processor is electronically coupled to the ADC and to the DAC. The processor may be configured to determine whether division of input vector data by output vector data from the crossbar array is within a threshold value, and if not within the threshold value, determine changed data values as between the output vector data and the input vector data, and write the changed data values to the memory elements of the crossbar array.
-
公开(公告)号:US20240362031A1
公开(公告)日:2024-10-31
申请号:US18308275
申请日:2023-04-27
Applicant: HEWLETT PACKARD ENTERPRISE DEVELOPMENT LP
Inventor: DEJAN S. MILOJICIC , Sai Rahul Chalamalasetti , Sergey Serebryakov
CPC classification number: G06F9/44505 , G06F11/3495
Abstract: Systems and methods are provided for an accelerator system that includes a baseline (production) accelerator, optimizing accelerator, and control hardware accelerator, and an operation of alternatingly switching the production/optimizing accelerators between production and optimizing. With two production/optimizing accelerators, at any given point in time, one accelerator adapts while another accelerator processes data. Once the second accelerator starts doing a better job (e.g., has adapted to data drift), the accelerators change their modes, and the trainable accelerator becomes the “optimized” one. The accelerators do this non-stop, thus maintaining redundancy, providing expected quality of service (QOS) and adapting to data/concept drift.
-
18.
公开(公告)号:US20240211212A1
公开(公告)日:2024-06-27
申请号:US18601259
申请日:2024-03-11
Applicant: Hewlett Packard Enterprise Development LP
Inventor: Craig Warner , Eun Sub Lee , Sai Rahul Chalamalasetti , Martin Foltin
CPC classification number: G06F7/5443 , G06F9/3867 , G06F9/522 , G06F40/20 , G06N3/063
Abstract: Systems and methods are provided for a multi-die dot-product engine (DPE) to provision large-scale machine learning inference applications. The multi-die DPE leverages a multi-chip architecture. For example, a multi-chip interface can include a plurality of DPE chips, where each DPE chip performs inference computations for performing deep learning operations. A hardware interface between a memory of a host computer and the plurality of DPE chips communicatively connects the plurality of DPE chips to the memory of the host computer system during an inference operation such that the deep learning operations are spanned across the plurality of DPE chips. Due to the multi-die architecture, multiple silicon devices are allowed to be used for inference, thereby enabling power-efficient inference for large-scale machine learning applications and complex deep neural networks. The multi-die DPE can be used to build a multi-device DNN inference system performing specific applications, such as object recognition, with high accuracy.
-
公开(公告)号:US20240112029A1
公开(公告)日:2024-04-04
申请号:US18528935
申请日:2023-12-05
Applicant: Hewlett Packard Enterprise Development LP
Inventor: Sai Rahul Chalamalasetti , Paolo Faraboschi , Martin Foltin , Catherine Graves , Dejan S. Milojicic , John Paul Strachan , Sergey Serebryakov
CPC classification number: G06N3/08 , G11C13/0069 , G11C2213/77
Abstract: A crossbar array includes a number of memory elements. An analog-to-digital converter (ADC) is electronically coupled to the vector output register. A digital-to-analog converter (DAC) is electronically coupled to the vector input register. A processor is electronically coupled to the ADC and to the DAC. The processor may be configured to determine whether division of input vector data by output vector data from the crossbar array is within a threshold value, and if not within the threshold value, determine changed data values as between the output vector data and the input vector data, and write the changed data values to the memory elements of the crossbar array.
-
公开(公告)号:US20240111970A1
公开(公告)日:2024-04-04
申请号:US18528086
申请日:2023-12-04
Applicant: Hewlett Packard Enterprise Development LP
Inventor: John Paul Strachan , Dejan S. Milojicic , Martin Foltin , Sai Rahul Chalamalasetti , Amit S. Sharma
Abstract: In some examples, a device includes a first processing core comprising a resistive memory array to perform an analog computation, and a digital processing core comprising a digital memory programmable with different values to perform different computations responsive to respective different conditions. The device further includes a controller to selectively apply input data to the first processing core and the digital processing core.
-
-
-
-
-
-
-
-
-