-
公开(公告)号:US20220350715A1
公开(公告)日:2022-11-03
申请号:US17865095
申请日:2022-07-14
Applicant: Intel Corporation
Inventor: Shen ZHOU , Cong LI , Tai HUANG , Kuljit S. BAINS
IPC: G06F11/20
Abstract: A system can respond to detection or prediction of an uncorrectable error (UE) in memory based on fault-aware analysis. The fault-aware analysis enables the system to generate a determination of a specific hardware element of the memory that is faulty. In response to detection of an error, the system can correlate a hardware configuration of the memory device with historical data indicating memory faults for hardware elements of the hardware configuration. Based on a determination of the specific component that likely caused the UE, the system can identify a region of memory associated with the detected UE and mirror the faulty region to a reserved memory space of the memory device for access to data of the faulty region.
-
2.
公开(公告)号:US20240241778A1
公开(公告)日:2024-07-18
申请号:US18562237
申请日:2021-12-13
Applicant: Intel Corporation
Inventor: Shen ZHOU , Cong LI , Kuljit S. BAINS , Ugonna ECHERUO , Reza E. DAFTARI , Theodros YIGZAW , Mariusz ORIOL
IPC: G06F11/07
CPC classification number: G06F11/073 , G06F11/079 , G06F11/0793
Abstract: A system (204) can respond to detection of an uncorrectable error (UE) (254) in memory (246) based on fault-aware analysis. The fault-aware analysis enables the system (204) to generate a determination of a specific hardware element of the memory (246) that caused the detected UE (254). In response to detection of a UE (254), the system (204) can correlate a hardware configuration (256) of the memory (246) device with historical data indicating memory (246) faults for hardware elements of the hardware configuration (256). Based on a determination of the specific component that likely caused the UE (254), the system (204) can issue a corrective action for the specific hardware element based on the determination.
-
公开(公告)号:US20230083193A1
公开(公告)日:2023-03-16
申请号:US17348435
申请日:2021-06-15
Applicant: Intel Corporation
Inventor: Shen ZHOU , Xiaoming DU , Cong LI , Kuljit S. BAINS , Rajat AGARWAL , Murugasamy K. NACHIMUTHU , Maciej LAWNICZAK , Chao Yan TANG , Mariusz ORIOL
IPC: G06F11/07
Abstract: A system can predict memory device failure through identification of correctable error patterns based on the memory architecture. The failure prediction can thus account for the circuit-level of the memory rather than the mere number or frequency of correctable errors. A failure prediction engine correlates hardware configuration of the memory device with correctable errors (CEs) detected in data of the memory device to predict an uncorrectable error (UE) based on the correlation.
-
4.
公开(公告)号:US20210279122A1
公开(公告)日:2021-09-09
申请号:US17317745
申请日:2021-05-11
Applicant: Intel Corporation
Inventor: Shen ZHOU , Cong LI , Kuljit S. BAINS , Xiaoming DU , Mariusz ORIOL
IPC: G06F11/07
Abstract: Methods and apparatus for lifetime telemetry on memory error statistics to improve memory failure analysis and prevention. Memory error information corresponding to detected correctable errors and uncorrectable memory errors are monitored, with the memory error information identifying an associated DRAM device in an associated DIMM. Corresponding micro-level error bits information from the memory error information is decoded and Micro-level Error Statistic Indicators (MESIs) are generated. Information associated with the MESIs from DRAM devices on the DIMMs are periodically written to persistent storage on those DIMMs. The MESIs for a given DIMM are updated over the lifetime of the DIMM.
-
-
-