RUNTIME SPARING FOR UNCORRECTABLE ERRORS BASED ON FAULT-AWARE ANALYSIS

    公开(公告)号:US20220350715A1

    公开(公告)日:2022-11-03

    申请号:US17865095

    申请日:2022-07-14

    Abstract: A system can respond to detection or prediction of an uncorrectable error (UE) in memory based on fault-aware analysis. The fault-aware analysis enables the system to generate a determination of a specific hardware element of the memory that is faulty. In response to detection of an error, the system can correlate a hardware configuration of the memory device with historical data indicating memory faults for hardware elements of the hardware configuration. Based on a determination of the specific component that likely caused the UE, the system can identify a region of memory associated with the detected UE and mirror the faulty region to a reserved memory space of the memory device for access to data of the faulty region.

    LIFETIME TELEMETRY ON MEMORY ERROR STATISTICS TO IMPROVE MEMORY FAILURE ANALYSIS AND PREVENTION

    公开(公告)号:US20210279122A1

    公开(公告)日:2021-09-09

    申请号:US17317745

    申请日:2021-05-11

    Abstract: Methods and apparatus for lifetime telemetry on memory error statistics to improve memory failure analysis and prevention. Memory error information corresponding to detected correctable errors and uncorrectable memory errors are monitored, with the memory error information identifying an associated DRAM device in an associated DIMM. Corresponding micro-level error bits information from the memory error information is decoded and Micro-level Error Statistic Indicators (MESIs) are generated. Information associated with the MESIs from DRAM devices on the DIMMs are periodically written to persistent storage on those DIMMs. The MESIs for a given DIMM are updated over the lifetime of the DIMM.

Patent Agency Ranking