AUTO-DISABLING DRAM ERROR CHECKING ON THRESHOLD

    公开(公告)号:US20180018217A1

    公开(公告)日:2018-01-18

    申请号:US15207679

    申请日:2016-07-12

    IPC分类号: G06F11/10 G06F3/06

    CPC分类号: G06F11/1004 G06F11/10

    摘要: An aspect includes a method for auto-disabling dynamic random access memory (DRAM) error checking based on a threshold. A method includes receiving data at a DRAM from a memory controller and executing error checking logic based on the data. The error checking logic detects and error condition in the data and it is determined, at the DRAM, whether detecting the error condition in the data causes an error threshold to be reached. The error checking logic is disabled at the DRAM in response to determining that detecting the error condition in the data causes the error the error threshold to be reached. The error condition is communicated to the memory controller in response to determining that detecting the error condition does not cause the error threshold to be reached.

    Implementing ECC redundancy using reconfigurable logic blocks
    25.
    发明授权
    Implementing ECC redundancy using reconfigurable logic blocks 有权
    使用可重构逻辑块实现ECC冗余

    公开(公告)号:US09230687B2

    公开(公告)日:2016-01-05

    申请号:US13867207

    申请日:2013-04-22

    摘要: A method, system and computer program product are provided for implementing ECC (Error Correction Codes) redundancy using reconfigurable logic blocks in a computer system. When a fail is detected when reading from memory, it is determined if the incorrect data is in the data or the ECC component of the data. When incorrect data is found in the ECC component of the data, and an actionable threshold is not reached, a predetermined Reliability, Availability, and Serviceability (RAS) action is taken. When the actionable threshold is reached with incorrect data identified in the ECC component of the data, an analysis process is performed to determine if the ECC logic is faulty. When a fail in the ECC logic is detected, the identified ECC failed logic is replaced with a spare block of logic.

    摘要翻译: 提供了一种方法,系统和计算机程序产品,用于在计算机系统中使用可重新配置的逻辑块实现ECC(纠错码)冗余。 当从存储器读取时检测到故障时,确定数据中的错误数据是否在数据或ECC组件中。 当在数据的ECC组件中找到不正确的数据,并且没有达到可操作的阈值时,采取预定的可靠性,可用性和可服务性(RAS)动作。 当在数据的ECC组件中识别出不正确的数据达到可操作的阈值时,执行分析处理以确定ECC逻辑是否有故障。 当检测到ECC逻辑中的故障时,所识别的ECC故障逻辑被替换为备用逻辑块。

    IMPLEMENTING SIMULTANEOUS READ AND WRITE OPERATIONS UTILIZING DUAL PORT DRAM
    26.
    发明申请
    IMPLEMENTING SIMULTANEOUS READ AND WRITE OPERATIONS UTILIZING DUAL PORT DRAM 有权
    实现同时使用双端口DRAM的读取和写入操作

    公开(公告)号:US20150213854A1

    公开(公告)日:2015-07-30

    申请号:US14310717

    申请日:2014-06-20

    IPC分类号: G11C7/10

    摘要: A method, system and memory controller are provided for implementing simultaneous read and write operations in a memory subsystem utilizing a dual port Dynamic Random Access Memory (DRAM) configuration. A DRAM includes a first partition and a second partition. A memory controller determines if memory requirements are above or below a usage threshold. If the memory requirements are below the usage threshold, the memory is partitioned into a read buffer and a write buffer, with writes going to the write buffer and reads coming from the read buffer, data being transferred from the write buffer to the read buffer through an Error Correction Code (ECC) engine. If the memory requirements are above the usage threshold, the entire memory is used for reads and writes.

    摘要翻译: 提供了一种方法,系统和存储器控制器,用于在利用双端口动态随机存取存储器(DRAM)配置的存储器子系统中实现同时的读和写操作。 DRAM包括第一分区和第二分区。 存储器控制器确定存储器需求是否高于或低于使用阈值。 如果存储器要求低于使用阈值,则存储器被划分为读缓冲器和写缓冲器,其中写入缓冲器的写入和来自读缓冲器的读取,数据从写缓冲器传送到读缓冲器,通过 纠错码(ECC)引擎。 如果内存要求高于使用阈值,则整个内存将用于读取和写入。

    SELF MONITORING AND SELF REPAIRING ECC
    27.
    发明申请
    SELF MONITORING AND SELF REPAIRING ECC 有权
    自我监测和自我修复ECC

    公开(公告)号:US20140250340A1

    公开(公告)日:2014-09-04

    申请号:US13781807

    申请日:2013-03-01

    IPC分类号: G06F11/28

    摘要: Exemplary embodiments of the present invention disclose a method and system for monitoring a first Error Correcting Code (ECC) device for failure and replacing the first ECC device with a second ECC device if the first ECC device begins to fail or fails. In a step, an exemplary embodiment detects that a specified number of correctable errors is exceeded. In another step, an exemplary embodiment detects the occurrence of an uncorrectable error. In another step, an exemplary embodiment performs a loopback test on an ECC device if a specified number of correctable errors is exceeded or if an uncorrectable error occurs. In another step, an exemplary embodiment replaces an ECC device that fails the loopback test with an ECC device that passes a loopback test.

    摘要翻译: 本发明的示例性实施例公开了一种用于监视第一纠错码(ECC)设备的方法和系统,用于如果第一ECC设备开始失败或失败,则用第二ECC设备故障并替换第一ECC设备。 在一个步骤中,示例性实施例检测到超过了指定数量的可校正错误。 在另一步骤中,示例性实施例检测出不可校正的错误。 在另一步骤中,如果超过指定数量的可校正错误或者发生不可校正的错误,则示例性实施例对ECC设备执行环回测试。 在另一步骤中,示例性实施例用通过环回测试的ECC设备替代了对环回测试失败的ECC设备。

    REAL-TIME ERROR DEBUGGING
    28.
    发明公开

    公开(公告)号:US20230153190A1

    公开(公告)日:2023-05-18

    申请号:US17454987

    申请日:2021-11-15

    IPC分类号: G06F11/07 G06F11/36

    摘要: In an approach to improve resolving defects within computer hardware, programs, software, or systems, embodiments pause mainline traffic and isolating interface or retention issues, and determine one or more types of errors in an event of a mainline traffic fail, wherein debug techniques are applied to fail information to resolve or further diagnose the one or more types of errors, and wherein the debug techniques are tracked and categorized. Additionally, embodiments apply corrective read actions to a detected error based on previously stored corrective actions associated with the detected error, and responsive to identifying no additional actions are required, restoring a collected system data. Further, embodiments, resume the paused mainline traffic.

    Hardware abstraction in software or firmware for hardware calibration

    公开(公告)号:US10936222B2

    公开(公告)日:2021-03-02

    申请号:US16445485

    申请日:2019-06-19

    IPC分类号: G06F12/00 G06F3/06 G06F9/445

    摘要: A computer-implemented method for calibrating DRAM is provided. A non-limiting example of the computer-implemented method includes reading, by a processor, system configuration information and disabling, by the processor, one or more steps in a list of calibration steps to be performed based on the system configuration information to leave a list of remaining calibration steps. Based on a determination that two or more remaining calibration steps are co-dependent, the method configures, by the processor, a single calibration step that encapsulates the co-dependent algorithm and places, by the processor, the single calibration step in a list of steps to be called. The method then provides, by the processor, the list of steps to be called.