-
11.
公开(公告)号:US20080229035A1
公开(公告)日:2008-09-18
申请号:US11685395
申请日:2007-03-13
IPC分类号: G06F12/00
CPC分类号: G06F12/0607
摘要: Systems and methods for implementing a stride valise for memory are provided. One embodiment includes a system comprising a plurality of memory modules configured to store interleaved data in a plurality of memory storage units according to a predetermined interleave. The plurality of memory storage units can be defined by a memory range of consecutive addresses. The system also comprises a memory test device configured to access a portion of the plurality of memory storage units in a sequence that repeats according to a programmable stride value.
摘要翻译: 提供了用于实现用于存储器的步幅值的系统和方法。 一个实施例包括一种系统,其包括多个存储器模块,其被配置为根据预定的交错存储多个存储器存储单元中的交织数据。 多个存储器存储单元可以由连续地址的存储器范围来定义。 该系统还包括被配置为以按照可编程步幅值重复的顺序访问多个存储器单元的一部分的存储器测试设备。
-
公开(公告)号:US08713350B2
公开(公告)日:2014-04-29
申请号:US12633648
申请日:2009-12-08
IPC分类号: G06F11/00
CPC分类号: G06F11/0793 , G06F11/0712 , G06F11/0766
摘要: A method of managing errors in a data processing system may involve at least one computer system. Each computer system may include a processor that executes an operating system, firmware, and system memory storing instructions for the operating system. A firmware error handler resident in the firmware may identify an error occurring in the computer system. The firmware error handler may determine whether the operating system is required to take an action in response to the error. If the operating system is not required to take an action in response to the error, the firmware error handler may create an error log accessible to the operating system appropriate to cause the operating system to take no action.
摘要翻译: 管理数据处理系统中的错误的方法可以涉及至少一个计算机系统。 每个计算机系统可以包括执行存储操作系统的指令的操作系统,固件和系统存储器的处理器。 驻留在固件中的固件错误处理程序可能会识别计算机系统中发生的错误。 固件错误处理程序可以确定操作系统是否需要采取响应错误的动作。 如果操作系统不需要采取措施来响应错误,则固件错误处理程序可能会创建适用于使操作系统不采取任何操作的操作系统可访问的错误日志。
-
公开(公告)号:US20110179314A1
公开(公告)日:2011-07-21
申请号:US12691512
申请日:2010-01-21
IPC分类号: G06F11/07
CPC分类号: G06F11/0766 , G06F11/0721 , G06F11/0793
摘要: Method and system of error logging. At least some of the illustrative embodiments are methods including detecting assertion of an error pin by a processor system, (comprising at least a main processor and a chipset, the assertion of the error pin an indication to reboot the processor system) the detecting by a reset circuit, notifying a management processor (distinct from the main processor) that the error pin is asserted (the notifying by the reset circuit), writing to a plurality of registers in the chipset (the writing by the management processor), de-asserting a reset pin of the main processor, and then executing by the main processor an error-handling code to generate an error log.
摘要翻译: 错误记录的方法和系统 说明性实施例中的至少一些是包括检测处理器系统的错误引脚的断言(包括至少主处理器和芯片组,断言错误引脚的重新引导处理器系统的指示)的方法, 复位电路,通知管理处理器(与主处理器不同),错误引脚被断言(由复位电路通知),写入芯片组中的多个寄存器(管理处理器的写入),取消断言 主处理器的复位引脚,然后由主处理器执行错误处理代码以生成错误日志。
-
14.
公开(公告)号:US07103639B2
公开(公告)日:2006-09-05
申请号:US09730221
申请日:2000-12-05
申请人: Andrew C. Walton , Guy L. Kuntz
发明人: Andrew C. Walton , Guy L. Kuntz
IPC分类号: G06F15/167
CPC分类号: G06F15/177 , G06F9/4405 , G06F11/1417 , G06F11/1482 , G06F11/202
摘要: The present invention flexibly manages the formation of a partition from a plurality of independently executing cells (discrete hardware entities comprising system resources) in preparation for the instantiation of an operating system instance upon the partition. Specifically, the invention manages configuration activities that occur to transition from having individual cells acting independently, and having cells rendezvous, to having cells become interdependent to continue operations as a partition. The invention manages the partitioning forming process such that no single point of failure disrupts the process. Instead, the invention is implemented as a distributed application wherein individual cells independently execute instructions based upon respective copies of the complex profile (a “map” of the complex configuration). Also, the invention adapts to a degree of delay associated with certain cells becoming ready to join the formation or rendezvous process. The invention is able to cope with missing, unavailable, or otherwise malfunctioning cells. Additionally, the invention analyzes present cells to determine their compatibility and reject cells that are not compatible.
-
公开(公告)号:US08990646B2
公开(公告)日:2015-03-24
申请号:US13485824
申请日:2012-05-31
CPC分类号: G11C29/44 , G06F11/2205 , G06F11/27 , G11C5/04 , G11C29/028 , G11C29/08 , G11C29/56008 , G11C29/76 , G11C2029/0401 , G11C2029/0409
摘要: An error test routine tests for a type of memory error by changing a content of a memory module. A memory handling procedure isolates the memory error in response to a positive outcome of the error test routine. The error test routine and memory handling procedure are to be performed at runtime transparent to an operating system. Information corresponding to isolating the memory error is stored.
摘要翻译: 错误测试例程通过更改内存模块的内容来测试内存错误类型。 存储器处理程序响应于错误测试例程的肯定结果来隔离存储器错误。 错误测试例程和内存处理过程将在运行时对操作系统透明执行。 存储与隔离存储器错误相对应的信息。
-
16.
公开(公告)号:US08612797B2
公开(公告)日:2013-12-17
申请号:US11394585
申请日:2006-03-31
IPC分类号: G06F11/07
CPC分类号: G11C29/76
摘要: System and methods of selectively managing errors in memory modules. In an exemplary implementation, a method may include monitoring for persistent errors in the memory modules. The methods may also include mapping at least a portion of the memory modules to a spare memory cache only to obviate persistent errors. The method may also include initiating memory erasure on at least a portion of the memory modules only if insufficient cache lines are available in the spare memory cache.
摘要翻译: 有选择地管理存储器模块中的错误的系统和方法。 在示例性实现中,方法可以包括监视存储器模块中的持续错误。 所述方法还可以包括将至少一部分存储器模块映射到备用存储器高速缓存以避免持续错误。 该方法还可以包括仅当在备用存储器高速缓存中可用的高速缓存行不足时才启动存储器模块的至少一部分上的存储器擦除。
-
公开(公告)号:US20130326293A1
公开(公告)日:2013-12-05
申请号:US13485824
申请日:2012-05-31
CPC分类号: G11C29/44 , G06F11/2205 , G06F11/27 , G11C5/04 , G11C29/028 , G11C29/08 , G11C29/56008 , G11C29/76 , G11C2029/0401 , G11C2029/0409
摘要: An error test routine is to test for a type of memory error by changing a content of a memory module. A memory handling procedure is to isolate the memory error in response to a positive outcome of the error test routine. The error test routine and memory handling procedure is to be performed at runtime transparent to an operating system. Information corresponding to isolating the memory error is stored.
摘要翻译: 错误测试例程是通过更改内存模块的内容来测试一种内存错误。 存储器处理过程是响应于错误测试例程的肯定结果来隔离存储器错误。 错误测试程序和内存处理程序将在运行时对操作系统透明执行。 存储与隔离存储器错误相对应的信息。
-
公开(公告)号:US20120239973A1
公开(公告)日:2012-09-20
申请号:US13258392
申请日:2009-12-08
IPC分类号: G06F11/07
CPC分类号: G06F11/0784 , G06F11/0712 , G06F11/079 , G06F11/0793
摘要: A method of managing errors in a data processing system (10) may involve at least one computer system (14). Each computer system (14) may include a plurality of hardware components (18), including a processor (20) for executing a respective operating system and a memory (22) for storing instructions for the respective operating system (24), and firmware (28) including a firmware error handler (30). For each computer system (14), the firmware error handler (30) may identify an error occurring in one of the hardware components (18). Each respective firmware error handler (30) may communicate error information about the identified error to an error manager (32) external of the computer system (14). The error manager (14) may compile the error information communicated from each respective firmware error handler (30).
摘要翻译: 管理数据处理系统(10)中的错误的方法可以包括至少一个计算机系统(14)。 每个计算机系统(14)可以包括多个硬件组件(18),包括用于执行相应操作系统的处理器(20)和用于存储相应操作系统(24)的指令的存储器(22)和固件( 28),其包括固件错误处理程序(30)。 对于每个计算机系统(14),固件错误处理器(30)可以识别在硬件组件(18)之一上发生的错误。 每个相应的固件错误处理器(30)可以将关于所识别的错误的错误信息传送到计算机系统(14)的外部的错误管理器(32)。 错误管理器(14)可以编译从每个相应的固件错误处理器(30)传送的错误信息。
-
公开(公告)号:US08151147B2
公开(公告)日:2012-04-03
申请号:US12640971
申请日:2009-12-17
IPC分类号: G06F11/00
CPC分类号: G06F11/0793 , G06F11/0709
摘要: In accordance with at least some embodiments, a system comprises a plurality of partitions, each partition having its own error handler. The system further comprises a plurality of resources assignable to the plurality of partitions. The system further comprises management logic coupled to the plurality of partitions and the plurality of resources. The management logic comprises an error management tool that synchronizes operation of the error handlers in response to an error.
摘要翻译: 根据至少一些实施例,系统包括多个分区,每个分区具有其自己的错误处理程序。 该系统还包括可分配给多个分区的多个资源。 该系统还包括耦合到多个分区和多个资源的管理逻辑。 管理逻辑包括错误管理工具,该错误管理工具使错误处理程序的响应于错误的操作同步。
-
公开(公告)号:US08108724B2
公开(公告)日:2012-01-31
申请号:US12641072
申请日:2009-12-17
IPC分类号: G06F11/00
CPC分类号: G06F11/079 , G06F11/0727 , G06F11/0748 , G06F11/1428 , G06F11/202
摘要: A system and method for fault management in a computer-based system are disclosed herein. A system includes a plurality of field replaceable units (“FRUs”) and fault management logic. The fault management logic is configured to collect error information from a plurality of components of the system. The logic stores, for each component identified as a possible cause of a detected fault, a record assigning one of two different component failure probability indications. The logic identifies a single of the plurality of FRUs that has failed based on the stored probability indications.
摘要翻译: 本文公开了一种用于基于计算机的系统中的故障管理的系统和方法。 系统包括多个现场可更换单元(“FRU”)和故障管理逻辑。 故障管理逻辑被配置为从系统的多个部件收集错误信息。 对于被识别为检测到的故障的可能原因的每个组件,逻辑存储器分配两个不同组件故障概率指示之一的记录。 该逻辑基于所存储的概率指示来识别已经发生故障的多个FRU中的单个。
-
-
-
-
-
-
-
-
-