-
Publication No.: US08990646B2
Publication date: 2015-03-24
Application No.: US13485824
Filing date: 2012-05-31
CPC classification: G11C29/44, G06F11/2205, G06F11/27, G11C5/04, G11C29/028, G11C29/08, G11C29/56008, G11C29/76, G11C2029/0401, G11C2029/0409
Abstract: An error test routine tests for a type of memory error by changing a content of a memory module. A memory handling procedure isolates the memory error in response to a positive outcome of the error test routine. The error test routine and memory handling procedure are to be performed at runtime transparent to an operating system. Information corresponding to isolating the memory error is stored.
-
Publication No.: US20130326293A1
Publication date: 2013-12-05
Application No.: US13485824
Filing date: 2012-05-31
CPC classification: G11C29/44, G06F11/2205, G06F11/27, G11C5/04, G11C29/028, G11C29/08, G11C29/56008, G11C29/76, G11C2029/0401, G11C2029/0409
Abstract: An error test routine is to test for a type of memory error by changing a content of a memory module. A memory handling procedure is to isolate the memory error in response to a positive outcome of the error test routine. The error test routine and memory handling procedure are to be performed at runtime transparent to an operating system. Information corresponding to isolating the memory error is stored.
-
Publication No.: US08122291B2
Publication date: 2012-02-21
Application No.: US12691512
Filing date: 2010-01-21
IPC classification: G06F11/00
CPC classification: G06F11/0766, G06F11/0721, G06F11/0793
Abstract: Method and system of error logging. At least some of the illustrative embodiments are methods including: detecting, by a reset circuit, assertion of an error pin by a processor system (the processor system comprising at least a main processor and a chipset, and the assertion of the error pin being an indication to reboot the processor system); notifying, by the reset circuit, a management processor distinct from the main processor that the error pin is asserted; writing, by the management processor, to a plurality of registers in the chipset; de-asserting a reset pin of the main processor; and then executing, by the main processor, error-handling code to generate an error log.
-
Publication No.: US08713350B2
Publication date: 2014-04-29
Application No.: US12633648
Filing date: 2009-12-08
IPC classification: G06F11/00
CPC classification: G06F11/0793, G06F11/0712, G06F11/0766
Abstract: A method of managing errors in a data processing system may involve at least one computer system. Each computer system may include a processor that executes an operating system, firmware, and system memory storing instructions for the operating system. A firmware error handler resident in the firmware may identify an error occurring in the computer system. The firmware error handler may determine whether the operating system is required to take an action in response to the error. If the operating system is not required to take an action in response to the error, the firmware error handler may create an error log, accessible to the operating system, that is appropriate to cause the operating system to take no action.
-
Publication No.: US20110179314A1
Publication date: 2011-07-21
Application No.: US12691512
Filing date: 2010-01-21
IPC classification: G06F11/07
CPC classification: G06F11/0766, G06F11/0721, G06F11/0793
Abstract: Method and system of error logging. At least some of the illustrative embodiments are methods including: detecting, by a reset circuit, assertion of an error pin by a processor system (the processor system comprising at least a main processor and a chipset, and the assertion of the error pin being an indication to reboot the processor system); notifying, by the reset circuit, a management processor distinct from the main processor that the error pin is asserted; writing, by the management processor, to a plurality of registers in the chipset; de-asserting a reset pin of the main processor; and then executing, by the main processor, error-handling code to generate an error log.
-
Publication No.: US07103639B2
Publication date: 2006-09-05
Application No.: US09730221
Filing date: 2000-12-05
Applicant: Andrew C. Walton, Guy L. Kuntz
Inventor: Andrew C. Walton, Guy L. Kuntz
IPC classification: G06F15/167
CPC classification: G06F15/177, G06F9/4405, G06F11/1417, G06F11/1482, G06F11/202
Abstract: The present invention flexibly manages the formation of a partition from a plurality of independently executing cells (discrete hardware entities comprising system resources) in preparation for the instantiation of an operating system instance upon the partition. Specifically, the invention manages configuration activities that occur to transition from having individual cells acting independently, and having cells rendezvous, to having cells become interdependent to continue operations as a partition. The invention manages the partition-forming process such that no single point of failure disrupts the process. Instead, the invention is implemented as a distributed application wherein individual cells independently execute instructions based upon respective copies of the complex profile (a “map” of the complex configuration). Also, the invention adapts to a degree of delay associated with certain cells becoming ready to join the formation or rendezvous process. The invention is able to cope with missing, unavailable, or otherwise malfunctioning cells. Additionally, the invention analyzes present cells to determine their compatibility and rejects cells that are not compatible.
-
Publication No.: US09645857B2
Publication date: 2017-05-09
Application No.: US12641001
Filing date: 2009-12-17
CPC classification: G06F9/5061, G06F9/22, G06F9/44
Abstract: In accordance with at least some embodiments, a system includes a plurality of partitions, each partition having its own operating system (OS) and workload. The system also includes a plurality of resources assignable to the plurality of partitions. The system also includes management logic coupled to the plurality of partitions and the plurality of resources. The management logic is configured to set priority rules for each of the plurality of partitions based on user input. The management logic performs automated resource fault management for the resources assigned to the plurality of partitions based on the priority rules.
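One way to picture priority-driven resource fault management is the sketch below. The abstract does not say what the priority rules contain or how a failed resource is replaced, so the policy here is an assumption: use a spare if one exists, otherwise take a resource from the lowest-priority partition that ranks below the affected one. The function `handle_resource_fault`, its arguments, and the partition and resource names are all hypothetical.

```python
from typing import Dict, List, Optional


def handle_resource_fault(assignments: Dict[str, List[str]],
                          priorities: Dict[str, int],
                          failed: str,
                          spares: List[str]) -> Optional[str]:
    """Replace a failed resource for its owning partition.

    assignments maps partition name -> resource ids; priorities maps
    partition name -> user-set priority (higher means more important).
    Returns the replacement resource id, or None if none could be found.
    """
    owner = next((p for p, res in assignments.items() if failed in res), None)
    if owner is None:
        return None  # the failed resource was not assigned to any partition
    assignments[owner].remove(failed)

    if spares:
        replacement = spares.pop()
    else:
        # Assumed policy: borrow from the lowest-priority partition that
        # ranks strictly below the owner and still has resources.
        donors = [p for p in assignments
                  if priorities[p] < priorities[owner] and assignments[p]]
        if not donors:
            return None
        donor = min(donors, key=lambda p: priorities[p])
        replacement = assignments[donor].pop()

    assignments[owner].append(replacement)
    return replacement


assignments = {"db": ["cpu0", "cpu1"], "batch": ["cpu2", "cpu3"]}
priorities = {"db": 10, "batch": 1}
print(handle_resource_fault(assignments, priorities, "cpu1", []))
print(assignments)
```

With no spares available, the higher-priority `db` partition receives a resource from `batch`, which is the kind of trade-off a user-set priority rule could encode.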
-
Publication No.: US08161324B2
Publication date: 2012-04-17
Application No.: US12641091
Filing date: 2009-12-17
Applicant: Howard Calkin, Andrew C. Walton
Inventor: Howard Calkin, Andrew C. Walton
IPC classification: G06F11/00
CPC classification: G06F11/0751, G06F11/0727
Abstract: A system and method for recording fault information in an electronic system are disclosed herein. A system includes fault analysis logic and a plurality of field replaceable units (“FRUs”). The fault analysis logic is configured to analyze system error information, and identify at least one of the FRUs in the system to be a possible cause of a detected fault based on the analysis. Each FRU includes writeable non-volatile storage including storage locations reserved to store information including a result of the analysis. The result of the analysis indicates a reason that the FRU storing the information was determined, by the fault analysis logic, to be a possible cause of the fault.
-
Publication No.: US08122290B2
Publication date: 2012-02-21
Application No.: US12641103
Filing date: 2009-12-17
IPC classification: G06F11/00
CPC classification: G06F11/0766, G06F11/0712, G06F11/0724, G06F11/079
Abstract: A system for error log consolidation is disclosed herein. A server computer includes a plurality of system processors and error log consolidation logic. The system processors are configurable to form isolated execution partitions. The error log consolidation logic is configured to, based on detection of a fault in the server, retrieve error logs from the system processors, and to consolidate the retrieved logs with server computer information not available to the system processors to generate a consolidated error log. The consolidated error log includes a comprehensive set of server information relevant to identifying a cause of the detected fault.
-
Publication No.: US20110154097A1
Publication date: 2011-06-23
Application No.: US12641072
Filing date: 2009-12-17
CPC classification: G06F11/079, G06F11/0727, G06F11/0748, G06F11/1428, G06F11/202
Abstract: A system and method for fault management in a computer-based system are disclosed herein. A system includes a plurality of field replaceable units (“FRUs”) and fault management logic. The fault management logic is configured to collect error information from a plurality of components of the system. The logic stores, for each component identified as a possible cause of a detected fault, a record assigning one of two different component failure probability indications. The logic identifies a single one of the plurality of FRUs that has failed based on the stored probability indications.
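The idea of narrowing a fault down to a single FRU from two-level probability records can be sketched as follows. The abstract does not name the two indications or give a selection rule, so the `"high"` and `"low"` labels, the record layout, and the ranking used by the hypothetical `identify_failed_fru` function are assumptions for illustration only.

```python
from collections import Counter
from typing import Iterable, Optional, Tuple


def identify_failed_fru(records: Iterable[Tuple[str, str]]) -> Optional[str]:
    """Pick one FRU from (fru_id, indication) records.

    Each record carries one of two assumed indications, "high" or "low".
    Assumed rule: prefer the FRU with the most "high" records, break ties
    by the number of "low" records, then by FRU id for determinism.
    """
    records = list(records)
    high = Counter(fru for fru, indication in records if indication == "high")
    low = Counter(fru for fru, indication in records if indication == "low")
    frus = set(high) | set(low)
    if not frus:
        return None  # nothing was implicated
    # sorted() fixes the iteration order so that ties resolve predictably.
    return max(sorted(frus), key=lambda fru: (high[fru], low[fru]))


records = [("dimm3", "high"), ("cpu0", "low"), ("dimm3", "low"), ("cpu0", "low")]
print(identify_failed_fru(records))
```

Here `dimm3` is selected because it is the only FRU with a `"high"` record, even though `cpu0` has more records overall.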