Fail-over control in a computer system having redundant service processors
    2.
    Granted invention patent (in force)

    Publication No.: US06931568B2

    Publication date: 2005-08-16

    Application No.: US10113167

    Filing date: 2002-03-29

    IPC classification: H04L1/22 G06F11/00

    Abstract: A system and method for determining an active service processor from two or more redundant service processors in the system. The system typically includes two management modules and at least one managed subsystem such as a server blade. Each management module includes a service processor and control logic. The control logic is configured to receive various status signals from the service processor and to generate a control signal based thereon. The control signal is provided, via an interconnect plane, to determination logic on each managed subsystem. The determination logic receives a control signal from each management module and generates a switch signal based on the state of the control signals. The switch signal controls switching logic configured to receive bus signals from the service processors on each management module. Based on the control signal, one of the service processor bus signals is provided to managed instrumentation on the managed subsystem. The management module control logic is generally configured to maintain the control signal in its current state if the active processor is determined to be functional. The control logic is further configured to alter the control signal state if the active service processor is determined to be non-functional. A transition in the control signal typically generates a fail-over event that causes the switching logic on the managed subsystems to switch from the previously active service processor to the previously inactive or standby service processor as the source of service processor signals.
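
The fail-over behavior described in this abstract can be illustrated with a small Python sketch. It is not taken from the patent: the class name `ManagementModule`, the function `switch_signal`, and the choice of XOR to combine the two control signals are assumptions made purely for illustration. The abstract only says that the switch signal is based on the state of the control signals and that a transition causes a fail-over.

```python
from dataclasses import dataclass


@dataclass
class ManagementModule:
    """Hypothetical model of one management module's control logic."""
    control: bool = False  # current state of this module's control signal

    def update(self, is_active: bool, healthy: bool) -> None:
        # Hold the control signal while the active service processor is
        # functional; toggle it only when the active one is non-functional.
        if is_active and not healthy:
            self.control = not self.control


def switch_signal(control_a: bool, control_b: bool) -> int:
    """Assumed determination logic: XOR of the two control signals.

    Returns 0 to select module A's service processor bus, 1 for module B's.
    A transition on either control signal flips the selection.
    """
    return int(control_a ^ control_b)


if __name__ == "__main__":
    a, b = ManagementModule(), ManagementModule()
    print(switch_signal(a.control, b.control))  # module A selected
    a.update(is_active=True, healthy=False)     # A's service processor fails
    print(switch_signal(a.control, b.control))  # selection moves to module B
```

With XOR, either module can trigger a switch by changing only its own signal, which matches the abstract's description of a transition generating the fail-over event; other combining functions would be equally consistent with the text.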

    Remotely booting devices in a dense server environment without manually installing authentication parameters on the devices to be booted
    4.
    Granted invention patent (in force)

    Publication No.: US07194619B2

    Publication date: 2007-03-20

    Application No.: US10109230

    Filing date: 2002-03-26

    IPC classification: G06F15/177 H04L9/00

    CPC classification: G06F21/305 G06F21/575

    Abstract: A method, system and computer program product for remotely booting devices. A deployment server may remotely transmit authentication parameter(s), e.g., public key, secret key, to a service unit configured to establish a private connection between server blades and the deployment server. The service unit may remotely install the authentication parameter(s) onto the server blade(s) to be booted by either the deployment server or another boot server. Because the service unit remotely installs the authentication parameter(s) onto the server blade(s), the need to manually install them during each network boot operation may be alleviated. By remotely transmitting authentication parameter(s) instead of manually installing them on the devices to be booted during each network boot operation, the deployment server may be able to generate unique authentication parameter(s), e.g., public/private key pair, secret key, for each network boot operation, thereby substantially reducing the exposure to replay attacks.
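
To show why a fresh authentication parameter per boot reduces replay exposure, here is a minimal Python sketch. The abstract does not say how the key is used; authenticating the boot request with an HMAC over a nonce, and the names `DeploymentServer`, `issue_key`, `verify_boot_request`, and `sign_boot_request`, are assumptions for illustration only.

```python
import hashlib
import hmac
import secrets


class DeploymentServer:
    """Hypothetical deployment server issuing one secret key per boot."""

    def __init__(self) -> None:
        self.keys: dict[str, bytes] = {}  # blade id -> pending one-time key

    def issue_key(self, blade_id: str) -> bytes:
        # In the described system the key would travel to the service unit,
        # which installs it on the blade over the private connection.
        key = secrets.token_bytes(32)
        self.keys[blade_id] = key
        return key

    def verify_boot_request(self, blade_id: str, nonce: bytes, tag: bytes) -> bool:
        # The key is consumed on first use, so a captured request
        # cannot be replayed for a later boot.
        key = self.keys.pop(blade_id, None)
        if key is None:
            return False
        expected = hmac.new(key, nonce, hashlib.sha256).digest()
        return hmac.compare_digest(expected, tag)


def sign_boot_request(key: bytes, nonce: bytes) -> bytes:
    """Blade side: authenticate a boot request with the installed key."""
    return hmac.new(key, nonce, hashlib.sha256).digest()
```

A second call to `verify_boot_request` with the same nonce and tag returns `False`, because the key for that boot operation has already been discarded.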

    Logging insertion/removal of server blades in a data processing system
    5.
    Granted invention patent (in force)

    Publication No.: US06883125B2

    Publication date: 2005-04-19

    Application No.: US10057643

    Filing date: 2002-01-24

    IPC classification: G06F21/86 G06F11/00

    CPC classification: G06F21/86 G06F2221/2101

    Abstract: The invention comprises a data processing system containing at least one main processor connected to a system bus, a system memory connected to the system bus and accessible to each of the main processors, a tamper mechanism, and a local service processor. The tamper mechanism is configured to change state each time the system is inserted into a slot in a rack enclosure. The local service processor is connected to the tamper mechanism and configured to update an insertion log upon detecting a change in state of the tamper mechanism. The insertion log provides a count and a history of rack insertions to which the system has been subjected. The system may include a non-volatile storage element that contains the insertion log and is updated exclusively by the local service processor. The insertion log may include an insertion counter. In this embodiment, the local service processor is configured to increment the insertion counter upon each insertion. The local service processor may be further configured to issue an alert if the insertion counter exceeds a predetermined value. In one embodiment, the system further includes a battery-backed real-time clock connected to the local service processor. The local service processor is configured to include real-time information corresponding to each insertion event in the insertion log. Each entry in the insertion log may include the identity of the rack enclosure and the geographical address of the slot of the corresponding insertion event. The local service processor may be configured to detect the tamper mechanism state and update the insertion log following a power event such that the insertion log update is independent of configuring the data processing system with a boot image.

    System and method for prevention of boot storms in a computer network
    6.
    Granted invention patent (in force)

    Publication No.: US07415519B2

    Publication date: 2008-08-19

    Application No.: US10184202

    Filing date: 2002-06-28

    IPC classification: G06F15/173

    CPC classification: G06F9/4405 H04L67/34

    Abstract: This invention is useful in a networked system with densely packaged servers or server blades. The servers are connected to a system management network, a communication network and an image server. A management module attached to the system management network and a network switch monitors and controls network booting from an image server on the communication network to prevent overcommitment of network and image server resources and thereby avoid a boot storm. The management module collects system information and calculates the number of servers or clients the networked system can boot at any one time without overburdening the system. The management module logic controls booting via the system management network and service processor elements, which can block servers from booting and release them to boot when other servers have completed their boot process.

    Adaptive startup policy for accelerating multi-disk array spin-up
    8.
    Granted invention patent (in force)

    Publication No.: US06966006B2

    Publication date: 2005-11-15

    Application No.: US10142142

    Filing date: 2002-05-09

    IPC classification: G06F3/06 G11B19/20 G06F1/26

    Abstract: A method and system for adaptively implementing a disk drive startup sequence for a disk drive array. Prior to a next disk drive spin-up sequence, a currently available power supply resource capacity and a startup metric of each of the array disk drives are determined. Each of the disk drives is scheduled into a designated startup group as a function of both the determined currently available power supply resource capacity and the determined startup metric. The scheduling of disk drives into designated startup groups includes determining an activation sequence timing schedule for each of the disk drives. The activation sequence timing schedule determines the relative times at which spindle motors for each of said plurality of disk drives will be activated as a function of the determined startup metric for each of the disk drives and the available power supply resource capacity as reduced by the steady state power requirements of each of the startup groups.
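
One way to picture the grouping step is a greedy packing of drives into startup groups under a power budget. The Python sketch below is an assumption-laden illustration, not the patented method: it treats each drive's startup metric as a peak spin-up wattage, uses a simple first-fit order, and returns only the group order rather than a full timing schedule. The function name `schedule_spinup` is hypothetical.

```python
def schedule_spinup(drives, supply_watts):
    """Group drives so each group's spin-up peak fits the remaining budget.

    drives: iterable of (name, peak_watts, steady_watts) tuples.
    The budget available to a group is the supply capacity reduced by the
    steady-state draw of all drives in groups that have already started.
    """
    groups, current = [], []
    steady_load = 0.0   # steady-state draw of already-started groups
    group_peak = 0.0    # summed spin-up peak of the group being filled
    group_steady = 0.0  # summed steady-state draw of that group

    for name, peak, steady in drives:
        if current and steady_load + group_peak + peak > supply_watts:
            # Close the current group; its drives now draw steady-state power.
            groups.append(current)
            steady_load += group_steady
            current, group_peak, group_steady = [], 0.0, 0.0
        if steady_load + peak > supply_watts:
            raise ValueError(f"not enough power to spin up {name}")
        current.append(name)
        group_peak += peak
        group_steady += steady

    if current:
        groups.append(current)
    return groups
```

Later groups tend to be smaller because the steady-state load of earlier groups eats into the headroom, which is the effect the final clause of the abstract describes.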

    Remotely controlled boot settings in a server blade environment
    9.
    Granted invention patent (in force)

    Publication No.: US07013385B2

    Publication date: 2006-03-14

    Application No.: US10064012

    Filing date: 2002-06-04

    IPC classification: G06F9/445

    CPC classification: G06F9/4405 G06F15/177

    Abstract: A system for storing and configuring CMOS setting information remotely in a server blade environment includes a management module that includes persistent storage containing a table of CMOS setting information for each server blade. Each server blade includes boot block software that executes when the blade is booted. The boot block software initiates communication with the management module and retrieves its CMOS settings from the table. Thus, CMOS settings for a particular blade location remain unchanged each time a blade is replaced or upgraded. The management module and server blades may implement a programming interface that includes command abstractions for each CMOS setting. The management module sends command abstractions to each server blade during the CMOS configuration process. The server blade interprets the commands and maps the commands to specific CMOS bit addresses, thereby making the specific CMOS implementation employed by any server blade transparent to the management module.
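
The split between abstract commands and blade-specific bit addresses can be sketched as two lookup tables. Everything concrete in the Python example below is invented for illustration: the setting names, byte offsets, bit masks, and the names `SETTINGS_BY_SLOT`, `BLADE_CMOS_MAP`, and `apply_settings` do not come from the patent or from any real CMOS layout.

```python
# Hypothetical management-module table: slot number -> abstract settings.
SETTINGS_BY_SLOT = {
    1: {"boot_device": "network", "num_lock": "on"},
}

# Hypothetical blade-specific map: setting -> (byte offset, bit mask, encoding).
# A different blade model would ship a different map for the same commands.
BLADE_CMOS_MAP = {
    "boot_device": (0x10, 0b0000_0011, {"disk": 0b00, "network": 0b01, "cdrom": 0b10}),
    "num_lock": (0x11, 0b0000_0100, {"off": 0b000, "on": 0b100}),
}


def apply_settings(cmos: bytearray, settings: dict, cmos_map: dict) -> bytearray:
    """Blade side: translate abstract commands into CMOS bit writes."""
    for name, value in settings.items():
        offset, mask, encoding = cmos_map[name]
        # Clear the field's bits, then set the encoded value.
        cmos[offset] = (cmos[offset] & ~mask & 0xFF) | encoding[value]
    return cmos


if __name__ == "__main__":
    cmos = bytearray(32)  # stand-in for the blade's CMOS memory
    apply_settings(cmos, SETTINGS_BY_SLOT[1], BLADE_CMOS_MAP)
    print(hex(cmos[0x10]), hex(cmos[0x11]))
```

Because the table is keyed by slot, a replacement blade in slot 1 would receive the same abstract settings and apply them through its own map.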

    Monitoring insertion/removal of server blades in a data processing system
    10.
    Granted invention patent (in force)

    Publication No.: US06968414B2

    Publication date: 2005-11-22

    Application No.: US10006976

    Filing date: 2001-12-04

    IPC classification: G06F13/40 G06F13/00

    CPC classification: G06F13/409

    Abstract: A system for monitoring server blades includes a cabinet having a plurality of racks configured to receive a server blade. The server blade includes one or more microprocessors, memory, and a service processor all interconnected via one or more busses. The system includes a management blade to monitor the local service processors. Upon installation, a new blade identifies itself to the management blade by its physical slot position within the cabinet. In response to a reset, the local blade service processor determines from a tamper latch whether the blade has been removed from the chassis since the last power-on event. If the tamper latch is broken, the local service processor informs the management blade and resets the tamper latch. The local service processor of each blade may send a periodic heartbeat to the management blade, which monitors for loss of the heartbeat signal to determine when a blade is removed.
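
The heartbeat monitoring at the end of this abstract can be sketched as a timestamp table with a timeout. The Python example below is a minimal illustration; the class name `HeartbeatMonitor`, the timeout value, and the use of explicit timestamps passed by the caller are assumptions, not details from the patent.

```python
class HeartbeatMonitor:
    """Hypothetical management-blade logic tracking blade heartbeats."""

    def __init__(self, timeout_s: float) -> None:
        self.timeout_s = timeout_s
        self.last_seen: dict[int, float] = {}  # slot position -> last heartbeat

    def heartbeat(self, slot: int, now: float) -> None:
        """Record a heartbeat from the blade in the given slot."""
        self.last_seen[slot] = now

    def missing(self, now: float) -> list[int]:
        """Return slots whose heartbeat has been silent past the timeout."""
        return sorted(
            slot for slot, seen in self.last_seen.items()
            if now - seen > self.timeout_s
        )
```

Blades are keyed by slot position because, per the abstract, a blade identifies itself to the management blade by its physical slot. A slot reported by `missing` would be treated as a candidate removal.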
