Optimizing Data Processing Across Server Clusters and Data Centers Using Checkpoint-Based Data Replication

    公开(公告)号:US20180307572A1

    公开(公告)日:2018-10-25

    申请号:US15492122

    申请日:2017-04-20

    IPC分类号: G06F11/20 G06F9/48 G06F17/30

    摘要: Aspects of the disclosure relate to optimizing data processing across server clusters and data centers using checkpoint-based data replication. A computing platform may determine to initiate a data processing job associated with identifying one or more features of a source dataset, and the data processing job may include multiple processing steps. Based on determining to initiate the data processing job, the computing platform may generate one or more commands directing one or more cluster server nodes associated with a data center to execute the multiple processing steps. The one or more commands may direct the one or more cluster server nodes to update a checkpoint table as each processing step is completed, and may further direct the one or more cluster server nodes to replicate processing results data to at least one other data center. Subsequently, the computing platform may send the generated commands to the cluster server nodes.

    Method and system for processing fault of lock server in distributed system

    公开(公告)号:US09952947B2

    公开(公告)日:2018-04-24

    申请号:US15592217

    申请日:2017-05-11

    IPC分类号: G06F11/07 G06F11/20 H04L29/08

    摘要: A method for processing a fault of a lock server in a distributed system is disclosed, where the distributed system includes m lock servers, which locally store same lock server takeover relationship information. Lock servers in the distributed system that are not faulty receive a notification message, which carries information about a fault of a first lock server; after receiving the notification message, a second lock server determines that it is a takeover lock server of the first lock server according to the lock server takeover relationship information, and the takeover lock server enters a silent state; after receiving the notification message, a third lock server in the distributed system determines that it is not the takeover lock server of the first lock server according to the lock server takeover relationship information. After receiving a locking request, the third lock server allocates lock permission information according to the locking request.

    DISTRIBUTED BASEBOARD MANAGEMENT CONTROLLER FOR MULTIPLE DEVICES ON SERVER BOARDS

    公开(公告)号:US20180039552A1

    公开(公告)日:2018-02-08

    申请号:US15229772

    申请日:2016-08-05

    IPC分类号: G06F11/20 G06F3/06

    摘要: A server board includes first and second devices. A first service processor of the first device operates as a master baseboard management controller of the server board, and monitors a communication channel for alive messages from a plurality service processors. A second service processor operates as a secondary baseboard management controller, and sets a second timer to a first value. In response to a determination that the second timer has expired based on a first value: the second service processor to start a switchover process, and to set the second timer to a second value based on an alive message period. In response to a primary alive message not being received from the first service processor prior to the second timer expiring based on the second value, the second service processor to reset first service processor and to operate as the master baseboard management controller.