Abstract:
Disclosed are service providing method and device, including: collecting execution state information about a plurality of tasks that constitute at least one service, and are dynamically distributed and arranged over a plurality of nodes; and performing scheduling based on the collected execution state information about the plurality of tasks, wherein each of the plurality of tasks has at least one input source and output source, and a unit of data to be processed for each input source and a data processing operation are defined by a user, and the scheduling is to delete at least a portion of data input into at least one task or to process the at least a portion of input data in at least one duplicate task by referring to the defined unit of data. In particular, the present invention may effectively provide a service of analyzing and processing large stream data in semi-real time.
Abstract:
Disclosed is a data processing method and device according to the present invention, including: calculating a maximum allowable delay time of each of a plurality of operators that constitute a plurality of services and are executed according to a data processing request of a user; classifying the plurality of operators based on the calculated maximum allowable delay time; calculating a quality of service (QoS) satisfaction allowable level with respect to each of the plurality of operators, wherein the allowable level for QoS satisfaction indicates the number of times that there is no need to satisfy one goal of QoS with the premise that there is no problem in satisfying QoS satisfaction requested by the user; setting execution orders of the plurality of classified operators by reflecting the calculated allowable level for QoS satisfaction; and executing the plurality of operators based on the set execution orders.
Abstract:
Provided are a cluster data management system and a method for data restoration using a shared redo log in the cluster data management system. The data restoration method includes collecting service information of a partition served by a failed partition server, dividing redo log files written by the partition server by columns of a table including the partition, restoring data of the partition on the basis of the collected service information and log records of the divided redo log files, and selecting a new partition server that will serve the data-restored partition, and allocating the partition to the selected partition server.