Abstract:
Heterogeneous data source shielding by an integrated development environment (IDE) includes: receiving a product process release request or a product process test request initiated by a user; obtaining a configuration parameter of a product and a data flow model preconfigured for the product, where the data flow model includes a multi-input node that connects multiple input data sources, and a matching relationship between the input data sources and different configuration parameters is configured on the multi-input node; looking up a corresponding input data source according to the configuration parameter of the product; replacing the multi-input node in the data flow model with the corresponding input data source; and using the data flow model obtained after the replacement for the product process release or the product process test.
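The lookup-and-replace step described above can be sketched as follows. This is a minimal illustration only, not the disclosed implementation: the names `MultiInputNode`, `DataFlowModel`, and `resolve_model`, the representation of the model as an ordered list of nodes, and the example source names are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Union


@dataclass
class MultiInputNode:
    """Hypothetical multi-input node: maps a configuration parameter
    value to the name of one of the connected input data sources."""
    name: str
    matching: Dict[str, str]


@dataclass
class DataFlowModel:
    """Hypothetical data flow model: nodes in processing order.
    A node is either a plain node name or a MultiInputNode."""
    nodes: List[Union[str, MultiInputNode]]


def resolve_model(model: DataFlowModel, config_param: str) -> DataFlowModel:
    """Return a new model in which every multi-input node is replaced by
    the input data source matched to the product's configuration parameter."""
    resolved: List[Union[str, MultiInputNode]] = []
    for node in model.nodes:
        if isinstance(node, MultiInputNode):
            if config_param not in node.matching:
                raise KeyError(
                    f"no input data source configured for {config_param!r} "
                    f"on node {node.name!r}"
                )
            # Replace the multi-input node with the matched data source.
            resolved.append(node.matching[config_param])
        else:
            resolved.append(node)
    return DataFlowModel(nodes=resolved)


# Assumed example: one multi-input node in front of two processing steps.
model = DataFlowModel(
    nodes=[
        MultiInputNode(
            name="orders_input",
            matching={"region_a": "mysql_orders", "region_b": "hdfs_orders"},
        ),
        "filter",
        "aggregate",
    ]
)
released = resolve_model(model, "region_b")
print(released.nodes)
```

Returning a new model rather than mutating the preconfigured one keeps the original model reusable for other products with different configuration parameters; that is a design choice of this sketch, not something the abstract specifies.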
Abstract:
Embodiments of the present disclosure disclose a data processing apparatus and method for processing serial tasks, which can reduce, in a data processing process, the number of times an output result is read from disks of node devices in a distributed system, thereby reducing the time required and the network resources occupied for processing a large amount of data. The method in the present disclosure includes: acquiring at least two MapReduce tasks, where the at least two MapReduce tasks are serially arranged according to an execution sequence, and when two MapReduce tasks are serially arranged, an output value obtained after the former MapReduce task is executed is an input value of the latter MapReduce task; combining the at least two MapReduce tasks to obtain a target MapReduce task; and executing the target MapReduce task, and obtaining an output result of the target MapReduce task.
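The idea of combining serially arranged MapReduce tasks can be illustrated with a small in-memory sketch. The abstract does not say how the combination is performed, so the approach below (chaining the tasks inside a single callable so that the intermediate output is passed on directly instead of being written to and read back from disk) is an assumption, as are the helper names `run_task` and `combine_tasks` and the two example tasks.

```python
from collections import defaultdict
from typing import Callable, Iterable, List, Tuple

KV = Tuple[object, object]
MapFn = Callable[[object, object], Iterable[KV]]
ReduceFn = Callable[[object, List[object]], Iterable[KV]]
Task = Tuple[MapFn, ReduceFn]


def run_task(map_fn: MapFn, reduce_fn: ReduceFn, records: Iterable[KV]) -> List[KV]:
    """Run one MapReduce task in memory: map, group by key, reduce."""
    groups = defaultdict(list)
    for key, value in records:
        for out_key, out_value in map_fn(key, value):
            groups[out_key].append(out_value)
    output: List[KV] = []
    for out_key in sorted(groups):  # deterministic key order
        output.extend(reduce_fn(out_key, groups[out_key]))
    return output


def combine_tasks(tasks: List[Task]) -> Callable[[Iterable[KV]], List[KV]]:
    """Combine serially arranged tasks into one target task. The output of
    each task is the input of the next, without an intermediate disk write."""
    def target(records: Iterable[KV]) -> List[KV]:
        current = list(records)
        for map_fn, reduce_fn in tasks:
            current = run_task(map_fn, reduce_fn, current)
        return current
    return target


# Assumed task 1: word count.
def map_words(_, line):
    return [(word, 1) for word in line.split()]

def reduce_sum(word, ones):
    return [(word, sum(ones))]

# Assumed task 2: group words by their count.
def map_invert(word, count):
    return [(count, word)]

def reduce_collect(count, words):
    return [(count, sorted(words))]


target_task = combine_tasks([(map_words, reduce_sum), (map_invert, reduce_collect)])
result = target_task([(0, "a b a"), (1, "b c")])
print(result)  # [(1, ['c']), (2, ['a', 'b'])]
```

In a real distributed system the saving would come from not materializing the first task's output on the node devices' disks; this single-process sketch only shows the data dependency between the tasks and how a combined target task preserves it.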