-
公开(公告)号:US20190042317A1
公开(公告)日:2019-02-07
申请号:US16082553
申请日:2017-02-16
Applicant: HITACHI, LTD.
Inventor: Yoshihisa IDA , Saori MITSUNAGA , Tsukasa HOSOYA , Atsuyoshi MORISHIMA , Yoshiki AOYAMA
Abstract: A data processing method of a distributed data processing system, in which the base server collects and standardizes data and generates base data by node cut processing, the central server collects the attribute information of the column of the base data from a plurality of base servers and the relationship between the integration source and the integration destination of the base data by the node cut processing of the base server as base column integrated information, a combination of an integration source and an integration destination capable of reducing the data amount as a result of an replacement for calculating a combination of an integration source and an integration destination capable of reducing the data amount by exchanging the integration source and the integration destination when data is combined, the combination is notified to the base server as an exchange instruction.
-
公开(公告)号:US20190391981A1
公开(公告)日:2019-12-26
申请号:US15752338
申请日:2016-03-28
Applicant: Hitachi, Ltd.
Inventor: Tsukasa HOSOYA , Satoru ANAN
IPC: G06F16/2455 , G06F16/901 , G06F16/2458
Abstract: A computer system has a plurality of computers each executing stream data processing and a management computer assigning a plurality of divided queries to the plurality of computers. The management computer includes a parameter input module configured to receive an analysis scenario including a plurality of queries and division information for dividing the analysis scenario into the plurality of divided queries; and a query analysis module configured to analyze the analysis scenario to generate a query graph. The query analysis module specifies, based on the analysis scenario and the division information, at least one of the divided queries that requires flow control; and sets the flow control for the plurality of computers to each of which the at least one of the divided queries that requires the flow control is to be assigned.
-