DATA PROCESSING METHOD, DISTRIBUTED DATA PROCESSING SYSTEM AND STORAGE MEDIUM

    公开(公告)号:US20190042317A1

    公开(公告)日:2019-02-07

    申请号:US16082553

    申请日:2017-02-16

    Applicant: HITACHI, LTD.

    Abstract: A data processing method of a distributed data processing system, in which the base server collects and standardizes data and generates base data by node cut processing, the central server collects the attribute information of the column of the base data from a plurality of base servers and the relationship between the integration source and the integration destination of the base data by the node cut processing of the base server as base column integrated information, a combination of an integration source and an integration destination capable of reducing the data amount as a result of an replacement for calculating a combination of an integration source and an integration destination capable of reducing the data amount by exchanging the integration source and the integration destination when data is combined, the combination is notified to the base server as an exchange instruction.

    COMPUTER SYSTEM AND METHOD FOR SETTING A STREAM DATA PROCESSING SYSTEM

    公开(公告)号:US20190391981A1

    公开(公告)日:2019-12-26

    申请号:US15752338

    申请日:2016-03-28

    Applicant: Hitachi, Ltd.

    Abstract: A computer system has a plurality of computers each executing stream data processing and a management computer assigning a plurality of divided queries to the plurality of computers. The management computer includes a parameter input module configured to receive an analysis scenario including a plurality of queries and division information for dividing the analysis scenario into the plurality of divided queries; and a query analysis module configured to analyze the analysis scenario to generate a query graph. The query analysis module specifies, based on the analysis scenario and the division information, at least one of the divided queries that requires flow control; and sets the flow control for the plurality of computers to each of which the at least one of the divided queries that requires the flow control is to be assigned.

Patent Agency Ranking