摘要:
A data analyzing method for generating a rule based on data items in a data base, wherein the rule expresses relational features of the data items. The invention includes a user interface and a rule generation module. The rule generation module, in response to an input from the user via the user interface, selects data items for use in a conditional clause and a conclusion clause of a rule from the data items stored in the data base, converts, when the selected data items have numerical values, the numerical values into symbolic values and creates plural candidate rules each expressing a correlation between selected data items in a rule form having one or plural sets of item names and symbolic values. The rule generation module further calculates a criterion for evaluating strength of correlation between data items in each of the candidate rules, determines one or plural candidate rules having highest calculated criterion from the candidate rules, and outputs to the user via the user interface the one or plural candidate rules.
摘要:
A data analyzing method and system for generating a rule based on data items in a data base, wherein the rule expresses relational features of the data items. The invention includes a user interface and a rule generation module. The rule generation module, in response to an input from the user via the user interface, selects data items for use in a conditional clause and a conclusion clause of a rule from the data items stored in the data base, converts, when the selected data items have numerical values, the numerical values into symbolic values and creates plural candidate rules each expressing a correlation between selected data items in a rule form having one or plural sets of item names and symbolic values. The rule generation module further calculates a criterion for evaluating strength of correlation between data items in each of the candidate rules, determines one or plural candidate rules having highest calculated criterion from the candidate rules, and outputs to the user via the user interface the one or plural candidate rules.
摘要:
A data analyzing method and system for generating a rule based on data items in a data base, wherein the rule expresses relational features of the data items. The invention includes a user interface and a rule generation module. The rule generation module, in response to an input from the user via the user interface, selects data items for use in a conditional clause and a conclusion clause of a rule from the data items stored in the data base, converts, when the selected data items have numerical values, the numerical values into symbolic values and creates plural candidate rules each expressing a correlation between selected data items in a rule form having one or plural sets of item names and symbolic values. The rule generation module further calculates a criterion for evaluating strength of correlation between data items in each of the candidate rules, determines one or plural candidate rules having highest calculated criterion from the candidate rules, and outputs to the user via the user interface the one or plural candidate rules.
摘要:
A rule generation apparatus includes a label presenter in which when producing from training data including a set of specific values related to input and output variables rules representing input/output relationships between the input and output variables, numeric data of the training data is converted into categorical data expressed by symbols to generate an instance table, an RI device for extracting rules from the instance table, and a rule converter for converting the extracted rules into fuzzy rules. When the training data is divided to be distributively stored in a plurality of server processors, label assignment is conducted by each server processor such that a client processor later combines instance tables with each other to achieve rule induction and conversion.
摘要:
A query issue processing method, a query conversion processing method, and a data control processing method are provided for enhancing the efficiency of random sampling processing for use in a database processing system. In query issue processing 2, a query including random sampling processing is issued. In query conversion processing 8, application sequences of random sampling processing and another query processing are exchanged by considering a sampling unit of the random sampling processing. Further, in record control processing 4, random access to a secondary storage device is reduced, thereby enhancing random sampling processing efficiency. Unlike the conventional query conversion processing not considering the sampling unit, the issuance of the query including random sampling processing and performing query conversion by considering the sampling unit allow random sampling to be applied also to a query including aggregation processing, thereby enhancing the efficiency of queries in a wider range. Reduction in the random access to the secondary storage device further enhances that efficiency.