摘要:
In executing query distributed processing in a computer system having a plurality of computers, a query and a sliding window are inherited between different computers. If a tuple having a new principal key value is detected when a query is to be executed, a relevant individual query and a relevant sliding window are inherited between computers, and further a sliding window of a relevant common query is inherited between computers.
摘要:
A stream data processing system capable of highly reliable and highly usable real time data processing, is provided with: (1) a mechanism of copying some or all of stream data, and archiving the stream data in a non-volatile memory to allow real time data and archive data to be used seamlessly; and (2) a mechanism of improving the performance of query processings by using a plurality of stream data processing systems cooperatively.
摘要:
A stream data processing system capable of highly reliable and highly usable real time data processing, is provided with: (1) a mechanism of copying some or all of stream data, and archiving the stream data in a non-volatile memory to allow real time data and archive data to be used seamlessly; and (2) a mechanism of improving the performance of query processings by using a plurality of stream data processing systems cooperatively.
摘要:
In order to realize a practical data warehouse system with the use of a network in a distributed environment, for example, data collectors for creating/managing each replica of server data for clients shown in FIG. 1 are connected to a network. A data collector holds a data request condition given from a user via a client and negotiates with a server which supplies data according to such resources as a storage unit, a CPU, etc., usable by the data collector, thereby the range and quality of the data for which a replica is to be created can be adjusted. For query processing, a part or the whole of a query from a client is processed with the use of a replica created in the data collector. Queries which cannot be processed with any replica in the data collector are transferred to the server. Because a replica created in the data collector can be used to process a query from a client in such a way, the server load can be reduced.
摘要:
In order to realize a practical data warehouse system with the use of a network in a distributed environment, for example, data collectors (101 and 102) for creating/managing each replica of server data for clients (103 and 104) as shown in FIG. 1 are connected to a network (113). A data collector holds a data request condition given from a user via a client and negotiates with a server (114, 122) which supplies data according to such resources as a storage unit, a CPU, etc. usable by the data collector, thereby the range and quality of the data for which a replica is to be created can be adjusted. For query processing, a part or the whole of a query from a client is processed with the use of a replica (123) created in the data collector. Queries which cannot be processed with any replica in the data collector are transferred to the server. Because a replica created in the data collector can be used to process a query from a client device such a way, the server load can be reduced.
摘要:
In order to realize high speed process for grouping the records having the identical values of one or more columns of the input list, the input list is canned, a hash value is generated using a hash function in which a value of the column as the non-vacant partial aggregation of the columns for the grouping is used as the argument, and it is determined whether two or more records having the hash values exist or not. The input list is scanned again and the calculating process of the aggregation columns is immediately executed for the records having the hash value for which it is determined by the first scanning that there is only one record having the identical hash value, the result of such calculating process is output and the records are defined as the input of the ordinary grouping process for the records which are determined to exist as the two or more records.
摘要:
A query issue processing method, a query conversion processing method, and a data control processing method are provided for enhancing the efficiency of random sampling processing for use in a database processing system. In query issue processing 2, a query including random sampling processing is issued. In query conversion processing 8, application sequences of random sampling processing and another query processing are exchanged by considering a sampling unit of the random sampling processing. Further, in record control processing 4, random access to a secondary storage device is reduced, thereby enhancing random sampling processing efficiency. Unlike the conventional query conversion processing not considering the sampling unit, the issuance of the query including random sampling processing and performing query conversion by considering the sampling unit allow random sampling to be applied also to a query including aggregation processing, thereby enhancing the efficiency of queries in a wider range. Reduction in the random access to the secondary storage device further enhances that efficiency.
摘要:
A stream data processing system capable of highly reliable and highly usable real time data processing, is provided with: (1) a mechanism of copying some or all of stream data, and archiving the stream data in a non-volatile memory to allow real time data and archive data to be used seamlessly; and (2) a mechanism of improving the performance of query processings by using a plurality of stream data processing systems cooperatively.
摘要:
A stream data processing system capable of highly reliable and highly usable real time data processing, is provided with: (1) a mechanism of copying some or all of stream data, and archiving the stream data in a non-volatile memory to allow real time data and archive data to be used seamlessly; and (2) a mechanism of improving the performance of query processings by using a plurality of stream data processing systems cooperatively.
摘要:
A mechanism for managing ranking information using a sign of a stream tuple generated when stream data is inserted into, or deleted from, a window is provided. A mechanism for generating only the differential information of ranking calculation results, a mechanism for adding ranking information according to a request, an interface for generating and outputting all ranking information from the differential information, a mechanism for generating all ranking calculation results, and an interface for using these mechanisms are provided.