摘要:
In a networked computer system that includes multiple nodes and multiple networks interconnecting the nodes, a database query optimizer takes network choice into consideration when optimizing a query. The query optimizer logs historical information for queries that includes network information. A network monitor determines network characteristics of the multiple networks. The query optimizer then selects a network for a query based on the network characteristics, and optionally based on the historical information as well. The result is a query optimizer that takes network choice into consideration, thereby providing a query that is optimized according to the characteristics of the networks.
摘要:
Methods, systems, and products are disclosed for executing a distributed JAVA application on a plurality of compute nodes. The JAVA application includes a plurality of jobs distributed among the plurality of compute nodes. The plurality of compute nodes are connected together for data communications through a data communication network. Each of the plurality of compute nodes has installed upon it a JAVA Virtual Machine (‘JVM’) capable of supporting at least one job of the JAVA application. Executing a distributed JAVA application on a plurality of compute nodes includes: tracking, by an application manager, JVM environment variables for the JVMs installed on the plurality of compute nodes; and configuring, by the application manager, the plurality of jobs for execution on the plurality of compute nodes in dependence upon the JVM environment variables for the JVMs installed on the plurality of compute nodes.
摘要:
The present invention generally relates to data processing and, more particularly, to managing operation of user-defined functions on a partitioned database. One embodiment provides a computer-implemented method for managing execution of a query against a partitioned database having a plurality of data partitions. The method comprises, for each of the plurality of data partitions, examining data of the data partition, and on the basis of examined data, implementing a modified form of a user-defined function on the data partition. The method further comprises executing the query referring to the user-defined function against the plurality of data partitions, wherein execution of the query on each data partition invokes execution of the respective modified form of the user-defined function against the data of the respective data partition.
摘要:
Techniques are described for estimating and managing energy consumption for query processing. Embodiments of the invention may generally receive a query to be executed and calculate an initial estimated energy consumption value for the received query. If the initial estimated energy consumption value does not exceed a threshold amount of energy, embodiments of the invention may submit the query for execution. Once execution of the query has begun, embodiments of the invention may calculate an updated estimated energy consumption value for the executing query, and if the updated value exceeds the threshold amount of energy, may halt the execution of the query.
摘要:
Methods, systems, and products are disclosed for optimizing just-in-time (‘JIT’) compiling for a application executing on a compute node, the compute node having installed upon it a Virtual Machine (‘VM’) capable of supporting the application, that include: identifying, by an application manager, a particular portion of the application; assigning, by the application manager, a JIT level to the particular portion of the application; and jitting, by the VM installed on the compute node, the particular portion of the application in dependence upon the JIT level assigned to that particular portion of the ava application.
摘要:
Techniques are described for managing query execution by estimating and monitoring query execution time. Embodiments of the invention may generally receive a query to be executed and calculate an initial estimated execution time for the received query. If the initial estimated execution time does not exceed a threshold amount of time, embodiments of the invention may submit the query for execution. Once execution of the query has begun, embodiments of the invention may calculate an updated estimated execution time for the executing query, and if the updated estimated execution time exceeds the threshold amount of time, may halt the execution of the query.
摘要:
Techniques are disclosed for processing data streams containing data tuples, where some of the data tuples are load shed. Load shedding refers to a process of selectively discarding some data tuples, e.g., when the amount of received data becomes too overwhelming to manage. When operations are invoked that depend on the values (or count) of the load shed tuples, the value (or counts) of the load shed tuples may be used to perform such operations. For example, and aggregation operation may return the sum, average, mean and/or variance associated with a plurality of tuples processed by a node of the stream application.
摘要:
In a networked computer system that includes multiple nodes and multiple networks interconnecting the nodes, a database query optimizer takes network choice into consideration when optimizing a query. The query optimizer logs historical information for queries that includes network information. A network monitor determines network characteristics of the multiple networks. The query optimizer then selects a network for a query based on the network characteristics, and optionally based on the historical information as well. The result is a query optimizer that takes network choice into consideration, thereby providing a query that is optimized according to the characteristics of the networks.
摘要:
Embodiments of the invention enable a database spread over multiple nodes to allow each node to have different indexes over the data in tables, depending on how each node would benefit (or not benefit) from having the index(es). When a database table is spread across the nodes of a multi-node or distributed system, each node may maintain only the portion of the index relevant to that node, if doing so would improve the performance of query processing operations on that node. Further, the database may periodically redistributed across the compute nodes based on index performance. Doing so allows the database system to intelligently trade off between consuming space for the index on a node and the usefulness of having an index on that node.
摘要:
Techniques are described for estimating and managing memory compression for request processing. Embodiments of the invention may generally include receiving a request for data, determining if the requested data contains any compressed data, and sending the requesting entity only the uncompressed data. A separate embodiment generally includes receiving a request for data, determining if the requested data contains any compressed data, gathering uncompression criteria about the requested data, and using the uncompression criteria to selectively determine what portion of the compressed data to uncompress.