Abstract:
A system, method, and program storage device implementing the method, for integrating data in a database management system, wherein the method comprises grouping data sources and replicas of the data sources that provide analogous data into a common logical domain; writing application queries against the common logical domain; selecting a correct set of replicas of the data sources and a query-execution strategy for combining a content of the correct set of replicas of the data sources in order to answer the application queries according to query-cost-based optimization; selecting a correct set of data sources according to run-time constraints; shielding the application queries from changes to the data sources by dynamically binding the application queries against the correct sets of data sources and replicas of the data sources; and processing the application queries by generating an optimum query result based on the steps of grouping and shielding.
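The dynamic binding described above can be illustrated with a minimal sketch. It assumes a hypothetical in-memory registry that maps a logical domain name to candidate replicas, and it reduces the run-time constraint to an availability flag and the cost-based optimization to a single estimated cost per replica. The names `Replica`, `bind`, and the registry layout are illustrative and not taken from the abstract.

```python
from dataclasses import dataclass


@dataclass
class Replica:
    name: str          # hypothetical identifier of a data source or replica
    est_cost: float    # estimated cost of answering the query from it
    available: bool    # stands in for the run-time constraints


def bind(domain, registry):
    """Pick the cheapest available replica grouped under a logical domain."""
    candidates = [r for r in registry[domain] if r.available]
    if not candidates:
        raise LookupError(f"no available source for domain {domain!r}")
    return min(candidates, key=lambda r: r.est_cost)


# The application names only the logical domain, never a concrete source.
registry = {
    "customers": [
        Replica("crm_primary", est_cost=12.0, available=False),
        Replica("crm_replica_eu", est_cost=20.0, available=True),
        Replica("crm_replica_us", est_cost=15.0, available=True),
    ]
}
chosen = bind("customers", registry)
```

Because the query refers only to `"customers"`, the registry can gain or lose sources without any change to the application code.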
Abstract:
A method of masking data failures when a master copy of data is unavailable comprises storing a single master copy of data and a replica copy of the data; writing to the master using a middleware component; and reading from either the master or the replica using the middleware component when the master copy is unavailable. When the master is unavailable, the writes are stored in a table located in the middleware component as stored write requests, in which the stored write requests comprise UDIs (updates, deletes, and inserts) to be made to the master. The reads are also performed from the table and the replica when the master is unavailable. When the master becomes available, the stored write requests are propagated to the master. Thus, the middleware allows the writes to run during times when the master is unavailable.
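The behavior described above can be sketched as follows. This is a toy model under stated assumptions: plain dictionaries stand in for the master and the replica, a Python list stands in for the middleware table of stored write requests, and availability is a flag set by hand. The class and method names are hypothetical.

```python
class Middleware:
    """Toy middleware that buffers writes while the master is unavailable."""

    def __init__(self, master, replica):
        self.master = master            # dict standing in for the master copy
        self.replica = replica          # dict standing in for the replica copy
        self.master_available = True
        self.pending = []               # stored write requests: (op, key, value)

    @staticmethod
    def _apply(store, op, key, value):
        if op == "delete":
            store.pop(key, None)
        else:                           # "insert" and "update" both set the key
            store[key] = value

    def write(self, op, key, value=None):
        if self.master_available:
            self._apply(self.master, op, key, value)
        else:
            self.pending.append((op, key, value))

    def read(self, key):
        if self.master_available:
            return self.master.get(key)
        # Master down: start from the replica, then overlay stored writes in order.
        value = self.replica.get(key)
        for op, k, v in self.pending:
            if k == key:
                value = None if op == "delete" else v
        return value

    def master_recovered(self):
        # Propagate the stored write requests to the master, oldest first.
        for op, key, value in self.pending:
            self._apply(self.master, op, key, value)
        self.pending.clear()
        self.master_available = True


master, replica = {"a": 1}, {"a": 1}
mw = Middleware(master, replica)
mw.master_available = False             # simulate a master outage
mw.write("update", "a", 2)
mw.write("insert", "b", 3)
mw.write("delete", "b")
```

During the outage, reads combine the replica with the buffered requests, so a caller sees its own writes even though the master has not yet received them.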
Abstract:
A system and method of lightweight asynchronous data replication that avoids the need for any persistent store at the replication source or communication channel, and is independent of the underlying data formats, wherein the method of data replication comprises replicating deltas from a source, embedding replication tracking information in the replicated deltas, wherein the tracking information comprises a timestamp and a sequence number, and applying deltas at a target. The tracking information is used to ensure that each delta is applied exactly once. In the event of a crash in the system, the target analyzes the tracking information to determine which deltas have been applied, and requests retransmission of replicated data from the source starting from the earliest unapplied delta. The source and target are treated as only delta producers and consumers, and as such the invention is applicable to replication between arbitrary data source formats.
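A minimal sketch of the exactly-once idea follows. It assumes the source can regenerate its deltas on request (modeled here as a list of changes, which is an assumption of the sketch and not a claim about how the source works) and that the target keeps the last applied sequence number alongside its data. All names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Delta:
    seq: int            # sequence number embedded by the source
    timestamp: float    # timestamp embedded by the source
    key: str
    value: object


class Source:
    def __init__(self, changes):
        # (timestamp, key, value) tuples; stands in for whatever the
        # underlying data source can re-produce on request.
        self.changes = changes

    def deltas_from(self, seq):
        """Yield deltas with sequence numbers >= seq."""
        for i, (ts, key, value) in enumerate(self.changes, start=1):
            if i >= seq:
                yield Delta(i, ts, key, value)


class Target:
    def __init__(self):
        self.data = {}
        self.last_applied = 0           # tracking information kept at the target

    def apply(self, delta):
        if delta.seq <= self.last_applied:
            return False                # duplicate: already applied, skip it
        if delta.seq != self.last_applied + 1:
            raise ValueError("gap in sequence; request retransmission")
        self.data[delta.key] = delta.value
        self.last_applied = delta.seq
        return True

    def resume_point(self):
        """Sequence number of the earliest unapplied delta."""
        return self.last_applied + 1


src = Source([(1.0, "x", 1), (2.0, "y", 2), (3.0, "x", 3)])
tgt = Target()
for d in list(src.deltas_from(1))[:2]:  # the third delta is lost in a "crash"
    tgt.apply(d)
# Retransmission that overlaps an already applied delta is harmless.
results = [tgt.apply(d) for d in src.deltas_from(2)]
```

The duplicate with sequence number 2 is rejected and the delta with sequence number 3 is applied, so each delta takes effect once even when retransmission overlaps.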
Abstract:
Disclosed are embodiments of a system for optimizing query processing in a federated information system. The system may be used to identify alternative query plans in a simulated environment and to calculate cost estimates associated with the alternative query plans, based not only on database statistics and query statements, but also on workload and processing latencies associated with specific data sources and with the federated information system as a whole. In addition, the calculated cost estimates may factor in data source availability and reliability. The system may use the alternative query plans and the associated cost estimates to influence query processing in a federated information system by feeding to the federated information system query plans that allow for cost-efficient query plan-level load distribution, cost-efficient query fragment plan-level load distribution, and load distribution based upon quality of service cost constraints.
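To make the idea of a workload-aware cost estimate concrete, here is a small sketch. The formula is purely illustrative and is an assumption of this example: the abstract does not state how the factors are combined. The field names and `adjusted_cost` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class PlanEstimate:
    name: str
    base_cost: float      # from database statistics and the query statement
    load_factor: float    # current workload at the data source, 0.0 = idle
    latency: float        # observed processing latency, same units as base_cost
    availability: float   # estimated probability the source responds, in (0, 1]


def adjusted_cost(plan):
    # Illustrative formula only: scale the base cost by load, add latency,
    # and penalize sources that are less likely to be available.
    return (plan.base_cost * (1.0 + plan.load_factor) + plan.latency) / plan.availability


def cheapest(plans):
    return min(plans, key=adjusted_cost)


plans = [
    PlanEstimate("plan_a", base_cost=100.0, load_factor=1.0, latency=10.0, availability=1.0),
    PlanEstimate("plan_b", base_cost=150.0, load_factor=0.0, latency=5.0, availability=0.9),
]
best = cheapest(plans)
```

Here `plan_a` looks cheaper on statistics alone, but its source is heavily loaded, so the adjusted estimate favors `plan_b`.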
Abstract:
Disk check bits refer to bit patterns stored in particular bytes of a page which are used to detect errors in writing the page to storage. Every time a page is obtained from storage, changed from the version retained in storage, and written back to storage, the check bit pattern on the changed page is altered to be different from the bit pattern on the storage page. This is because the changed page overwrites the stored page. The invention provides a method for managing the check bits in a multi-DBMS system employing a high-speed shared electronic store as a store-in cache for all pages obtained from disk storage. When a page is first obtained from disk storage by a DBMS and changed, check bit information for the page is maintained in a directory of the store-in cache which indicates what the patterns are for the version of the page in the disk storage. All pages which are modified are stored in the store-in cache and are only returned to disk storage from the cache. Therefore, when a page is to be written to disk storage, the DBMS writing the page to storage processes the check bits on the page itself, changing them as required based on the check bit information stored in the directory for the page.
Abstract:
A computer program product, such as a pre-recorded software program on a floppy disk, is disclosed, for use with a processing and database system for off-loading, to a disk controller, the extraction of committed data. Responsive to the pre-recorded program instructions, the system first picks a Commit_LSN value and ensures that all the data modified prior to the Commit_LSN value is handled according to the DBMS policy on whether to avoid disk I/Os for the modified pages cached in the system. If the policy is not to do disk I/Os for such pages, then the system places the identifiers of those pages in an ignore list. Otherwise, the system writes those pages to disk and empties the ignore list. Afterwards, the system forwards the ignore list and the Commit_LSN, along with information regarding the data to be processed, to the controller. The controller performs the off-load function by reading from disk every page identified by the system except those in the ignore list, and determining, for each page, whether the page's Page_LSN value is less than the Commit_LSN. If it is, then the controller processes the page and adds any qualifying data from that page to a defined answer set. Otherwise, the controller adds the Page_ID for that page to a defined exception list. The controller then passes the answer set and the exception list to the system. The system processes the pages identified in the exception list and those in the ignore list. The system consolidates these answers with the answer set returned by the controller for presentation to the user.
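The controller-side step described above can be sketched as a single function. The page layout (a dictionary with `page_lsn` and `rows`) and the `predicate` callback are assumptions made for illustration; the abstract specifies only the comparison against Commit_LSN and the three lists.

```python
def controller_scan(disk_pages, page_ids, ignore_list, commit_lsn, predicate):
    """Return (answer_set, exception_list) for the pages the system identified.

    disk_pages maps a page ID to {"page_lsn": int, "rows": list}; this layout
    is hypothetical and stands in for the on-disk page format.
    """
    ignore = set(ignore_list)
    answer_set, exception_list = [], []
    for page_id in page_ids:
        if page_id in ignore:
            continue                      # the system handles these pages itself
        page = disk_pages[page_id]
        if page["page_lsn"] < commit_lsn:
            # Everything on this page was modified before Commit_LSN.
            answer_set.extend(row for row in page["rows"] if predicate(row))
        else:
            exception_list.append(page_id)  # hand the page back to the system
    return answer_set, exception_list


disk_pages = {
    1: {"page_lsn": 5, "rows": [1, 20]},
    2: {"page_lsn": 15, "rows": [30]},
    3: {"page_lsn": 3, "rows": [40]},
}
answers, exceptions = controller_scan(
    disk_pages, page_ids=[1, 2, 3], ignore_list=[3],
    commit_lsn=10, predicate=lambda row: row > 10,
)
```

Page 2 lands in the exception list because its Page_LSN is not below the Commit_LSN, and page 3 is skipped because it is on the ignore list; the system would process both and merge the results with `answers`.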
Abstract:
A method and means for archiving files of modifiable pages in a log-based phased commit transaction management system (TMS) in which those pages which have been modified since the last full or incremental backup do not require, during the copy operation, any modification to the page itself but merely to a common status page. This is accomplished by management of a pair of global log sequence numbers. Comparison between a first number (ICBU_LSN) and each data page LSN as the page is modified permits the common status page to be updated to correctly reflect the changed status. Subsequent modifications to the same page do not require amendment of the status page. The status page indicia are reset as part of the backup procedure and for ascertaining the page copy set for incremental copying. The ICBU_LSN assumes one of two values as a function of the copy operation and another value for processing page modifications after the copy operation. A second number (ICRF_LSN) is used in the restoration of a file after the file has been partially restored by a page merge in page number order from full and incremental copies. In this case, the ICRF_LSN defines the point in the log for redo since the most recent copy was made.
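A simplified sketch of the status-page idea follows. It assumes one reading of the comparison: a page whose LSN is below ICBU_LSN has not been modified since the last backup, so its first modification marks the status page and later ones do not. The sketch uses a single ICBU_LSN value and omits the ICRF_LSN restore path, so it is narrower than the scheme the abstract describes. All names are hypothetical.

```python
class PagedFile:
    def __init__(self, n_pages):
        self.page_lsn = [0] * n_pages   # LSN of the last change to each page
        self.changed = set()            # stands in for the common status page
        self.icbu_lsn = 1               # simplified: one value, advanced at backup
        self.next_lsn = 1
        self.status_page_writes = 0     # counts amendments to the status page

    def modify(self, page_no):
        lsn = self.next_lsn
        self.next_lsn += 1
        # Assumed rule: only the first change since the last backup
        # touches the status page.
        if self.page_lsn[page_no] < self.icbu_lsn:
            self.changed.add(page_no)
            self.status_page_writes += 1
        self.page_lsn[page_no] = lsn

    def incremental_backup(self):
        """Return the page copy set and reset the status indicia."""
        copy_set = sorted(self.changed)
        self.changed.clear()
        self.icbu_lsn = self.next_lsn   # later changes compare against this
        return copy_set


f = PagedFile(4)
f.modify(2)
f.modify(2)                             # second change: no status page write
f.modify(0)
first_copy = f.incremental_backup()
f.modify(2)
second_copy = f.incremental_backup()
```

Three modifications before the first backup cost only two status page writes, and the backup itself never rewrites a data page.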
Abstract:
A computer-implemented method for minimizing the amount of time to access current data in a database which may be stored wholly in a DASD-oriented external storage subsystem or partly in DASD and partly in a shared high-speed electronic store while maintaining coherency of the data with respect to multiple user systems.