Abstract:
A method for mirroring data between a plurality of sites comprises: maintaining, at a first site of the plurality of sites, a record that identifies which transactions that have been executed at the first site have had their redo information replicated to the other sites of the plurality of sites; determining a priority value associated with a transaction that is to be performed at the first site, wherein the transaction specifies a modification to a data block; if the priority value is a first value in a set of possible values, then committing the transaction only after the record indicates that redo information associated with the transaction has been replicated to the other sites of the plurality of sites; and if the priority value is a second value in said set of possible values, then committing the transaction even though the record does not indicate that redo information associated with the transaction has been replicated to the other sites of the plurality of sites. The invention also comprises a machine-readable medium.
Abstract:
A graph-based computation has a plurality of nodes representing graph components connected by one or more links representing data flows between the components. Preparing (110) the computation includes: identifying (602) at least a first set of components (250, 402) to process a plurality of transactions in the computation; and associating (606) with a first transaction a first group of one or more data records and data operations corresponding to the data records, and associating (606) with a second transaction a second group of one or more data records and data operations corresponding to the data records. Executing (112) the computation to process at least the first and second transactions includes delaying execution by the first set of components of a data operation corresponding to an initial data record of the second group until after execution by the first set of components of a data operation corresponding to a final data record of the first group.
Abstract:
In an embodiment, a method comprises receiving a plurality of source data records from one or more source computers, wherein one or more first source data records are associated with a first source transaction and one or more second source data records are associated with a second source transaction; generating a first derived transaction comprising one or more first derived records based on the plurality of source data records; generating a first transaction mapping between the first derived transaction and the first source transaction; generating a second transaction mapping between the first derived transaction and the second source transaction; determining that the first derived transaction has ended, and in response, committing first derived transaction including the one or more first derived records to a persistent storage; receiving a first ending punctuation associated with the first source transaction, and in response, committing the first source transaction including the one or more first source data records associated with the first source transaction to the persistent storage; after committing the first derived transaction, determining that the second source transaction failed, and in response: determining that the first derived transaction should be regenerated based on the second transaction mapping, and in response, regenerating the first derived transaction.
Abstract:
Fault tolerant operation is disclosed for a primary match server of a financial exchange using an active copy-cat instance, a.k.a. backup match server, that mirrors operations in the primary match server, but only after those operations have successfully completed in the primary instance. Fault tolerant logic monitors inputs and outputs of the primary match server and gates those inputs to the backup match server once a given input has been processed. The outputs of the backup match server are then compared with the outputs of the primary match server to ensure correct operation. The disclosed embodiments further relate to fault tolerant failover mechanism allowing the backup match server to take over for the primary match server in a fault situation wherein the primary and backup match servers are loosely coupled, i.e. they need not be aware that they are operating in a fault tolerant environment. As such, the primary match server need not be specifically designed or programmed to interact with the fault tolerant mechanisms. Instead, the primary match server need only be designed to adhere to specific basic operating guidelines and shut itself down when it cannot do so. By externally controlling the ability of the primary match server to successfully adhere to its operating guidelines, the fault tolerant mechanisms of the disclosed embodiments can recognize error conditions and easily failover from the primary match server to the backup match server.
Abstract:
Crash recovery of a computing system in the context in which the computing system includes a volatile memory which has contents lost due to the crash, an intermediate non-volatile memory that (together with the volatile memory) contributes to the active memory of the computing system, and non-volatile storage. The recovery occurs by identifying pages that were in the volatile memory at the time of the crash. For each of these pages, the recovery determines whether to recover the page into the volatile memory from either the intermediate non-volatile memory or the storage, and then performs the recovery. In embodiments in which access speeds for the intermediate non-volatile memory may be much faster than for the storage, and in which much of the pages to be recovered are recovered from the intermediate non-volatile memory, the recovery time may be reduced.
Abstract:
A messaging entity configured in a memory of first node of a plurality communicatively coupled nodes is disclosed. The nodes are included in a distributed computing system. The messaging entity is configured to operate as a secondary messaging entity in a messaging server for the plurality communicatively coupled nodes. The messaging entity is communicatively couple to a primary messaging entity configured in a memory of a second node of the plurality of nodes. The primary messaging entity is configured to store a message; store a copy of the message. Also, the messaging entity is configured to be promoted to a new primary messaging entity in the event of failure of the primary messaging entity.
Abstract:
A memory having a plurality of storage blocks, and a memory control unit, are included, and each of the storage blocks is partitioned into a management data area and an actual data area. The management data area stores therein writing-in-progress management data indicating that writing of actual data has started, validity management data indicating that the writing of the actual data has been completed, and pre-erasure management data indicating that stored actual data have been brought into an erasable state. The memory control unit determines, based on a storage state of the management data, a storage block storing therein the latest data.
Abstract:
La présente invention se rapporte essentiellement à un procédé d'exécution synchronisée d'une application dans un environnement à haute disponibilité comportant une pluralité de modules de calcul interconnectés par un réseau à très haut débit, caractérisé en ce qu'il comporte les étapes suivantes : - configuration (2000) de la pluralité de modules en une pluralité de partitions dont au moins: o deux partitions d'exécution l'une primaire, l'autre secondaire o une partition de contrôle; - exécution (2100) de l'application sur chaque partition d'exécution, des entrées-sorties traitées par la partition principale étant transmises à la partition d'exécution secondaire via la partition de contrôle ; - synchronisation (2200) des exécutions via une exploitation des changements de contexte microprocesseur ; - transmission (2300) d'un signal d'erreur catastrophique à la partition de contrôle; - poursuite (2500) de l'exécution en passant dans un mode dégradé, l'exécution continuant sur une seule partition.
Abstract:
A database system may maintain a plurality of log records at a distributed storage system. Each of the plurality of log records may be associated with a respective change to a data page. Upon detection of a coalesce event for a particular data page, log records linked to the particular data page may be applied to generate the particular data page in its current state. Detecting the coalesce event may be a determination that the number of log records linked to the particular data page exceeds a threshold.