Abstract:
A work flow description database represents long running work flows as a set of work units, called steps, with information flows therebetween. The description database defines each step's input and output signals, input condition criteria for creating an instance of the step, an application program associated with the step, and criteria for selecting a resource to execute the step. A work flow controller controls the process of executing instances of each defined type of work flow. Execution of a long running work flow begins when a corresponding set of externally generated input event signals are received by the work flow controller. During execution of a work flow, each step of the work flow is instantiated only when a sufficient set of input signals is received to execute that step. At that point an instance of the required type of step is created and then executed by a selected resource. After termination of a step, output signals from the step are converted into input event signals for other steps in the work flow in accordance with data stored in the work flow description database. Each step executes an application program and is treated as an individual transaction insofar as durable storage of its results. Log records are durably stored upon instantiation, execution and termination of each step of a work flow, and output event signals are also logged, thereby durably storing sufficient data to recover a work flow with virtually no loss of the work that was accomplished prior to a system failure.
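The instantiation rule described above (a step is created only once its full set of input signals has arrived, and its outputs become input events for other steps) can be sketched as follows. This is a minimal illustration, not the patented implementation: the names `StepDef`, `FlowController`, `routes`, and the in-memory `log` list standing in for durable log records are all assumptions made for the example.

```python
from collections import defaultdict


class StepDef:
    """Hypothetical description-database entry for one step type."""

    def __init__(self, name, inputs, program, routes=None):
        self.name = name
        self.inputs = set(inputs)   # signals required before instantiation
        self.program = program      # callable: dict of inputs -> dict of outputs
        # output signal name -> list of (target step, target input signal)
        self.routes = routes or {}


class FlowController:
    """Instantiates a step only when all of its input signals are present."""

    def __init__(self, step_defs):
        self.defs = {d.name: d for d in step_defs}
        self.pending = defaultdict(dict)  # step name -> signals received so far
        self.log = []                     # stand-in for durable log records

    def signal(self, step, name, value):
        queue = [(step, name, value)]
        while queue:
            step, name, value = queue.pop(0)
            self.pending[step][name] = value
            d = self.defs[step]
            if not d.inputs.issubset(self.pending[step]):
                continue  # not enough input signals yet
            inputs = self.pending.pop(step)
            self.log.append(("instantiate", step))
            outputs = d.program(inputs)
            self.log.append(("terminate", step))
            # Convert output signals into input events for other steps.
            for out_name, out_value in outputs.items():
                for target_step, target_input in d.routes.get(out_name, []):
                    self.log.append(("event", target_step, target_input))
                    queue.append((target_step, target_input, out_value))


check = StepDef("check", ["order"],
                lambda inp: {"approved": inp["order"] > 0},
                routes={"approved": [("ship", "approval")]})
ship = StepDef("ship", ["approval", "address"],
               lambda inp: {"shipped": inp["address"]})

controller = FlowController([check, ship])
controller.signal("ship", "address", "1 Main St")  # ship still waits for approval
controller.signal("check", "order", 42)            # check runs, then ship runs
```

In this sketch recovery would replay the log; a real system would write each record to durable storage before proceeding.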
Abstract:
A transaction description database represents long running computations as a flow or set of computational steps with data arcs therebetween. The description database defines each step's input and output signals, input condition criteria for creating an instance of the step, an application program associated with the step, and criteria for selecting a resource to execute the step. A flow controller controls the process of executing instances of each defined type of long running transaction. Execution of a long running transaction begins when a corresponding set of externally generated input event signals are received by the flow controller. During execution of a long running transaction, each step of the transaction is instantiated only when a sufficient set of input signals is received to execute that step. At that point an instance of the required type of step is created and then executed by a selected resource. After termination of a step, output signals from the step are converted into input event signals for other steps in the long running transaction in accordance with data stored in the transaction description database. Each step executes an application program and is treated as an individual computation insofar as durable storage of its computational results. Log records are durably stored upon instantiation, execution and termination of each step of a long running transaction, and output event signals are also logged, thereby durably storing sufficient data to recover a long running transaction with virtually no loss of the work that was accomplished prior to a system failure.
Abstract:
Page feed for efficient dataflow between distributed query engines is disclosed. An example method comprises storing a query result relation by a producer query as pages in a query engine buffer pool, and then sending these pages to a distributed caching platform (DCP) as the data source of another query. The example method also includes reading the result relation by a consumer query from the page stored in the DCP.
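A rough sketch of the producer and consumer roles described above, assuming a plain dictionary as a stand-in for the distributed caching platform. The key scheme (`name/n`, `name/count`), the `PAGE_SIZE` value, and both function names are hypothetical and chosen only for illustration.

```python
PAGE_SIZE = 3  # rows per page; an assumed value for the example


def put_relation(cache, name, rows):
    """Producer side: split a result relation into pages and store them."""
    pages = [rows[i:i + PAGE_SIZE] for i in range(0, len(rows), PAGE_SIZE)]
    for n, page in enumerate(pages):
        cache[f"{name}/{n}"] = page
    cache[f"{name}/count"] = len(pages)


def scan_relation(cache, name):
    """Consumer side: read the relation back page by page."""
    for n in range(cache[f"{name}/count"]):
        yield from cache[f"{name}/{n}"]


dcp = {}  # dictionary standing in for the DCP
put_relation(dcp, "q1_result", list(range(7)))
rows = list(scan_relation(dcp, "q1_result"))
```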
Abstract:
A method includes binding a stationed operator to a database query engine, wherein the stationed operator comprises system utilities and an open executor to execute user-supplied application logic. The method further includes receiving an unbounded input stream of input tuples by the stationed operator, the stream of input tuples comprising a succession of chunks. The method also includes applying a query defined by the application logic to each successive chunk of the input stream to produce an output packet for each such chunk, and generating an unbounded output stream comprising the output packets.
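The chunk-by-chunk processing described above can be illustrated with generators. This is a sketch under stated assumptions: chunks are cut by a fixed tuple count (the abstract does not say how chunk boundaries are chosen), and `chunks`, `stationed_operator`, and the averaging query are hypothetical names for the example.

```python
from itertools import count, islice


def chunks(stream, size):
    """Cut a possibly unbounded stream into successive lists of `size` tuples."""
    it = iter(stream)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk


def stationed_operator(stream, size, query):
    """Apply `query` to each chunk and emit one output packet per chunk."""
    for chunk in chunks(stream, size):
        yield query(chunk)


def average_value(chunk):
    """Example of user-supplied logic: mean of the second field."""
    return sum(value for _, value in chunk) / len(chunk)


readings = ((i, i % 5) for i in count())          # unbounded input stream
packets = stationed_operator(readings, 4, average_value)
first_three = list(islice(packets, 3))            # [1.5, 1.75, 2.0]
```

Because both input and output are lazy generators, the operator never needs the whole stream in memory, which is the property the abstract relies on.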
Abstract:
Methods, systems, and computer program products are provided for generating application-aware data partitioning to support parallel computing. A label for a user defined data partitioning (UDP) key is generated by a labeling process to configure data partitions of original data. The UDP is labeled by the labeling process to include at least one key property excluded from the original data. The data partitions are evenly distributed to co-locate and balance the data partitions and corresponding computations performed by computational servers. A data record of the data partitions is retrieved by performing an all-node parallel search of the computational servers using the UDP key.
Abstract:
The present invention relates to devices and methods that coordinate an external conversation process between entities with an internal workflow of one of the entities. More particularly, it relates to devices and methods that are compliant with an inter-enterprise conversation process standard for routing electronic commerce documents between enterprises. Particular aspects of the present invention are described in the claims, specification and drawings.
Abstract:
Attributes in data records are identified. Episodes corresponding to the respective attributes include respective data records. The episodes are scored, where scoring of a particular one of the episodes is based on relative time densities between successive data records of the particular episode. A visualization of at least some of the episodes is generated to provide an alert of time-dense episodes.
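One way to picture the scoring step is shown below. The abstract does not give a formula, so the score used here (the mean of the inverse time gaps between successive records) is an assumption chosen for illustration, as are the record layout with a `t` timestamp field and the function names.

```python
from collections import defaultdict


def episodes_by_attribute(records, attribute):
    """Group record timestamps into one episode per attribute value."""
    groups = defaultdict(list)
    for record in records:
        groups[record[attribute]].append(record["t"])
    return {key: sorted(times) for key, times in groups.items()}


def density_score(timestamps):
    """Hypothetical score: mean inverse gap between successive records."""
    gaps = [b - a for a, b in zip(timestamps, timestamps[1:])]
    if not gaps:
        return 0.0
    # Guard against zero gaps from records with identical timestamps.
    return sum(1.0 / max(gap, 1e-9) for gap in gaps) / len(gaps)


records = [
    {"t": 0, "host": "a"}, {"t": 1, "host": "a"}, {"t": 2, "host": "a"},
    {"t": 0, "host": "b"}, {"t": 10, "host": "b"}, {"t": 20, "host": "b"},
]
episodes = episodes_by_attribute(records, "host")
scores = {host: density_score(times) for host, times in episodes.items()}
# Episode "a" is denser in time than episode "b", so it scores higher.
```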
Abstract:
A method for generating summaries of text is described. The method includes the step of extracting features from text of text lists from summaries. The explanatoriness of the text is then evaluated, wherein evaluating the explanatoriness of text includes evaluating the features of the text, including at least the step of evaluating the discriminativeness of the features of the text by comparing the text to a first text data set, wherein the first text data set is derived from a topic label. The evaluated text is then ranked based on the explanatoriness evaluation.
Abstract:
An open station as a stream analysis operator container is disclosed. In an example, a method is stored as instructions on computer readable media and executable by a processor. The method includes packaging graph-structured stream transformations into a topology of a top-level dataflow process including stream transformation operators (STOs) on open stations. The method also includes sending a tuple from one of the STOs to every other STO in the topology subscribed to a data stream of the tuple.
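The delivery rule in the last sentence (a tuple goes to every other STO subscribed to its stream) can be sketched as a small publish/subscribe structure. The `Topology` and `STO` classes and their methods are hypothetical names for this example and say nothing about how open stations are actually implemented.

```python
from collections import defaultdict


class STO:
    """Hypothetical stream transformation operator that records what it receives."""

    def __init__(self, name):
        self.name = name
        self.inbox = []

    def receive(self, stream, tup):
        self.inbox.append((stream, tup))


class Topology:
    """Tracks which STOs subscribe to which data streams."""

    def __init__(self):
        self.subscribers = defaultdict(list)  # stream name -> subscribed STOs

    def subscribe(self, stream, sto):
        self.subscribers[stream].append(sto)

    def send(self, sender, stream, tup):
        # Deliver to every subscriber of the stream except the sender itself.
        for sto in self.subscribers.get(stream, ()):
            if sto is not sender:
                sto.receive(stream, tup)


parser, joiner, sink = STO("parser"), STO("joiner"), STO("sink")
topology = Topology()
topology.subscribe("trades", parser)
topology.subscribe("trades", joiner)
topology.send(parser, "trades", ("ACME", 101.5))
# joiner receives the tuple; parser (the sender) and sink (not subscribed) do not.
```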
Abstract:
Pixels representing attributes of data records are placed into discrete rings of a graphical visualization, where the discrete rings correspond to respective ones of the attributes, and where the pixels are arranged in each of the discrete rings according to time intervals of the data records. A correlation ring is provided in the graphical visualization in addition to the discrete rings, where the correlation ring has visual indicators for indicating degrees of correlation between at least two of the attributes.