Abstract:
Disclosed herein are techniques for adjusting a map reduce execution environment. It is determined whether some operations in a sequence of operations should be implemented in a map reduce execution environment. If so, the map reduce execution environment is adjusted to achieve a predefined performance objective.
Abstract:
A technique of optimizing analytic flows includes sampling source data using a sampling method, executing a flow over the sampled data, obtaining runtime statistics from the executed flow, and combining runtime statistics with historical statistics.
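The four steps named in this abstract (sample, execute, collect statistics, combine with history) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not taken from the abstract: the Bernoulli sampling method, the two-operator flow in `run_flow`, the use of selectivity as the runtime statistic, and the weighted average in `combine`.

```python
import random


def sample_rows(rows, fraction, seed=0):
    """Bernoulli sampling (assumed method): keep each row with probability `fraction`."""
    rng = random.Random(seed)
    return [r for r in rows if rng.random() < fraction]


def run_flow(rows):
    """Hypothetical two-operator flow (filter, then project) that records row counts."""
    stats = {}
    filtered = [r for r in rows if r["amount"] > 100]
    stats["filter"] = {"rows_in": len(rows), "rows_out": len(filtered)}
    projected = [r["amount"] for r in filtered]
    stats["project"] = {"rows_in": len(filtered), "rows_out": len(projected)}
    return projected, stats


def combine(runtime, historical, weight=0.5):
    """Blend the observed selectivity of each operator with its historical value.

    `weight` is the share given to the fresh observation; operators with no
    history simply keep the observed value.
    """
    combined = {}
    for op, s in runtime.items():
        observed = s["rows_out"] / s["rows_in"] if s["rows_in"] else 0.0
        if op in historical:
            combined[op] = weight * observed + (1 - weight) * historical[op]
        else:
            combined[op] = observed
    return combined


if __name__ == "__main__":
    source = [{"amount": a} for a in range(200)]
    sampled = sample_rows(source, fraction=0.25, seed=42)
    _, runtime_stats = run_flow(sampled)
    print(combine(runtime_stats, historical={"filter": 0.4}))
```

An optimizer could then feed the combined selectivities into its cost estimates for the full data set; how that is done is outside what the abstract states.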
Abstract:
Disclosed herein are techniques for optimizing and managing the execution of hybrid flows. An execution plan is generated for each hybrid flow based at least partially on attributes associated therewith. The execution of each hybrid flow is managed in accordance with the execution plan.
Abstract:
A flow of operations is to be executed in a plurality of execution environments according to a distribution. In response to determining that the distribution is unable to achieve at least one criterion, the distribution is modified according to at least one policy that specifies at least one action to apply to the flow of operations in response to a corresponding at least one condition relating to a characteristic of the flow of operations.
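As a rough illustration of the policy-driven modification described here, the sketch below pairs each policy with a condition on a flow characteristic and an action that rewrites the distribution. The names `Policy`, `estimate_cost`, and `adjust`, the single cost-budget criterion, and the per-environment cost table are all hypothetical and are not drawn from the abstract.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Policy:
    # Condition on a characteristic of the flow (for example, its input size).
    condition: Callable[[dict], bool]
    # Action that returns a modified operation-to-environment distribution.
    action: Callable[[Dict[str, str]], Dict[str, str]]


def estimate_cost(distribution, op_costs):
    """Sum the assumed cost of each operation in its assigned environment."""
    return sum(op_costs[op][env] for op, env in distribution.items())


def adjust(distribution, flow_traits, op_costs, budget, policies: List[Policy]):
    """Apply matching policies in order until the cost criterion is met."""
    current = dict(distribution)
    if estimate_cost(current, op_costs) <= budget:
        return current  # the criterion is already achievable
    for policy in policies:
        if policy.condition(flow_traits):
            current = policy.action(dict(current))
            if estimate_cost(current, op_costs) <= budget:
                break
    return current


if __name__ == "__main__":
    op_costs = {
        "extract": {"db": 2, "mr": 5},
        "aggregate": {"db": 9, "mr": 3},
    }
    move_aggregate = Policy(
        condition=lambda traits: traits["rows"] > 1_000_000,
        action=lambda d: {**d, "aggregate": "mr"},
    )
    start = {"extract": "db", "aggregate": "db"}
    print(adjust(start, {"rows": 5_000_000}, op_costs, budget=6, policies=[move_aggregate]))
```

Here the policies are applied cumulatively in list order; a real system might instead rank or search over them.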
Abstract:
Converting an input script includes obtaining an input script comprising at least one variable, obtaining at least one translation transformation rule from a library, converting the input script into a tree representation, folding the tree representation to hide a subset of variables in the input script to create a folded tree, and generating a natural language text by applying at least one translation transformation rule from the library to the folded tree.
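The pipeline in this abstract (script to tree, fold to hide variables, apply translation rules) can be sketched on a toy script language. The sketch assumes, purely for illustration, that the script consists of simple assignments of function calls written in Python syntax, that folding means inlining the hidden variables into their uses, and that the rule library is the small `RULES` dictionary below. None of these details come from the abstract.

```python
import ast

# Hypothetical rule library: one text template per function name.
RULES = {
    "load": "load the table {0}",
    "filter": "keep the rows of ({0}) where {1}",
}


def to_tree(script):
    """Parse `var = func(args)` lines into {var: (func, [(kind, value), ...])}."""
    tree = {}
    for stmt in ast.parse(script).body:
        target = stmt.targets[0].id
        call = stmt.value
        args = [
            ("var", a.id) if isinstance(a, ast.Name) else ("lit", a.value)
            for a in call.args
        ]
        tree[target] = (call.func.id, args)
    return tree


def fold(tree, hidden):
    """Inline hidden variables into their uses so they vanish as separate nodes."""
    def expand(node):
        func, args = node
        new_args = []
        for kind, value in args:
            if kind == "var" and value in hidden:
                new_args.append(("node", expand(tree[value])))
            else:
                new_args.append((kind, value))
        return (func, new_args)

    return {var: expand(node) for var, node in tree.items() if var not in hidden}


def render(node):
    """Apply the rule for the node's function, rendering nested nodes first."""
    func, args = node
    parts = [render(v) if kind == "node" else str(v) for kind, v in args]
    return RULES[func].format(*parts)


def describe(script, hidden):
    folded = fold(to_tree(script), hidden)
    return [f"{var}: {render(node)}" for var, node in folded.items()]


if __name__ == "__main__":
    script = "tmp = load('sales')\nresult = filter(tmp, 'amount > 100')"
    for line in describe(script, hidden={"tmp"}):
        print(line)
```

Hiding `tmp` leaves a single description for `result`, which is the point of folding: intermediate variables do not clutter the generated text.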
Abstract:
Converting a hybrid flow can include combining each of a plurality of task nodes with a plurality of corresponding operators of the hybrid flow and converting the combined plurality of task nodes and the plurality of corresponding operators of the hybrid flow to a data flow graph using a code template.
Abstract:
According to an example, a catalog of scripts may be managed. Management of the catalog of scripts may include the addition of a script description into the catalog of scripts. In one example, the script description may be directly added to the catalog of scripts. In another example, the script description may be added through generation of a merged query of scripts.
Abstract:
According to an example, candidate scripts may be determined from a catalog of scripts to perform a requested operation. In determining the candidate scripts, a request for an operation may be received, in which the request includes an input and an output. In addition, based upon the input and the output, a plurality of candidate scripts that are to perform the requested operation may be identified from the catalog of scripts, in which each of the plurality of candidate scripts comprises at least one of a script that is to perform the requested operation individually or a number of scripts that, in combination, are to perform the requested operation. Moreover, a score for each of the plurality of candidate scripts may be calculated based upon a plurality of factors respectively corresponding to the plurality of candidate scripts, and the plurality of candidate scripts and the calculated scores may be outputted.
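A minimal sketch of this candidate search follows. The catalog entries, the restriction to chains of at most two scripts, and the scoring formula (least-used script's usage count divided by total runtime) are all hypothetical choices made for illustration; the abstract does not specify the factors or how they are weighed.

```python
from itertools import permutations

# Hypothetical catalog: each script declares its input type, output type,
# and two illustrative scoring factors.
CATALOG = [
    {"name": "csv_to_table", "input": "csv", "output": "table", "runtime": 2.0, "uses": 10},
    {"name": "table_to_report", "input": "table", "output": "report", "runtime": 3.0, "uses": 4},
    {"name": "csv_to_report", "input": "csv", "output": "report", "runtime": 8.0, "uses": 1},
]


def find_candidates(catalog, source, target):
    """Return single scripts and two-script chains that map `source` to `target`."""
    singles = [(s,) for s in catalog if s["input"] == source and s["output"] == target]
    chains = [
        (a, b)
        for a, b in permutations(catalog, 2)
        if a["input"] == source and a["output"] == b["input"] and b["output"] == target
    ]
    return singles + chains


def score(candidate):
    """Assumed score: usage count of the least-used script divided by total runtime."""
    runtime = sum(s["runtime"] for s in candidate)
    popularity = min(s["uses"] for s in candidate)
    return popularity / runtime


def rank(catalog, source, target):
    """Output each candidate (as a list of script names) with its score, best first."""
    scored = [
        ([s["name"] for s in c], score(c))
        for c in find_candidates(catalog, source, target)
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for names, value in rank(CATALOG, "csv", "report"):
        print(names, round(value, 3))
```

With this catalog, the two-script chain outranks the single direct script because it is both faster in total and more widely used.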