Abstract:
Methods and systems of performing data mining may include receiving a plurality of web log records and a plurality of call log records; associating one or more web log records with a call log record, wherein each associated web log record and the call log record belong to the same user; identifying one or more patterns among the web log records for the plurality of call log records, wherein each pattern comprises one or more web accesses, a time stamp at which each of the one or more web accesses is performed, and the call topic for the call log record; identifying one or more web log records associated with a new call; and predicting a call topic for the new call based on at least one pattern and the one or more web log records.
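The association and prediction steps described above can be illustrated with a minimal sketch. It is not the claimed method: the record fields (`user`, `page`, `ts`, `topic`), the one-hour look-back window, and the exact-match rule on the page sequence are all assumptions made for illustration, and time stamps are used only to order and window the web accesses.

```python
from collections import Counter, defaultdict

def associate(web_logs, call_logs, window=3600):
    """Map each call index to the pages the same user visited
    within `window` seconds before the call (hypothetical rule)."""
    ordered = sorted(web_logs, key=lambda w: w["ts"])
    by_call = {}
    for i, call in enumerate(call_logs):
        by_call[i] = tuple(
            w["page"]
            for w in ordered
            if w["user"] == call["user"] and 0 <= call["ts"] - w["ts"] <= window
        )
    return by_call

def learn_patterns(web_logs, call_logs, window=3600):
    """Count how often each page sequence preceded each call topic."""
    patterns = defaultdict(Counter)
    for i, pages in associate(web_logs, call_logs, window).items():
        if pages:
            patterns[pages][call_logs[i]["topic"]] += 1
    return patterns

def predict_topic(patterns, recent_pages):
    """Return the most frequent topic seen after this page sequence, or None."""
    counts = patterns.get(tuple(recent_pages))
    if not counts:
        return None
    return counts.most_common(1)[0][0]

web = [
    {"user": "u1", "page": "/billing", "ts": 100},
    {"user": "u1", "page": "/pay", "ts": 200},
    {"user": "u2", "page": "/router", "ts": 150},
]
calls = [
    {"user": "u1", "ts": 300, "topic": "billing dispute"},
    {"user": "u2", "ts": 400, "topic": "connectivity"},
]
patterns = learn_patterns(web, calls)
```

A real system would likely match partial or fuzzy sequences rather than exact tuples; exact matching is used here only to keep the sketch short.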
Abstract:
Methods and systems for partitioning data for processing in a plurality of data centers are disclosed. For each of a plurality of data centers, a time period required for the data center to process an amount of information may be estimated. The plurality of data centers may be ordered based on the time period for each data center. Data may be received from one or more sources. A data center having a smallest time period from the ordered plurality of data centers may be selected to be added to a set of data centers. An overall execution time for the set of data centers to process the data may be determined. The selecting and determining operations may be repeated until the overall execution time satisfies one or more threshold criteria. The data may be transmitted to the set of data centers.
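The select-and-evaluate loop described above might look like the following sketch. The cost model is an assumption, not something the abstract specifies: each data center has a hypothetical estimated time per unit of data, the data is split so that all chosen centers finish together, and the only threshold criterion is a deadline.

```python
def select_data_centers(unit_times, data_size, deadline):
    """Greedily add the fastest remaining data center until the
    estimated overall execution time meets the deadline.

    unit_times: dict of name -> estimated seconds per unit of data (> 0).
    Returns (chosen names in order added, estimated overall time).
    """
    ordered = sorted(unit_times, key=unit_times.get)  # fastest first
    chosen = []
    overall = float("inf")
    for name in ordered:
        chosen.append(name)
        # Assumed model: work is divided in proportion to speed, so the
        # set behaves like one center with the summed throughput.
        throughput = sum(1.0 / unit_times[c] for c in chosen)
        overall = data_size / throughput
        if overall <= deadline:
            break
    return chosen, overall

chosen, overall = select_data_centers({"a": 2.0, "b": 1.0, "c": 4.0}, 100, 70)
```

Under this model adding a center never hurts, so the loop simply stops at the smallest set that meets the deadline; a model with transfer or coordination overhead would need a different stopping rule.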
Abstract:
Methods and systems for scalable extraction of episode rules using incremental episode tree construction in a multi-application event space comprise compiling events from multiple, different domain logs into a universal log file, rolling domain-dependent and domain-independent windows through the universal log file to identify distinct event-pattern episodes, adding episodes to an episode tree data structure, pruning less frequent episodes from the episode tree, analyzing the episode tree to identify frequent episode rules, and applying the frequent episode rules to future interactions with users.
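The pipeline described above (merge logs, roll a window, grow a tree, prune, extract rules) can be sketched as follows. This is a simplified illustration under stated assumptions: each log is a time-sorted list of `(timestamp, event)` pairs, a single fixed window stands in for the domain-dependent and domain-independent windows, and the tree layout, `max_len` cap, and confidence measure are hypothetical choices.

```python
import heapq

def merge_logs(*logs):
    """Merge time-sorted per-domain logs into one universal log."""
    return list(heapq.merge(*logs))

def build_episode_tree(universal_log, window, max_len=3):
    """Insert every event sequence that starts at an event and fits in
    `window` time units into a prefix tree with occurrence counts."""
    tree = {}
    for i, (start, _) in enumerate(universal_log):
        node = tree
        for ts, event in universal_log[i:i + max_len]:
            if ts - start > window:
                break
            child = node.setdefault(event, {"count": 0, "children": {}})
            child["count"] += 1
            node = child["children"]
    return tree

def prune(tree, min_count):
    """Drop episodes (and their extensions) seen fewer than min_count times."""
    return {
        event: {"count": n["count"], "children": prune(n["children"], min_count)}
        for event, n in tree.items()
        if n["count"] >= min_count
    }

def rules(tree, min_conf, prefix=()):
    """Yield (episode, next_event, confidence) for confident extensions."""
    out = []
    for event, n in tree.items():
        path = prefix + (event,)
        for nxt, child in n["children"].items():
            conf = child["count"] / n["count"]
            if conf >= min_conf:
                out.append((path, nxt, conf))
        out.extend(rules(n["children"], min_conf, path))
    return out

web = [(1, "login"), (5, "view_bill"), (20, "login"), (24, "view_bill")]
phone = [(7, "call"), (26, "call")]
tree = build_episode_tree(merge_logs(web, phone), window=10)
found = rules(prune(tree, 2), min_conf=0.9)
```

In this toy log, the rule "login, then view_bill, is followed by call" survives pruning, which is the kind of rule that could then be applied to a future user interaction.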
Abstract:
Embodiments relate to systems and methods for self-adaptive episode mining under time threshold using delay estimation and temporal division. An episode mining engine can analyze a set of episodes captured from a set of network resources to detect all sequences of user-specified frequency within a supplied runtime budget or time threshold. The engine can achieve desired levels of completeness in the results by mining the input log file in multiple stages or steps, each having successively longer lengths of event sequences. After completion of each stage, the engine calculates a remaining amount of runtime budget, and updates the amount of time to be allocated for each of the remaining stages up to a generated maximum stage (or sequence length). The engine thus corrects the estimated remaining time in the runtime budget (or threshold) after each stage, and continues to the next stage until the runtime budget is consumed.
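The staged, budget-corrected loop described above can be sketched roughly as below. Several details are assumptions rather than part of the abstract: episodes are taken to be contiguous event sequences, the remaining budget is divided evenly over the remaining stages, and a stage that overruns its share is discarded instead of reported partially. The function name and the injectable `clock` parameter are hypothetical.

```python
import time
from collections import Counter

def mine_with_budget(events, min_count, budget_s, max_len, clock=time.monotonic):
    """Mine frequent contiguous sequences of length 1..max_len,
    re-dividing the remaining runtime budget before every stage."""
    deadline = clock() + budget_s
    frequent = {}
    for length in range(1, max_len + 1):
        remaining = deadline - clock()
        if remaining <= 0:
            break  # budget consumed
        # Correct the estimate: split what is left over the stages still to run.
        stage_deadline = clock() + remaining / (max_len - length + 1)
        counts = Counter()
        complete = True
        for i in range(len(events) - length + 1):
            if clock() > stage_deadline:
                complete = False
                break
            counts[tuple(events[i:i + length])] += 1
        if complete:
            frequent[length] = {s: c for s, c in counts.items() if c >= min_count}
    return frequent

result = mine_with_budget(list("abcabcab"), min_count=2, budget_s=5.0, max_len=2)
```

Because the remaining budget is recomputed at the top of each stage, time left over by a stage that finishes early is automatically passed on to the later, longer stages.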
Abstract:
A method and apparatus for providing a resource allocation policy in a network are disclosed. For example, the method constructs a queuing model for each application. The method defines a utility function for each application and for each transaction type of each application, and defines an overall utility in a system. The method performs an optimization to identify an optimal configuration that maximizes the overall utility for a given workload, and determines one or more adaptation policies for configuring the system in accordance with the optimal configuration.
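As an illustration of the model-then-optimize idea above, the sketch below makes several assumptions the abstract does not state: each application is modeled as an M/M/1 queue (mean response time 1/(mu - lambda)), utility falls linearly from 1 to 0 as response time approaches a target, the overall utility is a weighted sum, and the configuration is simply a split of identical servers found by exhaustive search. Per-transaction-type utilities and adaptation policies are left out.

```python
from itertools import product

def response_time(servers, rate_per_server, arrival_rate):
    """Mean M/M/1 response time; infinite if the queue is unstable."""
    mu = servers * rate_per_server
    return float("inf") if mu <= arrival_rate else 1.0 / (mu - arrival_rate)

def utility(resp, target):
    """Hypothetical utility: 1 at zero latency, 0 at or beyond the target."""
    return max(0.0, 1.0 - resp / target)

def best_allocation(apps, total_servers):
    """Try every split of the servers and keep the highest overall utility."""
    names = list(apps)
    best, best_u = None, float("-inf")
    for alloc in product(range(total_servers + 1), repeat=len(names)):
        if sum(alloc) != total_servers:
            continue
        u = sum(
            apps[n]["weight"]
            * utility(response_time(k, apps[n]["rate"], apps[n]["arrivals"]),
                      apps[n]["target"])
            for n, k in zip(names, alloc)
        )
        if u > best_u:
            best, best_u = dict(zip(names, alloc)), u
    return best, best_u

apps = {
    "web": {"rate": 10, "arrivals": 25, "target": 0.5, "weight": 1},
    "batch": {"rate": 10, "arrivals": 5, "target": 2.0, "weight": 1},
}
best, best_u = best_allocation(apps, total_servers=5)
```

Exhaustive search is only practical for a handful of applications and servers; it is used here to keep the optimization step transparent.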
Abstract:
An embodiment generally relates to systems and methods for improving system performance by reducing fragility of computing systems. A processing module can identify separate ensemble files each comprising interpretations, by separate entities of a workflow, of a phrase in a file. The processing module can compare the interpretations to determine if the interpretations are the same or essentially the same. If the interpretations are neither the same nor essentially the same, a subsequent entity in the workflow can create a new file that replaces an associated interpretation of the phrase with a common interpretation. The subsequent entity can proceed with an intended operation.
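The compare-and-replace step above could be sketched as follows. The abstract does not define "essentially the same" or how the common interpretation is chosen, so both are assumptions here: two interpretations are treated as essentially the same if they match after case and whitespace normalization, and the common interpretation is the most frequent normalized one. All names are hypothetical.

```python
from collections import Counter

def normalize(text):
    """Assumed notion of 'essentially the same': ignore case and spacing."""
    return " ".join(text.lower().split())

def reconcile(interpretations):
    """interpretations: dict of entity -> that entity's reading of one phrase.

    Returns (interpretations to use, whether a replacement was needed)."""
    normalized = {e: normalize(v) for e, v in interpretations.items()}
    if len(set(normalized.values())) == 1:
        return dict(interpretations), False  # already (essentially) the same
    # Assumed rule: adopt the most frequent reading as the common one.
    common = Counter(normalized.values()).most_common(1)[0][0]
    return {e: common for e in interpretations}, True

same, changed_same = reconcile({"parser": "2013-01-02", "loader": " 2013-01-02 "})
fixed, changed_fixed = reconcile(
    {"a": "2 January 2013", "b": "2 january 2013", "c": "1 February 2013"}
)
```

After reconciliation, the subsequent entity in the workflow would proceed with the agreed reading instead of failing on a mismatch.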
Abstract:
A method for computing the energy rating for cloud-based software services is disclosed. For each service, the following steps are performed. The method includes identifying configuration parameters impacting the energy consumption. The method further includes determining a value for each configuration parameter. Further, the method includes determining a relative energy rating using a pre-determined equation, based on the values of the configuration parameters. Finally, the method includes assigning a discrete value based on the range in which the relative energy rating falls.
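The steps above can be illustrated with a small sketch. The abstract does not give the pre-determined equation, so a weighted average of parameter values scaled to [0, 1] is assumed here; the parameter names, weights, band boundaries, and letter labels are likewise hypothetical.

```python
def relative_energy_rating(values, weights):
    """Assumed equation: weighted average of normalized parameter values."""
    total = sum(weights.values())
    return sum(weights[p] * values[p] for p in weights) / total

def discrete_rating(score, bands=((0.25, "A"), (0.5, "B"), (0.75, "C"))):
    """Map the relative rating to a discrete label by the range it falls in."""
    for upper, label in bands:
        if score <= upper:
            return label
    return "D"

values = {"cpu_util": 0.5, "replicas": 0.25, "storage": 1.0}
weights = {"cpu_util": 2, "replicas": 1, "storage": 1}
score = relative_energy_rating(values, weights)
label = discrete_rating(score)
```

With these example inputs the weighted average is 0.5625, which falls in the third band.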