摘要:
Methods and systems for partitioning data for processing in a plurality of data centers are disclosed. For each of a plurality of data centers, a time period required for the data center to process an amount of information may be estimated. The plurality of data centers may be ordered based on the time period for each data center. Data may be received from one or more sources. A data center having a smallest time period from the ordered plurality of data centers may be selected to be added to a set of data centers. An overall execution time for the set of data centers to process the data may be determined. The selecting and determining operations may be repeated until the overall execution time satisfies one or more threshold criteria. The data may be transmitted to the set of data centers.
摘要:
Methods and systems of performing data mining may include receiving a plurality of web log records and a plurality of call log records; associating one or more web log records with a call log record, wherein the associated user for each of the associated one or more web log records and the call log record are the same; identifying one or more patterns among the web log records for the plurality of call log records, wherein each pattern comprises one or more web accesses, a time stamp at which each of the one or more web accesses is performed and the call topic for the call log record; identifying one or more web log records associated with a new call, and predicting a call topic for the new call based on at least one pattern and the one or more web log records.
摘要:
Methods and systems for scalable extraction of episode rules using incremental episode tree construction in a multi-application event space comprise compiling events from multiple, different domain logs into in a universal log file, rolling domain-dependent and domain-independent windows through the universal log file to identify distinct event-pattern episodes, adding episodes to an episode tree data structure, pruning less frequent episodes from the episode tree, analyzing the episode tree to identify frequent episode rules, and applying the frequent episode rules to future interactions with users.
摘要:
Methods and systems of performing data mining may include receiving a plurality of web log records and a plurality of call log records; associating one or more web log records with a call log record, wherein the associated user for each of the associated one or more web log records and the call log record are the same; identifying one or more patterns among the web log records for the plurality of call log records, wherein each pattern comprises one or more web accesses, a time stamp at which each of the one or more web accesses is performed and the call topic for the call log record; identifying one or more web log records associated with a new call, and predicting a call topic for the new call based on at least one pattern and the one or more web log records.
摘要:
Embodiments relate to systems and methods for self-adaptive episode mining under time threshold using delay estimation and temporal division. An episode mining engine can analyze a set of episodes captured from a set of network resources to detect all sequences of user-specified frequency within a supplied runtime budget or time threshold. The engine can achieve desired levels of completeness in the results by mining the input log file in multiple stages or steps, each having successively longer lengths of event sequences. After completion of each stage, the engine calculates a remaining amount of runtime budget, and updates the amount of time to be allocated for each of the remaining stages up to a generated maximum stage (or sequence length). The engine thus corrects the estimated remaining time in the runtime budget (or threshold) after each stage, and continues to the next stage until the runtime budget is consumed.
摘要:
Embodiments relate to systems and methods for self-adaptive episode mining under time threshold using delay estimation and temporal division. An episode mining engine can analyze a set of episodes captured from a set of network resources to detect all sequences of user-specified frequency within a supplied runtime budget or time threshold. The engine can achieve desired levels of completeness in the results by mining the input log file in multiple stages or steps, each having successively longer lengths of event sequences. After completion of each stage, the engine calculates a remaining amount of runtime budget, and updates the amount of time to be allocated for each of the remaining stages up to a generated maximum stage (or sequence length). The engine thus corrects the estimated remaining time in the runtime budget (or threshold) after each stage, and continues to the next stage until the runtime budget is consumed.
摘要:
Methods and systems for scalable extraction of episode rules using incremental episode tree construction in a multi-application event space comprise compiling events from multiple, different domain logs into in a universal log file, rolling domain-dependent and domain-independent windows through the universal log file to identify distinct event-pattern episodes, adding episodes to an episode tree data structure, pruning less frequent episodes from the episode tree, analyzing the episode tree to identify frequent episode rules, and applying the frequent episode rules to future interactions with users.
摘要:
Methods and systems for automatically organizing devices in a network are disclosed. Information may be collected for a plurality of devices in a predetermined area over a predetermined period of time. The information may include device location information and device information for one or more user groups. Each user group may include one or more users. An average preference for each user group may be determined for each device based on the device usage information. The plurality of devices may be organized automatically into a plurality of clusters based on at least the determined average preferences. Each cluster may include one or more devices.
摘要:
Methods and systems for automatically organizing devices in a network are disclosed. Information may be collected for a plurality of devices in a predetermined area over a predetermined period of time. The information may include device location information and device information for one or more user groups. Each user group may include one or more users. An average preference for each user group may be determined for each device based on the device usage information. The plurality of devices may be organized automatically into a plurality of clusters based on at least the determined average preferences. Each cluster may include one or more devices.
摘要:
A system and related method for selectively controlling use of functionality in one or more multifunction devices includes a communications interface in operative communication with a printer and/or a display apparatus. The communications interfaces electronically communicates data with the printer and/or the display apparatus. The data includes a data structure configured to present an advertisement. The data may be print data or image data. The communications interface is operatively associated with an advertisement-credit component which can generate a credit value corresponding to the advertisement. The advertisement-credit component can subsidize multi-function device costs based upon a presented advertisement, e.g., the cost of printing, scanning, faxing, and related consumables including inks, toners and/or papers. The advertisement credit component authorizes use of a predetermined amount of a function with the credit value that corresponds to the advertisement.