Abstract:
Aspects of the present disclosure relate to graphical representation of an output. A server provides for presentation, at a display device, of a first graphical representation of an output for a class. The server receives a selection of a time range for the class. The server determines, based on the output for the class during the selected time range, a baseline output. The server provides for presentation, in response to the selection of the time range, of a second graphical representation of the output, during the time range, for a family from the class, the second graphical representation indicating a difference between the output for the family and an output value for the family predicted based on the baseline output.
Abstract:
A computer system identifies malicious Uniform Resource Locator (URL) data items from a plurality of unscreened data items that have not been previously identified as associated with malicious URLs. The system can execute a number of pre-filters to identify a subset of URLs in the plurality of data items that are likely to be malicious. A scoring processor can score the subset of URLs based on a plurality of input vectors using a suitable machine learning model. Optionally, the system can execute one or more post-filters on the score data to identify data items of interest. Such data items can be fed back into the system to improve machine learning or can be used to provide a notification that a particular resource within a local network is infected with malicious software.
Abstract:
A computer system identifies malicious Uniform Resource Locator (URL) data items from a plurality of unscreened data items that have not been previously identified as associated with malicious URLs. The system can execute a number of pre-filters to identify a subset of URLs in the plurality of data items that are likely to be malicious. A scoring processor can score the subset of URLs based on a plurality of input vectors using a suitable machine learning model. Optionally, the system can execute one or more post-filters on the score data to identify data items of interest. Such data items can be fed back into the system to improve machine learning or can be used to provide a notification that a particular resource within a local network is infected with malicious software.
Abstract:
In various embodiments, systems, methods, and techniques are disclosed for generating a collection of clusters of related data from a seed. Seeds may be generated based on seed generation strategies or rules. Clusters may be generated by, for example, retrieving a seed, adding the seed to a first cluster, retrieving a clustering strategy or rules, and adding related data and/or data entities to the cluster based on the clustering strategy. Various cluster scores may be generated based on attributes of data in a given cluster. Further, cluster metascores may be generated based on various cluster scores associated with a cluster. Clusters may be ranked based on cluster metascores. Various embodiments may enable an analyst to discover various insights related to data clusters, and may be applicable to various tasks including, for example, tax fraud detection, beaconing malware detection, malware user-agent detection, and/or activity trend detection, among various others.
Abstract:
Systems and methods are provided for analyzing entity performance. In one implementation, a method is provided that includes receiving a request with one or more filter selections and accessing a data structure comprising a plurality of categories of information showing interactions associated with multiple entities. The method also comprises identifying a set of categories of the plurality of categories within the data structure based on the one or more filter selections. The method further comprises processing the information of the identified categories to analyze a performance of one or more entities of the multiple entities in accordance with the one or more filter selections and providing the processed information to display the performance of the one or more entities on a user interface.
Abstract:
Embodiments of the present disclosure relate to a computer system and interactive user interfaces configured to enable efficient and rapid access to multiple different data sources simultaneously, and by an unskilled user. The unskilled user may provide simple and intuitive search terms to the system, and the system may thereby automatically query multiple related data sources of different types and present results to the user. Data sources in the system may be efficiently interrelated with one another by way of a mathematical graph in which nodes represent data sources and/or portions of data sources (for example, database tables), and edges represent relationships among the data sources and/or portions of data sources. For example, edges may indicate relationships between particular rows and/or columns of various tables. The table graph enables a compact and memory efficient storage of relationships among various disparate data sources.
Abstract:
Embodiments of the present disclosure relate to a computer system and interactive user interfaces configured to enable efficient and rapid access to multiple different data sources simultaneously, and by an unskilled user. The unskilled user may provide simple and intuitive search terms to the system, and the system may thereby automatically query multiple related data sources of different types and present results to the user. Data sources in the system may be efficiently interrelated with one another by way of a mathematical graph in which nodes represent data sources and/or portions of data sources (for example, database tables), and edges represent relationships among the data sources and/or portions of data sources. For example, edges may indicate relationships between particular rows and/or columns of various tables. The table graph enables a compact and memory efficient storage of relationships among various disparate data sources.
Abstract:
Various systems and methods are provided that retrieve data measured by various sensors associated with a sensor collection site and compress the retrieved data so that the compressed data can be transported to and stored in one or more databases in an efficient manner. The various systems and methods may further retrieve the data stored in the one or more databases and generate an interactive user interface to display the data. While each instance of measured data may include a plurality of parameters, the various systems and methods may flatten the measured data such that information pertaining to each parameter can be displayed in a two-dimensional graph. Features may be extracted from the stored data to identify events that have occurred or that may occur in the future during operation of the sensor collection site such that a user can take appropriate action.
Abstract:
Various systems and methods are provided that retrieve data measured by various sensors associated with a sensor collection site and compress the retrieved data so that the compressed data can be transported to and stored in one or more databases in an efficient manner. The various systems and methods may further retrieve the data stored in the one or more databases and generate an interactive user interface to display the data. While each instance of measured data may include a plurality of parameters, the various systems and methods may flatten the measured data such that information pertaining to each parameter can be displayed in a two-dimensional graph. Features may be extracted from the stored data to identify events that have occurred or that may occur in the future during operation of the sensor collection site such that a user can take appropriate action.
Abstract:
Systems and methods are provided for storing data representing respective sub-elements of a complex task. Data representing one or more links between two or more sub-elements is stored, the links indicating a dependency between said sub-elements. A work order is calculated based on the identified links. A graphical representation of the calculated work order which indicates said sub-elements and their dependencies is provided. The links may indicate a temporal dependency of a second sub-element on a first sub-element and in which the provided graphical representation presents the temporal relationship of the sub-elements. Historical data may be received for association with one or more selected links or sub-elements, the historical data related to a prior event and which affects the temporal relationship between the sub-elements. An updated work order modified by the historical data may be calculated. An updated graphical representation of the work order may be provided.