摘要:
A system, method and computer program product provides a random walk model with heterogeneous graphs to leverage multiple source data and accomplish prediction tasks. The system and method components include: 1) A heterogeneous graph formulation including heterogeneous instances of abstract objects as graph nodes and multiple relations as edges connecting those nodes. The different types of relations, such as client-vendor relation and client-product relation, are often quantified as the weights of edges connecting those entities; 2) To accomplish prediction tasks with such information, launching a multi-stage random walk model over the heterogeneous graph. The random walk within a subgraph with homogenous nodes usually produces the relevance between entities of the same type. The random walk across different type of nodes provides the prediction of decisions, such as a client purchasing a product.
摘要:
A system and method for extending partially labeled data graphs to unlabeled nodes in a single network classification by weighting the data with a weight matrix that uses a modified graph Laplacian based regularization framework and applying graph transduction methods to the weighted data. The technique may be applied to data graphs that are directed or undirected, that may or may not have attributes and that may be homogeneous or heterogeneous.
摘要:
In an approach for post-modeling data visualization and analysis, a processor presents a first visualization of a training dataset in a first plot. Responsive to receiving a selection of a data group of the training dataset to analyze, a processor identifies three or fewer key model features of the data group of the training dataset. A processor ascertains a representative record of each key model feature of the three or fewer key model features using a Local Interpretable Model-Agnostic Explanation technique. A processor presents a second visualization of the three or fewer key model features and the representative record of each key model feature in a second plot.
摘要:
A method, system, and computer program product for recommending an initial database security model. The method may include identifying a plurality of nodes connected to a security network. The method may also include analyzing security characteristics of each node of the plurality of nodes. The method may also include identifying, from the security characteristics, key factors for each node. The method may also include calculating similarities between each node of the plurality of nodes. The method may also include building a self-organized centerless network across the plurality of nodes by grouping nodes with high similarities based on the similarities between each node, where the self-organized centerless network is a centerless network without a central management server, and includes groups of nodes from the plurality of nodes. The method may also include generating federated security models for the groups of nodes.
摘要:
Embodiments of the present disclosure relate to a method, system and computer program product for semantic search based on a graph database. In some embodiments, a method is disclosed. According to the method, the user jobs of a user are obtained from a first software product. Based on the user jobs, target test cases are selected from a plurality of test cases associated with the first software product and a second software product. The target test cases are applied to the first software product and the second software product, and in accordance with a determination that a result of applying the target test cases satisfies a predetermined criterion, an instruction is provided to indicate migrating from the first software product to the second software product. In other embodiments, a system and a computer program product are disclosed.
摘要:
Obtain, at a computing device, a segment of computer code. With a classification module of a machine learning system executing on the computing device, determine a required annotation category for the segment of computer code. With an annotation generation module of the machine learning system executing on the computing device, generate a natural language annotation of the segment of computer code based on the segment of computer code and the required annotation category. Provide the natural language annotation to a user interface for display adjacent the segment of computer code.
摘要:
The present disclosure relates to privacy protection in a search process. According to a method, a target emotion vector is extracted from a search interaction, the target emotion vector representing emotional information in the search interaction. Respective emotion distances between the target emotion vector and respective emotion vectors associated with a plurality of text clusters are determined. The plurality of text clusters is clustered from a dictionary of text elements. A first number of text clusters are selected from the plurality of text clusters based on the determined respective emotion distances. The first number of text clusters have emotion distances larger than at least one unselected text cluster among the plurality of text clusters. A plurality of confused search interactions are constructed for the search interaction based on the first number of text clusters, and the plurality of confused search interactions are performed.
摘要:
In an approach for detecting web browsing subject-oriented event interactions and intelligently organizing web pages based on insights from important interactions for better exploration and efficient management, a processor extracts time series data associated with a plurality of web browsing events based on browsing historical actions of a user. A processor identifies the subject of each web browsing event. A processor determines major events based on the time series data and subjects of the plurality of web browsing events. A processor organizes the plurality of web browsing events based on subject hierarchy and timeline from the time series data. A processor highlights one or more uniform resource locators based on the subject hierarchy and timeline.
摘要:
Systems and computer-implemented methods select a subset of methods to generate data schemas for input data from a list of methods for generating data schemas, based on output of a regression model; generate a candidate schema for each method in the subset of methods to generate data schemas; and generate a master data schema for the input data by merging the candidate schema for each method in the subset of methods to generate data schemas, utilizing predetermined rules.
摘要:
Methods, computer program products, and/or systems are provided that perform the following operations: obtaining a series of indicator diagrams corresponding to strokes of a pumpjack over a specific time duration, dividing each indicator diagram into a plurality of location segments in a direction of location of the rod; obtaining load difference features between upstroke loads and corresponding downstroke loads in the plurality of location segments; identifying a location segment with an abnormal load difference feature based on a time series data of load difference feature corresponding to one of the plurality of location segments, the time series data of load difference feature including a series of data points of load difference feature of the one of the plurality of location segments in time order; and providing an indication of a potential problem based, at least in part, on the identification of the location segment with an abnormal load difference feature.