System and method for relational time series learning with the aid of a digital computer

    公开(公告)号:US11475360B2

    公开(公告)日:2022-10-18

    申请号:US16593065

    申请日:2019-10-04

    摘要: System and methods for relational time-series learning are provided. Unlike traditional time series forecasting techniques, which assume either complete time series independence or complete dependence, the disclosed system and method allow time series forecasting that can be performed on multivariate time series represented as vertices in graphs with arbitrary structures and predicting a future classification for data items represented by one of nodes in the graph. The system and methods also utilize non-relational, relational, temporal data for classification, and allow using fast and parallel classification techniques with linear speedups. The system and methods are well-suited for processing data in a streaming or online setting and naturally handle training data with skewed or unbalanced class labels.

    Fast and accurate graphlet estimation

    公开(公告)号:US11030246B2

    公开(公告)日:2021-06-08

    申请号:US15179724

    申请日:2016-06-10

    摘要: Embodiments of the present invention provide a system for fast, accurate, and scalable unbiased graphlet estimation. The system utilizes neighborhood sampling and combinatorial relations to estimate graphlet counts, statistics, and frequency distributions in a small fraction of the computing time of existing systems. The obtained unbiased estimates are highly accurate, and have applications in the analysis, mining, and predictive modeling of massive real-world networks. During operation, the system obtains data indicating vertices and edges of a graph. The system samples a portion of the graph and counts a number of graph features in the sampled portion of the graph. The system then computes an occurrence frequency of a graphlet pattern and a total number of graphlets associated with the graphlet pattern in the graph.

    SYSTEM AND METHOD FOR RELATIONAL TIME SERIES LEARNING WITH THE AID OF A DIGITAL COMPUTER

    公开(公告)号:US20200034744A1

    公开(公告)日:2020-01-30

    申请号:US16593065

    申请日:2019-10-04

    摘要: System and methods for relational time-series learning are provided. Unlike traditional time series forecasting techniques, which assume either complete time series independence or complete dependence, the disclosed system and method allow time series forecasting that can be performed on multivariate time series represented as vertices in graphs with arbitrary structures and predicting a future classification for data items represented by one of nodes in the graph. The system and methods also utilize non-relational, relational, temporal data for classification, and allow using fast and parallel classification techniques with linear speedups. The system and methods are well-suited for processing data in a streaming or online setting and naturally handle training data with skewed or unbalanced class labels.

    Deep graph representation learning
    34.
    发明授权

    公开(公告)号:US10482375B2

    公开(公告)日:2019-11-19

    申请号:US15802302

    申请日:2017-11-02

    发明人: Ryan Rossi Rong Zhou

    摘要: A method of deep graph representation learning includes: calculating a plurality of base features from a graph and adding the plurality of base features to a feature matrix. The method further includes generating, by a processing device, a current feature layer from the feature matrix and a set of relational feature operators, wherein the current feature layer corresponds to a set of current features, evaluating feature pairs associated with the current feature layer, and selecting a subset of features from the set of current features based on the evaluated feature pairs. The method further includes adding the subset of features to the feature matrix to generate an updated feature matrix.

    System and method for efficient interval search using locality-preserving hashing

    公开(公告)号:US10387495B2

    公开(公告)日:2019-08-20

    申请号:US15179780

    申请日:2016-06-10

    发明人: Rong Zhou

    IPC分类号: G06F16/903 G06F16/901

    摘要: Embodiments of the present invention provide a time- and space-efficient system for representing and searching a set of intervals to find all the intervals that overlap with a given query interval or point. A new structure called an interval hash table is introduced to significantly reduce the average search time, thereby improving computing and search technology. During operation, the system obtains data indicating a set of intervals to be hashed. The system divides a respective interval into a set of sub-intervals based on a locality-preserving hashing. The system then obtains a hash code associated with a respective sub-interval, and inserts the respective sub-interval into an interval hash table at a location corresponding to the hash code. The system may further search the interval hash table.

    System and method for hybrid task management across CPU and GPU for efficient data mining

    公开(公告)号:US10235182B2

    公开(公告)日:2019-03-19

    申请号:US15628153

    申请日:2017-06-20

    IPC分类号: G06F9/38 G06F17/30

    摘要: Embodiments described herein provide a system for facilitating hybrid task management across a central processing unit (CPU) and a graphics processing unit (GPU) of a computer. During operation, the system determines a set of tasks for performing data mining on a data set and storing the set of tasks in a data structure in an ascending order of uniformity associated with a respective task. The uniformity of a task indicates how uneven and skewed the task is compared to other tasks in the set of tasks. The system then allocates a subset of tasks to a core of the CPU from a front of the data structure and a subset of tasks to a core of the GPU from a back of the data structure.

    System and method for compressing graphs via cliques

    公开(公告)号:US10217241B2

    公开(公告)日:2019-02-26

    申请号:US15183561

    申请日:2016-06-15

    摘要: Embodiments of the present invention provide a system for fast parallel graph compression based on identifying a set of large cliques, which is used to encode the graph. The system provides both permanently-stored and in-memory graph encoding and reduces the space needed to represent and store a graph, the I/O traffic to use the graph, and the computation needed to perform algorithms involving the graph. The system thereby improves computing technology and graph computation. During operation, the system obtains data indicating vertices and edges of a graph. The system executes a clique-finding method to identify a maximum clique in the graph. The system then removes the clique from the graph, adds the clique to a set of found cliques, and generates a compressed representation of the graph based on the set of found cliques.

    System and method for speeding up general matrix-matrix multiplication on the GPU

    公开(公告)号:US10073815B2

    公开(公告)日:2018-09-11

    申请号:US15169422

    申请日:2016-05-31

    发明人: Rong Zhou

    IPC分类号: G06F17/16

    CPC分类号: G06F17/16

    摘要: A method and system for performing general matrix-matrix multiplication (GEMM) operations on a graphics processor unit (GPU) using Smart kernels. During operation, the system may generate a set of kernels that includes at least one of a variable-dimension variable-K GEMM kernel, a variable-dimension constant-K GEMM kernel, or a combination thereof. A constant-K GEMM kernel performs computations for matrices with a specific value of K (e.g., the number of columns in a first matrix and the number of rows in a second matrix). Variable-dimension GEMM kernels allow for flexibility in the number of rows and columns used by a thread block to perform matrix multiplication for a sub-matrix. The system may generate rules to select the best (e.g., fastest) kernel for performing computations according to the particular parameter combination of the matrices being multiplied.

    System and method for improved parallel search on bipartite graphs using dynamic vertex-to-processor mapping

    公开(公告)号:US09672557B2

    公开(公告)日:2017-06-06

    申请号:US14052584

    申请日:2013-10-11

    IPC分类号: G06Q30/06 G06Q30/02

    CPC分类号: G06Q30/0631 G06Q30/0282

    摘要: One embodiment of the present invention provides a system for dynamically assigning vertices to processors to generate a recommendation for a customer. During operation, the system receives graph data with customer and product vertices and purchase edges. The system traverses the graph from a customer vertex to a set of product vertices. The system divides the set of product vertices among a set of processors. Subsequently, the system determines a set of product frontier vertices for each processor. The system traverses the graph from the set of product frontier vertices to a set of customer vertices. The system divides the set of customer vertices among a set of processors. Then, the system determines a set of customer frontier vertices for each processor. The system traverses the graph from the set of customer frontier vertices to a set of recommendable product vertices. The system generates one or more product recommendations for the customer.