Invention Grant
- Patent Title: Graph based sampling
- Patent Title (中): 基于图形的抽样
-
Application No.: US11840139Application Date: 2007-08-16
-
Publication No.: US07827123B1Publication Date: 2010-11-02
- Inventor: Jay Yagnik
- Applicant: Jay Yagnik
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Fenwick & West LLP
- Main IPC: G06F15/18
- IPC: G06F15/18

Abstract:
An iterative method of sampling real world event data to generate a subset of data that is used for training a classifier. Graph Based Sampling uses an iterative process of evaluating and adding randomly selected event data sets to a training data set. In Graph Based Sampling, at each iteration, a two event data sets are randomly selected from a stored plurality of event data sets. A proximity function is used to generate a correlation or similarity value between each of these randomly selected real world event data sets, and the current training data set. One of the randomly selected event data sets is then added to the training data set based on the proximity value. This process of selection and addition is repeated until the subset of training set is a pre-determined size.
Information query