Invention Grant
- Patent Title: Adaptive sampling scheme for imbalanced large scale data
- Application No.: US14933254
- Application Date: 2015-11-05
- Publication No.: US10346861B2
- Publication Date: 2019-07-09
- Inventor: Wei Zhang, Said Kobeissi, Anandhavelu Natarajan, Shiv Kumar Saini, Ritwik Sinha, Scott Allen Tomko
- Applicant: ADOBE INC.
- Applicant Address: US CA San Jose
- Assignee: ADOBE INC.
- Current Assignee: ADOBE INC.
- Current Assignee Address: US CA San Jose
- Agency: Shook, Hardy & Bacon L.L.P.
- Main IPC: G06N3/08
- IPC: G06N3/08; G06Q30/02

Abstract:
Embodiments of the present invention relate to providing business customers with predictive capabilities, such as identifying valuable customers or estimating the likelihood that a product will be purchased. An adaptive sampling scheme is utilized, which helps generate sample data points from large scale data that is imbalanced (for example, digital website traffic with hundreds of millions of visitors, of whom only a small portion are of interest). In embodiments, a stream of sample data points is received. Positive samples are added to a positive list until the desired number of positive samples is reached, and negative samples are added to a negative list until the desired number of negative samples is reached. The positive list and the negative list can then be combined, shuffled, and fed into a prediction model.
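The sampling loop described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the function name `adaptive_sample`, the `(features, label)` tuple layout, the label encoding (1 for positive, 0 for negative), and the choice to stop reading the stream once both lists are full are all assumptions made for illustration.

```python
import random
from typing import Any, Iterable, List, Optional, Tuple

# Assumed sample layout: (features, label), where label 1 = positive, 0 = negative.
Sample = Tuple[Any, int]


def adaptive_sample(
    stream: Iterable[Sample],
    n_pos: int,
    n_neg: int,
    seed: Optional[int] = None,
) -> List[Sample]:
    """Collect up to n_pos positives and n_neg negatives from a stream,
    then combine and shuffle them (hypothetical helper, for illustration)."""
    positives: List[Sample] = []
    negatives: List[Sample] = []

    for features, label in stream:
        if label == 1:
            # Keep positives only until the desired count is reached.
            if len(positives) < n_pos:
                positives.append((features, label))
        elif len(negatives) < n_neg:
            # Keep negatives only until the desired count is reached.
            negatives.append((features, label))

        # Assumption: stop consuming the stream once both lists are full.
        if len(positives) >= n_pos and len(negatives) >= n_neg:
            break

    combined = positives + negatives
    random.Random(seed).shuffle(combined)  # seeded for reproducibility
    return combined


# Usage: a synthetic imbalanced stream where 1 in 50 visitors is positive.
visits = (({"visitor_id": i}, 1 if i % 50 == 0 else 0) for i in range(1000))
training_set = adaptive_sample(visits, n_pos=5, n_neg=10, seed=0)
```

The returned list would then be passed to whatever prediction model is in use. Because the rare class is collected separately, the balance of the training set is set by `n_pos` and `n_neg` rather than by the ratio in the raw traffic.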
Public/Granted literature
- US20170132516A1 ADAPTIVE SAMPLING SCHEME FOR IMBALANCED LARGE SCALE DATA Public/Granted day: 2017-05-11