Invention Application
- Patent Title: Supporting Database Constraints in Synthetic Data Generation Based on Generative Adversarial Networks
-
Application No.: US17321709Application Date: 2021-05-17
-
Publication No.: US20220374682A1Publication Date: 2022-11-24
- Inventor: Anisoara NICA , Wanxin LI
- Applicant: SAP SE
- Applicant Address: DE Walldorf
- Assignee: SAP SE
- Current Assignee: SAP SE
- Current Assignee Address: DE Walldorf
- Main IPC: G06N3/04
- IPC: G06N3/04 ; G06F16/2455 ; G06N3/08

Abstract:
Disclosed herein are system, method, and computer program product embodiments for generating synthetic data records with database constraints using generative adversarial networks (GAN). The method can include training, by using a generator loss function, a generator neural network of a generator model of the GAN to generate a scaling factor and a cluster vector for a datum of a continuous variable of a continuous column of a data table, and a datum for a categorical variable of a categorical column of the data table. The generator loss function includes a penalty component determined based on a set of data constraints related to the continuous column or the categorical column.
Information query