Publication No.: US20230153448A1
Publication date: 2023-05-18
Application No.: US17525744
Filing date: 2021-11-12
Applicant: ADOBE INC.
Inventor: Subrata Mitra, Sunny Dhamnani, Piyush Bagad, Raunak Gautam, Haresh Khanna, Atanu R. Sinha
CPC classification number: G06F21/62, G06K9/6256, G06N3/0454
Abstract: Methods and systems are provided for facilitating generation of representative datasets. In embodiments, an original dataset for which a data representation is to be generated is obtained. A data generation model is trained to generate a representative dataset that represents the original dataset. The data generation model is trained based on the original dataset, a set of privacy settings indicating privacy of data associated with the original dataset, and a set of value settings indicating value of data associated with the original dataset. A representative dataset that represents the original dataset is generated via the trained data generation model. The generated representative dataset maintains a set of desired statistical properties of the original dataset, maintains an extent of data privacy of the original dataset, and maintains an extent of data value of the original dataset.