Invention Publication
- Patent Title: SYNTHETIC DATASET GENERATOR
- Application No.: US18212629
- Application Date: 2023-06-21
- Publication No.: US20240127075A1
- Publication Date: 2024-04-18
- Inventor: Shalini De Mello, Christian Jacobsen, Xunlei Wu, Stephen Tyree, Alice Li, Wonmin Byeon, Shangru Li
- Applicant: NVIDIA Corporation
- Applicant Address: US CA Santa Clara
- Assignee: NVIDIA Corporation
- Current Assignee: NVIDIA Corporation
- Current Assignee Address: US CA Santa Clara
- Main IPC: G06N3/0985
- IPC: G06N3/0985

Abstract:
Machine learning is a process that learns a model from a given dataset, where the model can then be used to make a prediction about new data. In order to reduce the costs associated with collecting and labeling real world datasets for use in training the model, computer processes can synthetically generate datasets which simulate real world data. The present disclosure improves the effectiveness of such synthetic datasets for training machine learning models used in real world applications, in particular by generating a synthetic dataset that is specifically targeted to a specified downstream task (e.g. a particular computer vision task, a particular natural language processing task, etc.).