Invention Application
- Patent Title: DATA LABELING FOR TRAINING ARTIFICIAL INTELLIGENCE SYSTEMS
-
Application No.: PCT/US2022/024750Application Date: 2022-04-14
-
Publication No.: WO2022221488A2Publication Date: 2022-10-20
- Inventor: LAHLOU, Tarek Aziz , DELAUNAY, Megan Lynn , FYOCK, Corey Jonathan , BABINSKY, Erin
- Applicant: CAPITAL ONE SERVICES, LLC
- Applicant Address: 1680 Capital One Drive
- Assignee: CAPITAL ONE SERVICES, LLC
- Current Assignee: CAPITAL ONE SERVICES, LLC
- Current Assignee Address: 1680 Capital One Drive
- Agency: KELLY, Scott M.
- Priority: US17/230,760 2021-04-14
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N20/00 ; G06N3/084 ; G06N5/04
Abstract:
Systems, apparatuses, and methods are described for data labeling for training artificial intelligence systems. A candidate dataset comprising data samples and corresponding labels may be used to update an incumbent dataset comprise data samples and corresponding labels. The integrity of a data sample-label pair in the candidate dataset may be determined before the data sample-label pair is added to the incumbent dataset. For determining labeling integrity, a plurality of machine classifiers may be trained based on the incumbent dataset and portions of the candidate dataset. The plurality of machine classifiers as trained may be used to generate predicted labels for data samples in the candidate dataset. The integrity of the data sample-label pair in the candidate dataset may be measured based on the predicted labels for the data sample.
Information query