Invention Application
- Patent Title: INTERACTIVE FEATURE ENGINEERING IN AUTOMATIC MACHINE LEARNING WITH DOMAIN KNOWLEDGE
-
Application No.: US17317242Application Date: 2021-05-11
-
Publication No.: US20220366269A1Publication Date: 2022-11-17
- Inventor: Dakuo Wang , Udayan Khurana , Daniel Karl I. Weidele , Arunima Chaudhary , Carolina Maria Spina , Abel Valente , Chuang Gan , Horst Cornelius Samulowitz , Lisa Amini
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Main IPC: G06N5/02
- IPC: G06N5/02 ; G06K9/62 ; G06N20/00

Abstract:
A dataset including features and values associated with the features can be received. Each of the features in the dataset can be mapped to a corresponding node in a knowledge graph based on the concept represented by the corresponding node. The knowledge graph can be traversed to find a candidate node connected to at least one mapped node, the candidate node not being mapped to a feature in the dataset. A concept associated with the candidate node can be identified as a new feature. A machine learning model pipeline can use the features in the dataset and the new feature to select a subset of features for training a machine learning model.
Information query