Published Patent Application
- Title: DISCOVERING A SEMANTIC MEANING OF DATA FIELDS FROM PROFILE DATA OF THE DATA FIELDS
- Application No.: US18201545
- Filing date: 2023-05-24
- Publication No.: US20230409835A1
- Publication date: 2023-12-21
- Inventors: Christopher Thurston Butler, Timothy Spencer Bush
- Applicant: Ab Initio Technology LLC
- Applicant address: Lexington, MA, US
- Assignee: Ab Initio Technology LLC
- Current assignee: Ab Initio Technology LLC
- Current assignee address: Lexington, MA, US
- Main classification: G06F40/30
- IPC classifications: G06F40/30; G06F16/93; G06N20/00; G06F16/908
Abstract:
A data processing system for discovering a semantic meaning of a field included in one or more data sets is configured to identify a field included in one or more data sets, with the field having an identifier. For that field, the system profiles data values of the field to generate a data profile, accesses a plurality of label proposal tests, and generates a set of label proposals by applying the plurality of label proposal tests to the data profile. The system determines a similarity among the label proposals and selects a classification. The system identifies one of the label proposals as identifying the semantic meaning. The system stores the identifier of the field with the identified one of the label proposals that identifies the semantic meaning.
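
The flow the abstract describes (profile a field, apply label proposal tests, compare the proposals, select a classification, store the result) can be illustrated with a minimal Python sketch. This is not the patented implementation. The function names, the two tests, the labels (`ssn`, `zip_code`), the classification names (`match`, `recommend`, `investigate`, `ignore`), and the agreement thresholds are all assumptions made for illustration.

```python
import re
from collections import Counter


def profile_field(values):
    """Build a small data profile: value count, distinct count, character shapes."""
    # A "shape" replaces every digit with 9 and every letter with A.
    shapes = Counter(
        re.sub(r"[A-Za-z]", "A", re.sub(r"\d", "9", v)) for v in values
    )
    return {"count": len(values), "distinct": len(set(values)), "shapes": shapes}


def shape_test(identifier, profile):
    """Hypothetical test: propose a label from the dominant value shape."""
    if profile["count"] == 0:
        return None
    top_shape, n = profile["shapes"].most_common(1)[0]
    if n / profile["count"] < 0.8:  # assumed threshold
        return None
    return {"999-99-9999": "ssn", "99999": "zip_code"}.get(top_shape)


def name_test(identifier, profile):
    """Hypothetical test: propose a label from the field identifier."""
    name = identifier.lower()
    if "ssn" in name or "social" in name:
        return "ssn"
    if "zip" in name or "postal" in name:
        return "zip_code"
    return None


LABEL_PROPOSAL_TESTS = [shape_test, name_test]


def classify(proposals):
    """Measure agreement among proposals and pick a classification (assumed rules)."""
    if not proposals:
        return "ignore", None
    label, votes = Counter(proposals).most_common(1)[0]
    agreement = votes / len(proposals)
    if agreement == 1.0 and len(proposals) >= 2:
        return "match", label
    if agreement > 0.5:
        return "recommend", label
    return "investigate", None  # conflicting proposals need review


def discover_semantic_label(identifier, values, store):
    """Run the whole flow and store the identifier with its chosen label."""
    profile = profile_field(values)
    proposals = [t(identifier, profile) for t in LABEL_PROPOSAL_TESTS]
    proposals = [p for p in proposals if p is not None]
    classification, label = classify(proposals)
    if label is not None:
        store[identifier] = label
    return classification, label
```

Under these assumptions, a field named `cust_ssn` whose values all look like `123-45-6789` gets the same proposal from both tests and is stored as a `match`. A field whose name and value shapes disagree is classified `investigate` and nothing is stored.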