- Patent title: METHOD TO DETERMINE COLUMNS THAT CONTAIN LOCATION DATA IN A DATA SET
- Application number: US15363617
- Filing date: 2016-11-29
- Publication number: US20180150765A1
- Publication date: 2018-05-31
- Inventors: Shilpi Ahuja, Rafael J.Z. Bastidas, Rashmi Gangadharaiah, Mary A. Roth
- Applicant: International Business Machines Corporation
- Main classification: G06N99/00
- IPC classification: G06N99/00; G06F17/30
Abstract:
A method of identifying location data in a data set comprises generating a data sample from the data set, training a plurality of models with the data sample to identify the location data in the data set, and applying the data set to the trained models to determine the location data within the data set. The plurality of models includes one or more first models to identify primary attributes of the location data indicating a geographical area and one or more second models to identify secondary attributes of the location data used to determine corresponding primary attributes.
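The three steps named in the abstract (sample, train several models, apply them to the data set) can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say what the models are, so `ValueModel`, `ShapeModel`, `draw_sample`, `find_location_columns`, the 0.7 threshold, the example rows, and the choice of city names as a primary attribute and postal codes as a secondary attribute are all hypothetical. The sketch also assumes the sampled columns have been labelled by hand for training.

```python
import random


def draw_sample(rows, k, seed=0):
    """Return up to k rows chosen at random (the 'data sample')."""
    rng = random.Random(seed)
    return rows if len(rows) <= k else rng.sample(rows, k)


def shape(value):
    """Collapse a value to its character shape, e.g. '10504' -> '99999'."""
    return "".join(
        "9" if c.isdigit() else "A" if c.isalpha() else c for c in value
    )


class ValueModel:
    """Toy 'primary attribute' model (assumption): remembers known values."""

    def fit(self, values):
        self.known = {v.strip().lower() for v in values}
        return self

    def score(self, column):
        # Fraction of column values seen during training.
        return sum(v.strip().lower() in self.known for v in column) / len(column)


class ShapeModel:
    """Toy 'secondary attribute' model (assumption): remembers value shapes."""

    def fit(self, values):
        self.shapes = {shape(v) for v in values}
        return self

    def score(self, column):
        # Fraction of column values whose shape was seen during training.
        return sum(shape(v) in self.shapes for v in column) / len(column)


def find_location_columns(rows, models, threshold=0.7):
    """Apply every trained model to every column; keep columns that pass."""
    found = {}
    for name in rows[0]:
        column = [row[name] for row in rows]
        for label, model in models.items():
            if model.score(column) >= threshold:
                found[name] = label
    return found


# Hypothetical data set: one non-location column and two location columns.
rows = [
    {"customer": "Ann", "town": "Armonk", "zip": "10504"},
    {"customer": "Raj", "town": "Austin", "zip": "78701"},
    {"customer": "Mei", "town": "San Jose", "zip": "95110"},
    {"customer": "Tom", "town": "Austin", "zip": "78758"},
]

# Step 1: generate a data sample. Step 2: train the models on it
# (the column labels used here are assumed to be supplied by hand).
sample = draw_sample(rows, k=3)
models = {
    "primary:city": ValueModel().fit([r["town"] for r in sample]),
    "secondary:postal_code": ShapeModel().fit([r["zip"] for r in sample]),
}

# Step 3: apply the full data set to the trained models.
result = find_location_columns(rows, models)
print(result)
```

Keeping one small model per attribute type mirrors the abstract's split between first models (primary attributes) and second models (secondary attributes); a real system would presumably use learned classifiers rather than these lookup-style stand-ins.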