- Patent title: In-querying data cleansing with semantic standardization
- Application number: US13493945
- Filing date: 2012-06-11
- Publication (announcement) number: US10120916B2
- Publication (announcement) date: 2018-11-06
- Inventors: Tanveer A. Faruquie, Mukesh K. Mohania, L. Venkata Subramaniam, Charles D. Wolfson
- Applicants: Tanveer A. Faruquie, Mukesh K. Mohania, L. Venkata Subramaniam, Charles D. Wolfson
- Applicant address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current assignee: International Business Machines Corporation
- Current assignee address: US NY Armonk
- Agency: Konrad, Raynes, Davda & Victor LLP
- Agent: Janaki K. Davda
- Main classification: G06F17/30
- IPC classification: G06F17/30
Abstract:
The present invention relates to data cleansing, and in particular performing the semantic standardization process within a database before the transform portion of the extract-transform-load (ETL) process. Provided are a method, system and computer program product for standardizing data within a database engine, configuring the standardization function to determine at least one standardized value for at least one data value by applying the standardization table in a context of at least one data value, receiving a database query identifying the standardization function, at least one database value and the context of the data, and invoking the standardization function.
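The idea in the abstract, a standardization function that lives inside the database engine and is invoked directly from a query, can be illustrated with a minimal sketch. This is not the patented implementation: it uses SQLite's user-defined function mechanism as a stand-in for a database engine, and the names `standardize`, `STANDARDIZATION_TABLE`, `tokens`, and `demo`, as well as the sample values, are hypothetical.

```python
import sqlite3

# Hypothetical standardization table: (context, raw value) -> standardized value.
# The same raw value can map to different results depending on context.
STANDARDIZATION_TABLE = {
    ("street", "st"): "Street",
    ("title", "st"): "Saint",
    ("street", "dr"): "Drive",
    ("title", "dr"): "Doctor",
}


def standardize(value, context):
    """Return the standardized form of `value` in `context`, or `value` unchanged."""
    if value is None:
        return None
    key = (context, value.strip().lower())
    return STANDARDIZATION_TABLE.get(key, value)


def demo():
    conn = sqlite3.connect(":memory:")
    # Register the function with the engine so queries can name it directly.
    conn.create_function("standardize", 2, standardize)
    conn.execute("CREATE TABLE tokens (raw TEXT, context TEXT)")
    conn.executemany(
        "INSERT INTO tokens VALUES (?, ?)",
        [("St", "street"), ("St", "title"), ("Dr", "street"), ("Ave", "street")],
    )
    # The query identifies the function, the data value, and its context.
    rows = conn.execute(
        "SELECT raw, context, standardize(raw, context) FROM tokens ORDER BY rowid"
    ).fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    for row in demo():
        print(row)
```

In this sketch the cleansing happens at query time, so the stored values stay untouched and values with no entry in the table pass through unchanged.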
Published/granted documents
- US20130332407A1 IN-QUERYING DATA CLEANSING WITH SEMANTIC STANDARDIZATION Publication/grant date: 2013-12-12