IN-QUERYING DATA CLEANSING WITH SEMANTIC STANDARDIZATION
    2.
    发明申请
    IN-QUERYING DATA CLEANSING WITH SEMANTIC STANDARDIZATION 审中-公开
    使用语义标准进行数据清理

    公开(公告)号:US20130332408A1

    公开(公告)日:2013-12-12

    申请号:US13956024

    申请日:2013-07-31

    IPC分类号: G06F17/30

    CPC分类号: G06F16/254 G06F16/215

    摘要: The present invention relates to data cleansing, and in particular performing the semantic standardization process within a database before the transform portion of the extract-transform-load (ETL) process. Provided are a method, system and computer program product for standardizing data within a database engine, configuring the standardization function to determine at least one standardized value for at least one data value by applying the standardization table in a context of at least one data value, receiving a database query identifying the standardization function, at least one database value and the context of the data, and invoking the standardization function.

    摘要翻译: 本发明涉及数据清理,特别是在提取 - 转换 - 加载(ETL)处理的变换部分之前,在数据库中执行语义标准化处理。 提供了一种用于对数据库引擎内的数据进行标准化的方法,系统和计算机程序产品,通过在至少一个数据值的上下文中应用标准化表来配置标准化功能以确定至少一个数据值的至少一个标准化值, 接收识别标准化功能的数据库查询,至少一个数据库值和数据的上下文以及调用标准化功能。