Adapting data quality rules based upon user application requirements
    1.
    Granted invention patent
    Adapting data quality rules based upon user application requirements (status: in force)

    Publication No.: US09323814B2

    Publication date: 2016-04-26

    Application No.: US13552103

    Filing date: 2012-07-18

    IPC classification: G06F17/30

    Abstract: During application of data quality rules to a data set obtained from a data source, data is retrieved from the data source along with a common set of rules configured to format the retrieved data in a manner in accordance with one or more predefined data quality rules of the common set of rules. At least one predefined data quality rule is adjusted utilizing at least one editable widget to form a modified set of data quality rules adapted for use with a specified application. The modified set of data quality rules is applied to the retrieved data.
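
The flow in this abstract (retrieve data with a common rule set, adjust a rule for one application, apply the modified set) can be illustrated with a minimal sketch. This is not the patented implementation: the `Rule` class, the `max_length` parameter, and the `overrides` dictionary standing in for the editable widget are all assumptions made for illustration.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Rule:
    """Hypothetical formatting rule: trim a field and cap its length."""
    name: str
    field: str
    max_length: int  # the parameter a user could edit per application

    def apply(self, record: dict) -> dict:
        value = str(record.get(self.field, "")).strip()
        return {**record, self.field: value[: self.max_length]}


# Common rule set shipped alongside the data (assumed values).
COMMON_RULES = [Rule("trim_name", "name", 10), Rule("trim_city", "city", 20)]


def adjust_rules(rules, overrides):
    """Return a modified rule set; `overrides` maps rule name -> new max_length.

    The dictionary is a stand-in for values a user would enter in a widget.
    """
    return [replace(r, max_length=overrides.get(r.name, r.max_length)) for r in rules]


def apply_rules(rules, records):
    """Apply every rule, in order, to every record."""
    cleaned = []
    for rec in records:
        for rule in rules:
            rec = rule.apply(rec)
        cleaned.append(rec)
    return cleaned


records = [{"name": "  Alexandria Smith ", "city": "Boston"}]
modified = adjust_rules(COMMON_RULES, {"trim_name": 5})
cleaned = apply_rules(modified, records)
print(cleaned)
```

The common rule set is left untouched, so each application can derive its own modified copy from the same baseline.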

    Cleansing a database system to improve data quality
    2.
    Granted invention patent
    Cleansing a database system to improve data quality (status: in force)

    Publication No.: US09104709B2

    Publication date: 2015-08-11

    Application No.: US13422280

    Filing date: 2012-03-16

    IPC classification: G06F7/00 G06F17/00 G06F17/30

    Abstract: According to one embodiment of the present invention, a system controls cleansing of data within a database system, and comprises a computer system including at least one processor. The system receives a data set from the database system, and one or more features of the data set are selected for determining values for one or more characteristics of the selected features. The determined values are applied to a data quality estimation model to determine data quality estimates for the data set. Problematic data within the data set are identified based on the data quality estimates, where the cleansing is adjusted to accommodate the identified problematic data. Embodiments of the present invention further include a method and computer program product for controlling cleansing of data within a database system in substantially the same manner described above.
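
As a rough illustration of the pipeline this abstract describes (select features, compute characteristics, feed them to an estimation model, flag problematic data, adjust cleansing), the sketch below uses a single characteristic and a deliberately trivial model. The missing-value ratio, the linear `estimate_quality` function, and the 0.8 threshold are assumptions chosen for the example; the abstract does not specify any of them.

```python
def characteristics(values):
    """Compute one characteristic of a feature: the share of missing values."""
    missing = sum(1 for v in values if v is None or v == "")
    return {"missing_ratio": missing / len(values) if values else 0.0}


def estimate_quality(ch):
    """Hypothetical estimation model: quality drops linearly with missing data."""
    return 1.0 - ch["missing_ratio"]


def find_problematic(dataset, features, threshold=0.8):
    """Return per-feature quality estimates and the features below the threshold."""
    estimates = {f: estimate_quality(characteristics(dataset[f])) for f in features}
    problematic = [f for f in features if estimates[f] < threshold]
    return estimates, problematic


dataset = {
    "email": ["a@x.org", None, "", "b@x.org"],
    "country": ["US", "US", "DE", "FR"],
}
estimates, problematic = find_problematic(dataset, ["email", "country"])

# Adjust cleansing: spend effort only on the features flagged as problematic.
cleansing_plan = {f: ("cleanse" if f in problematic else "skip") for f in dataset}
print(estimates, cleansing_plan)
```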

    System and method for building a cloud aware massive data analytics solution background
    4.
    Granted invention patent
    System and method for building a cloud aware massive data analytics solution background (status: in force)

    Publication No.: US08874600B2

    Publication date: 2014-10-28

    Application No.: US12697236

    Filing date: 2010-01-30

    IPC classification: G06F17/30

    CPC classification: G06F17/30424

    Abstract: Embodiments of the invention provide data management solutions that go beyond the traditional warehousing system to support advanced analytics. Furthermore, embodiments of the invention relate to systems and methods for extracting data from an existing data warehouse, storing the extracted data in a reusable (intermediate) form using data parallel and compute parallel techniques over cloud, query processing over the data with/without compute parallel techniques, and providing querying using high level querying languages.

    DISCOVERING A REPORTING MODEL FROM AN EXISTING REPORTING ENVIRONMENT
    6.
    Invention patent application
    DISCOVERING A REPORTING MODEL FROM AN EXISTING REPORTING ENVIRONMENT (status: in force)

    Publication No.: US20130265326A1

    Publication date: 2013-10-10

    Application No.: US13438903

    Filing date: 2012-04-04

    IPC classification: G09G5/377

    CPC classification: G06Q10/0639

    Abstract: Computer software is disclosed for discovering and representing a reporting model of an existing reporting environment. For each report in a plurality of reports, the software searches metadata of the report for descriptive information and dependencies on other reports. The software depicts, in a graphical representation, each report and relationships between the reports.
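
One way to picture the idea in this abstract is to scan each report's metadata for a description and for dependencies, then emit a graph. The sketch below is only an illustration: the metadata layout (`description`, `depends_on`), the report names, and the choice of Graphviz DOT text as the graphical representation are all assumptions, not details from the application.

```python
# Hypothetical report metadata; real environments store this in their own formats.
reports = {
    "sales_summary": {"description": "Monthly sales", "depends_on": ["sales_detail"]},
    "sales_detail": {"description": "Line items", "depends_on": []},
    "exec_dashboard": {"description": "KPIs", "depends_on": ["sales_summary"]},
}


def discover_model(reports):
    """Collect descriptive info (nodes) and report-to-report dependencies (edges)."""
    nodes = {name: meta.get("description", "") for name, meta in reports.items()}
    edges = [
        (name, dep)
        for name, meta in reports.items()
        for dep in meta.get("depends_on", [])
        if dep in reports  # ignore references to reports we did not scan
    ]
    return nodes, edges


def to_dot(nodes, edges):
    """Render the model as Graphviz DOT text, one node per report."""
    lines = ["digraph reports {"]
    for name, desc in sorted(nodes.items()):
        lines.append(f'  "{name}" [label="{name}\\n{desc}"];')
    for src, dst in sorted(edges):
        lines.append(f'  "{src}" -> "{dst}";')
    lines.append("}")
    return "\n".join(lines)


nodes, edges = discover_model(reports)
dot = to_dot(nodes, edges)
print(dot)
```

The resulting text can be passed to any DOT renderer to draw the reports and the relationships between them.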

    Cleansing a Database System to Improve Data Quality
    7.
    Invention patent application
    Cleansing a Database System to Improve Data Quality (status: under examination, published)

    Publication No.: US20120150825A1

    Publication date: 2012-06-14

    Application No.: US12966281

    Filing date: 2010-12-13

    IPC classification: G06F17/30

    Abstract: According to one embodiment of the present invention, a system controls cleansing of data within a database system, and comprises a computer system including at least one processor. The system receives a data set from the database system, and one or more features of the data set are selected for determining values for one or more characteristics of the selected features. The determined values are applied to a data quality estimation model to determine data quality estimates for the data set. Problematic data within the data set are identified based on the data quality estimates, where the cleansing is adjusted to accommodate the identified problematic data. Embodiments of the present invention further include a method and computer program product for controlling cleansing of data within a database system in substantially the same manner described above.

    ANALYZING XML DATA
    10.
    Invention patent application
    ANALYZING XML DATA (status: lapsed)

    Publication No.: US20110125729A1

    Publication date: 2011-05-26

    Application No.: US12624315

    Filing date: 2009-11-23

    IPC classification: G06F17/30

    CPC classification: G06F17/30908

    Abstract: Embodiments of the invention disclose a method, a system and a computer program product of discovering automated insights in XML data by generating a query result in response to querying data using a query, wherein the data is in a markup language format, and identifying a pattern associated with the query result, wherein the data in the markup language format is used for pattern identification.
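
The two steps named in this abstract, querying markup-language data and then identifying a pattern in the query result, can be sketched with Python's standard library. The sample XML, the path expression, and the use of "most frequent value" as the pattern are assumptions for illustration; the abstract does not say which query language or pattern-mining technique is used.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical XML data set.
XML = """<orders>
  <order><customer>acme</customer><item>bolt</item></order>
  <order><customer>zenith</customer><item>nut</item></order>
  <order><customer>acme</customer><item>bolt</item></order>
</orders>"""


def query(xml_text, path):
    """Run a simple ElementTree path query and return the matched text values."""
    root = ET.fromstring(xml_text)
    return [el.text for el in root.findall(path)]


def most_frequent(values):
    """Identify a simple pattern: the value that occurs most often, with its count."""
    return Counter(values).most_common(1)[0]


items = query(XML, "./order/item")
pattern = most_frequent(items)
print(items, pattern)
```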
