DETECTING FACTUAL INCONSISTENCIES BETWEEN A DOCUMENT AND A FACT-BASE
    21.
    Patent application
    Status: in force

    Publication No.: US20100332424A1

    Publication date: 2010-12-30

    Application No.: US12494399

    Filing date: 2009-06-30

    IPC classification: G06F15/18, G06N5/02

    CPC classification: G06N5/02

    Abstract: Techniques for identifying one or more inconsistencies between an unstructured document and a back-end fact-base are provided. The techniques include automatically parsing a query document and comparing the document with a back-end fact-base comprising facts relevant to the document, identifying one or more inconsistencies between information mentioned in the document and the facts stored in the back-end fact-base, and providing a response to the query document, wherein the response additionally includes the one or more identified inconsistencies.

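
The parse, compare, and respond steps named in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the patent: the fact-base is a plain dictionary, the "parser" is a single regular expression, and the names `FACT_BASE`, `extract_claims`, `find_inconsistencies`, and `respond` are hypothetical.

```python
import re

# Hypothetical back-end fact-base: (entity, attribute) -> stored value.
FACT_BASE = {
    ("order 1001", "status"): "shipped",
    ("order 1001", "amount"): "250",
}

# Toy extraction pattern for claims of the form "<entity> <attribute> is <value>".
CLAIM_PATTERN = re.compile(r"(order \d+) (status|amount) is (\w+)", re.IGNORECASE)


def extract_claims(document):
    """Parse the unstructured document into (entity, attribute, value) claims."""
    return [
        (entity.lower(), attribute.lower(), value.lower())
        for entity, attribute, value in CLAIM_PATTERN.findall(document)
    ]


def find_inconsistencies(document, fact_base):
    """Compare each claim with the fact-base and collect the mismatches."""
    issues = []
    for entity, attribute, claimed in extract_claims(document):
        stored = fact_base.get((entity, attribute))
        # Claims with no matching fact are skipped rather than flagged.
        if stored is not None and stored != claimed:
            issues.append(
                {"entity": entity, "attribute": attribute,
                 "claimed": claimed, "stored": stored}
            )
    return issues


def respond(document, fact_base):
    """Build a response that also carries the identified inconsistencies."""
    return {
        "reply": "Your query has been received.",
        "inconsistencies": find_inconsistencies(document, fact_base),
    }


query = "Order 1001 status is pending. Order 1001 amount is 250."
result = respond(query, FACT_BASE)
```

A real system would presumably replace the regular expression with proper information extraction; the sketch only shows where the comparison against stored facts sits in the flow.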

    Adapting data quality rules based upon user application requirements
    23.
    Granted patent
    Status: in force

    Publication No.: US09323814B2

    Publication date: 2016-04-26

    Application No.: US13552103

    Filing date: 2012-07-18

    IPC classification: G06F17/30

    Abstract: During application of data quality rules to a data set obtained from a data source, data is retrieved from the data source along with a common set of rules configured to format the retrieved data in a manner in accordance with one or more predefined data quality rules of the common set of rules. At least one predefined data quality rule is adjusted utilizing at least one editable widget to form a modified set of data quality rules adapted for use with a specified application. The modified set of data quality rules is applied to the retrieved data.

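
One way to picture the adjustment of a common rule set for a specific application is the sketch below. It is an illustration under stated assumptions, not the patented method: the `Rule` fields, the `COMMON_RULES` contents, and the `overrides` dictionary (standing in for values a user might enter through an editable widget) are all hypothetical.

```python
import re
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Rule:
    column: str
    pattern: str      # regular expression the whole value must match
    max_length: int   # longest accepted value


# Hypothetical common rule set shared by all applications.
COMMON_RULES = {
    "phone": Rule("phone", r"\d{10}", 10),
    "zip": Rule("zip", r"\d{5}", 5),
}


def adapt_rules(common, overrides):
    """Return a modified rule set; the common set itself is left unchanged."""
    adapted = dict(common)
    for name, changes in overrides.items():
        adapted[name] = replace(common[name], **changes)
    return adapted


def apply_rules(rows, rules):
    """Return (row index, rule name) for every value that violates a rule."""
    violations = []
    for index, row in enumerate(rows):
        for name, rule in rules.items():
            value = row.get(rule.column, "")
            too_long = len(value) > rule.max_length
            if too_long or not re.fullmatch(rule.pattern, value):
                violations.append((index, name))
    return violations


rows = [{"phone": "5551234567", "zip": "12345-6789"}]

# A hypothetical billing application that accepts ZIP+4 codes.
billing_rules = adapt_rules(
    COMMON_RULES,
    {"zip": {"pattern": r"\d{5}(-\d{4})?", "max_length": 10}},
)

common_violations = apply_rules(rows, COMMON_RULES)
billing_violations = apply_rules(rows, billing_rules)
```

Under the common rules the ZIP+4 value is rejected, while the adapted billing rules accept it. Keeping `Rule` immutable and copying the dictionary means one application's adjustments cannot leak into the shared set.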

    System and method for building a cloud aware massive data analytics solution background
    25.
    Granted patent
    Status: in force

    Publication No.: US08874600B2

    Publication date: 2014-10-28

    Application No.: US12697236

    Filing date: 2010-01-30

    IPC classification: G06F17/30

    CPC classification: G06F17/30424

    Abstract: Embodiments of the invention provide data management solutions that go beyond the traditional warehousing system to support advanced analytics. Furthermore, embodiments of the invention relate to systems and methods for extracting data from an existing data warehouse, storing the extracted data in a reusable (intermediate) form using data parallel and compute parallel techniques over cloud, query processing over the data with/without compute parallel techniques, and providing querying using high level querying languages.

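
The extract, store, and query flow described in the abstract can be sketched at toy scale. This is only an illustration under heavy assumptions: an in-memory SQLite database stands in for the data warehouse, JSON files in a temporary directory stand in for the reusable intermediate form, and a local thread pool stands in for compute-parallel processing over a cloud. The table, column, and function names are all hypothetical.

```python
import json
import sqlite3
import tempfile
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def extract_to_partitions(conn, out_dir, n_parts):
    """Extract rows from the warehouse and store them as JSON partitions."""
    rows = conn.execute("SELECT region, amount FROM sales").fetchall()
    paths = []
    for part in range(n_parts):
        path = Path(out_dir) / f"part-{part}.json"
        # Round-robin split so each partition can be processed independently.
        path.write_text(json.dumps(rows[part::n_parts]))
        paths.append(path)
    return paths


def partial_sum(path):
    """Aggregate one partition: total amount per region."""
    totals = {}
    for region, amount in json.loads(path.read_text()):
        totals[region] = totals.get(region, 0) + amount
    return totals


def total_by_region(paths):
    """Process the partitions in parallel and merge the partial results."""
    merged = {}
    with ThreadPoolExecutor() as pool:
        for part in pool.map(partial_sum, paths):
            for region, amount in part.items():
                merged[region] = merged.get(region, 0) + amount
    return merged


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("east", 10), ("west", 5), ("east", 7), ("west", 3)],
)

with tempfile.TemporaryDirectory() as tmp:
    partitions = extract_to_partitions(conn, tmp, n_parts=2)
    totals = total_by_region(partitions)
```

The stored partitions can be reused by later queries without touching the warehouse again, which is the point of the intermediate form; the high-level query language mentioned in the abstract is not modeled here.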

    DISCOVERING A REPORTING MODEL FROM AN EXISTING REPORTING ENVIRONMENT
    27.
    Patent application
    Status: in force

    Publication No.: US20130265326A1

    Publication date: 2013-10-10

    Application No.: US13438903

    Filing date: 2012-04-04

    IPC classification: G09G5/377

    CPC classification: G06Q10/0639

    Abstract: Computer software is disclosed for discovering and representing a reporting model of an existing reporting environment. For each report in a plurality of reports, the software searches metadata of the report for descriptive information and dependencies on other reports. The software depicts, in a graphical representation, each report and relationships between the reports.

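
The metadata search and graphical depiction can be sketched as follows. The metadata layout (`description`, `depends_on`), the sample reports, and the function names are assumptions made for illustration; the output uses the Graphviz DOT text format as one possible graphical representation, which the abstract does not specify.

```python
# Hypothetical report metadata, keyed by report name.
REPORTS = {
    "sales_summary": {
        "description": "Monthly sales totals",
        "depends_on": ["sales_detail", "fx_rates"],
    },
    "sales_detail": {"description": "Line-item sales", "depends_on": []},
    "fx_rates": {"description": "Currency rates", "depends_on": []},
}


def discover_model(reports):
    """Collect a description per report and the dependencies between reports."""
    nodes = {name: meta.get("description", "") for name, meta in reports.items()}
    edges = [
        (name, dependency)
        for name, meta in reports.items()
        for dependency in meta.get("depends_on", [])
        if dependency in reports  # ignore references to unknown reports
    ]
    return nodes, edges


def to_dot(nodes, edges):
    """Render the model as Graphviz DOT text: one node per report, one edge per dependency."""
    lines = ["digraph reports {"]
    for name, description in sorted(nodes.items()):
        lines.append(f'  "{name}" [label="{name}\\n{description}"];')
    for source, target in sorted(edges):
        lines.append(f'  "{source}" -> "{target}";')
    lines.append("}")
    return "\n".join(lines)


nodes, edges = discover_model(REPORTS)
dot_text = to_dot(nodes, edges)
```

The resulting `dot_text` can be passed to any DOT renderer to draw the reports and the relationships between them.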

    Cleansing a Database System to Improve Data Quality
    28.
    Patent application
    Status: under examination (published)

    Publication No.: US20120150825A1

    Publication date: 2012-06-14

    Application No.: US12966281

    Filing date: 2010-12-13

    IPC classification: G06F17/30

    Abstract: According to one embodiment of the present invention, a system controls cleansing of data within a database system, and comprises a computer system including at least one processor. The system receives a data set from the database system, and one or more features of the data set are selected for determining values for one or more characteristics of the selected features. The determined values are applied to a data quality estimation model to determine data quality estimates for the data set. Problematic data within the data set are identified based on the data quality estimates, where the cleansing is adjusted to accommodate the identified problematic data. Embodiments of the present invention further include a method and computer program product for controlling cleansing of data within a database system in substantially the same manner described above.

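
The sequence in the abstract (select features, measure their characteristics, apply a quality estimation model, identify problematic data) can be sketched as below. The single characteristic used here (ratio of missing values), the trivial linear "model", the 0.8 threshold, and all names are assumptions chosen for illustration; the abstract does not say which characteristics or model are used.

```python
def characteristics(rows, feature):
    """Measure one characteristic of a feature: the ratio of missing values."""
    values = [row.get(feature) for row in rows]
    missing = sum(1 for value in values if value is None or value == "")
    return {"missing_ratio": missing / len(values)}


def estimate_quality(chars):
    """Hypothetical estimation model: quality falls linearly with missing values."""
    return max(0.0, 1.0 - chars["missing_ratio"])


def plan_cleansing(rows, features, threshold=0.8):
    """Estimate quality per selected feature and flag those below the threshold."""
    estimates = {
        feature: estimate_quality(characteristics(rows, feature))
        for feature in features
    }
    problematic = [f for f in features if estimates[f] < threshold]
    return estimates, problematic


rows = [
    {"email": "a@x.com", "city": "Oslo"},
    {"email": "", "city": "Oslo"},
    {"email": None, "city": "Rome"},
    {"email": "b@x.com", "city": "Rome"},
]

estimates, problematic = plan_cleansing(rows, ["email", "city"])
```

In this toy data set half of the `email` values are missing, so only `email` is flagged; a cleansing job could then focus its effort on the flagged features.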