Merging synopses to determine number of distinct values in large databases
    11.
    发明授权
    Merging synopses to determine number of distinct values in large databases 有权
    合并摘要以确定大型数据库中不同值的数量

    公开(公告)号:US07603339B2

    公开(公告)日:2009-10-13

    申请号:US11796110

    申请日:2007-04-25

    IPC分类号: G06F7/00 G06F17/30 G06F17/00

    摘要: A method and apparatus for merging synopses to determine a database statistic, e.g., a number of distinct values (NDV), is disclosed. The merging can be used to determine an initial database statistic or to perform incremental statistics maintenance. For example, each synopsis can pertain to a different partition, such that merging the synopses generates a global statistic. When performing incremental maintenance, only those synopses whose partitions have changed need to be updated. Each synopsis contains domain values that summarize the statistic. However, the synopses may initially contain domain values that are not compatible with each other. Prior to merging the synopses the domain values in each synopsis is made compatible with the domain values in the other synopses. The adjustment is made such that each synopsis represents the same range of domain values, in one embodiment. After “compatible synopses” are formed, the synopses are merged by taking the union of the compatible synopses.

    摘要翻译: 公开了用于合并概要以确定数据库统计量的方法和装置,例如多个不同值(NDV)。 合并可用于确定初始数据库统计信息或执行增量统计维护。 例如,每个概要可以涉及不同的分区,以便合并概要会生成全局统计量。 执行增量维护时,只需要更新其分区已更改的概要文件。 每个概要包含总结统计量的域值。 但是,这些概要可能最初包含彼此不兼容的域值。 在合并概要之前,每个概要中的域值与其他概要中的域值兼容。 在一个实施例中进行调整,使得每个概要表示相同范围的域值。 在形成“兼容简介”之后,通过兼容兼容简报的合并来合并概要。

    HEALTH MONITOR
    12.
    发明申请
    HEALTH MONITOR 有权
    健康监测

    公开(公告)号:US20090106605A1

    公开(公告)日:2009-04-23

    申请号:US12252128

    申请日:2008-10-15

    IPC分类号: G06F11/00

    摘要: Techniques for proactively and reactively running diagnostic functions. These diagnostic functions help to improve diagnostics of conditions detected in a monitored system and to limit/quarantine the damages caused by the detected conditions. In one embodiment, a health monitor infrastructure is provided that is configured to perform one or more health checks in a monitored system for diagnosing and/or gathering information related to the system. The one or more health checks may be invoked pro-actively on a scheduled basis, reactively in response to a condition detected in the system, or may even be invoked manually by a user such as a system administrator.

    摘要翻译: 主动和反应地运行诊断功能的技术。 这些诊断功能有助于改善在受监控系统中检测到的条件的诊断,并限制/检疫由检测到的条件引起的损害。 在一个实施例中,提供健康监视器基础设施,其被配置为在受监视的系统中执行一个或多个健康检查以诊断和/或收集与该系统有关的信息。 响应于在系统中检测到的状况,或者甚至可以由诸如系统管理员的用户手动地调用一个或多个健康检查,可以在调度的基础上主动地调用。

    Approximating a database statistic
    13.
    发明申请
    Approximating a database statistic 有权
    近似数据库统计

    公开(公告)号:US20080120274A1

    公开(公告)日:2008-05-22

    申请号:US11796102

    申请日:2007-04-25

    IPC分类号: G06F7/00

    摘要: A method and apparatus for approximating a database statistic, such as the number of distinct values (NDV) is provided. To approximate the NDV for a portion of a table, a synopsis of distinct values is constructed. Each value in the portion is mapped to a domain of values. The mapping function is implemented with a uniform hash function, in one embodiment. If the resultant domain value does not exist in the synopsis, the domain value is added to the synopsis. If the synopsis reaches its capacity, a portion of the domain values are discarded from the synopsis. The statistic is approximated based on the number (N) of domain values in the synopsis and the portion of the domain that is represented in the synopsis relative to the size of the domain.

    摘要翻译: 提供了用于近似数据库统计量的方法和装置,例如不同值(NDV)的数量。 为了近似表的一部分的NDV,构建了不同值的概要。 该部分中的每个值都映射到值的域。 在一个实施例中,映射功能是用均匀散列函数实现的。 如果在概要中不存在结果域值,则将域值添加到概要中。 如果概要达到其容量,则域值的一部分将从摘要中被丢弃。 统计量基于概要中的域值的数量(N)和在概要中相对于域的大小表示的域的部分近似。

    Unified database and text retrieval system
    15.
    发明授权
    Unified database and text retrieval system 有权
    统一数据库和文本检索系统

    公开(公告)号:US06681222B2

    公开(公告)日:2004-01-20

    申请号:US09906502

    申请日:2001-07-16

    IPC分类号: G06F1730

    摘要: A unified database/text retrieval system converts exact database type queries into text inclusion type queries suitable for text retrieval systems through the use of pseudo keywords. Boolean combination of the text inclusion type query elements may be readily manipulated for optimization and applied to a unified index for rapid search results. Absolute relevance values and relevance multiplier values may be added to the query elements to provide a relevance-based sorting not only of text but also of exact match type search results. Relevance values may be deduced automatically from a variety of sources.

    摘要翻译: 统一的数据库/文本检索系统通过使用伪关键字将精确的数据库类型查询转换为适合文本检索系统的文本包含类型查询。 文本包含类型查询元素的布尔组合可以容易地被操纵以用于优化并应用于用于快速搜索结果的统一索引。 可以将绝对相关性值和相关性乘数值添加到查询元素中,以提供不仅文本的相关性排序,而且还提供精确匹配类型搜索结果的基于关联的排序。 相关性值可以从各种来源自动推导出来。