Processing XML data stream(s) using continuous queries in a data stream management system
    81.
    Patent application
    Status: Under examination (published)

    Publication No.: US20080120283A1

    Publication date: 2008-05-22

    Application No.: US11601415

    Filing date: 2006-11-17

    IPC classification: G06F17/30

    CPC classification: G06F16/83

    Abstract: A computer is programmed to accept queries over streams of data structured as per a predetermined syntax (e.g. defined in XML). The computer is further programmed to execute such queries continually (or periodically) on data streams of tuples containing structured data that conform to the same predetermined syntax. In many embodiments, the computer includes an engine that exclusively processes only structured data, quickly and efficiently. The computer invokes the structured data engine in two different ways depending on the embodiment: (a) directly on encountering a structured data operator, or (b) indirectly by parsing operands within the structured data operator which contain path expressions, creating a new source to supply scalar data extracted from structured data, and generating additional trees of operators that are natively supported, followed by invoking the structured data engine only when the structured data operator in the query cannot be fully implemented by natively supported operators.
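
    The indirect path (b) can be pictured with a minimal Python sketch. It is an illustration under stated assumptions, not the patented implementation: the names `scalar_source` and `continuous_filter`, the `<tick>` documents, and the threshold are all hypothetical. A generator plays the role of the "new source" that extracts a scalar through a path expression, and an ordinary comparison stands in for a natively supported operator.

```python
import xml.etree.ElementTree as ET
from typing import Iterable, Iterator


def scalar_source(stream: Iterable[str], path: str) -> Iterator[float]:
    """Hypothetical 'new source': yields one scalar per XML tuple via a path expression."""
    for xml_text in stream:
        node = ET.fromstring(xml_text).find(path)
        # Tuples that lack the requested element contribute nothing.
        if node is not None and node.text is not None:
            yield float(node.text)


def continuous_filter(stream: Iterable[str], path: str, threshold: float) -> Iterator[float]:
    """Stand-in for a natively supported operator working on plain scalars."""
    for value in scalar_source(stream, path):
        if value > threshold:
            yield value


# A finite list stands in for an unbounded stream of XML tuples.
ticks = [
    "<tick><sym>A</sym><price>10.5</price></tick>",
    "<tick><sym>B</sym><price>99.0</price></tick>",
    "<tick><sym>C</sym></tick>",
]
results = list(continuous_filter(ticks, "price", 50.0))
```

    Because both functions are generators, each tuple is processed as it arrives, which loosely mirrors continual execution over a stream.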

    Knowledge extraction for automatic ontology maintenance
    82.
    Patent application
    Status: Granted

    Publication No.: US20080097968A1

    Publication date: 2008-04-24

    Application No.: US11712346

    Filing date: 2007-02-28

    IPC classification: G06F7/10 G06F12/00

    CPC classification: G06F17/278 G06F17/30731

    Abstract: Systems, methods, and other embodiments associated with extracting knowledge from application data and maintaining an ontology based on the extracted knowledge are described. One example system includes a mapping logic to store mappings between application objects and ontology classes and an information extraction (IE) logic that accesses the mapping logic to identify application data to process based on the mappings. The application data may be stored in application data repositories belonging to an enterprise and may be characterized by the application object. Having identified application data to process, the IE logic may locate data in the application data repositories and selectively manipulate an ontology based on selected application data elements.

    Techniques of optimizing queries using NULL expression analysis
    83.
    Patent application
    Status: Granted

    Publication No.: US20080065674A1

    Publication date: 2008-03-13

    Application No.: US11517969

    Filing date: 2006-09-08

    IPC classification: G06F17/00

    CPC classification: G06F17/30454

    Abstract: A method for optimizing queries is described. A query, which includes a first expression for determining whether each of a set of values evaluates to NULL or NOT NULL, is received. At compile-time, a first representation of the query is generated, where the first representation indicates that the query is to be performed based on the first expression. Based on the first expression, a second expression is determined. The second expression is such that the first expression would evaluate to any NOT NULL value if and only if the second expression would evaluate to any NOT NULL value. The second expression, when evaluated, is much more computationally efficient than the first expression. Based on the first representation, a second representation of the query is generated, where the second representation indicates that the query is to be performed based on the second expression. Thereafter, the query is executed based on the second representation.
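
    The equivalence condition can be illustrated with a small SQLite example. The abstract does not say which expressions are rewritten, so the pair used here is an assumption chosen for illustration: `upper(name)` is NOT NULL exactly when `name` is NOT NULL, so a NULL test on the function call can be replaced by a NULL test on the bare column. The table `t` and its rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (id INTEGER, name TEXT)")
conn.executemany("INSERT INTO t VALUES (?, ?)", [(1, "a"), (2, None), (3, "c")])

# First expression: the NULL test is applied to the result of a function call.
first = conn.execute(
    "SELECT id FROM t WHERE upper(name) IS NOT NULL ORDER BY id"
).fetchall()

# Second expression: cheaper, and NOT NULL if and only if the first one is.
second = conn.execute(
    "SELECT id FROM t WHERE name IS NOT NULL ORDER BY id"
).fetchall()
```

    Both queries select the same rows, which is the property a rewrite of this kind has to preserve.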

    Techniques of optimizing XQuery functions using actual argument type information
    84.
    Patent application
    Status: Granted

    Publication No.: US20070288429A1

    Publication date: 2007-12-13

    Application No.: US11452769

    Filing date: 2006-06-13

    IPC classification: G06F17/30 G06F7/00

    CPC classification: G06F17/30935 G06F8/437

    Abstract: A method for optimizing functions is described. A query including an invocation of a function is received. A function declaration indicates that the function is associated with a first set of datatypes. At compile-time, static type-checking is performed on the query. During the static type-checking, a second set of datatypes is determined for the function. The second set of datatypes includes the datatype of each argument of the function that is specified in the invocation. The query is compiled, where compiling the query includes compiling the function with the second set of datatypes.
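
    A rough Python analogy of compiling a function against the actual argument types: the declared signature accepts broad types (the first set), while the "compile" step looks at the types seen at the call site (the second set) and picks a specialized body when one exists. Every name here (`generic_concat`, `SPECIALIZED`, `compile_invocation`) is hypothetical, and the sketch does not model XQuery's type system.

```python
from typing import Callable, Dict, Tuple


def generic_concat(a: object, b: object) -> str:
    """Declared for broad argument types, so it must convert at run time."""
    return str(a) + str(b)


# Specialized bodies keyed by the actual argument types found during type-checking.
SPECIALIZED: Dict[Tuple[type, type], Callable[..., str]] = {
    (str, str): lambda a, b: a + b,        # no conversion needed
    (int, int): lambda a, b: f"{a}{b}",    # integer-only formatting
}


def compile_invocation(arg_types: Tuple[type, type]) -> Callable[..., str]:
    """Return the body to use for one call site, falling back to the generic one."""
    return SPECIALIZED.get(arg_types, generic_concat)


str_call = compile_invocation((str, str))
mixed_call = compile_invocation((float, str))
```

    The specialization is chosen once per call site, ahead of execution, rather than on every call.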

    Techniques of efficient XML meta-data query using XML table index
    85.
    Patent application
    Status: Granted

    Publication No.: US20070239681A1

    Publication date: 2007-10-11

    Application No.: US11394878

    Filing date: 2006-03-31

    IPC classification: G06F17/30

    Abstract: XML table indexes provide a more efficient mechanism for searching data stored in aggregate form. XML table indexes are a set of tables created to project out, in column form, commonly sought metadata from stored XML documents. By projecting the data into column form, queries on the XML documents can be efficiently processed as they can leverage the enhanced functionality provided by the database tables. The XML table indexes may use aliases, partitioning, constraints and other functions to further improve query flexibility and performance.
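
    The idea of projecting metadata into columns can be sketched with SQLite and the standard XML parser. This is an assumption-laden illustration rather than the patented design: the tables `xml_docs` and `doc_meta_idx`, the `<meta>` layout, and the sample documents are all hypothetical.

```python
import sqlite3
import xml.etree.ElementTree as ET

docs = {
    1: "<doc><meta><author>Ann</author><year>2005</year></meta><body>a</body></doc>",
    2: "<doc><meta><author>Bob</author><year>2006</year></meta><body>b</body></doc>",
    3: "<doc><meta><author>Cy</author><year>2007</year></meta><body>c</body></doc>",
}

conn = sqlite3.connect(":memory:")
# Documents are stored whole ("aggregate form").
conn.execute("CREATE TABLE xml_docs (doc_id INTEGER PRIMARY KEY, content TEXT)")
# Side table holding commonly sought metadata as ordinary columns.
conn.execute("CREATE TABLE doc_meta_idx (doc_id INTEGER, author TEXT, year INTEGER)")

for doc_id, content in docs.items():
    conn.execute("INSERT INTO xml_docs VALUES (?, ?)", (doc_id, content))
    root = ET.fromstring(content)
    conn.execute(
        "INSERT INTO doc_meta_idx VALUES (?, ?, ?)",
        (doc_id, root.findtext("meta/author"), int(root.findtext("meta/year"))),
    )

# A regular index on the projected column serves metadata queries.
conn.execute("CREATE INDEX idx_meta_year ON doc_meta_idx (year)")

hits = [
    row[0]
    for row in conn.execute(
        "SELECT doc_id FROM doc_meta_idx WHERE year >= 2006 ORDER BY doc_id"
    )
]
```

    The query never parses XML at search time; it only touches the projected columns.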

    Crawler based auditing framework
    86.
    Patent application
    Status: Under examination (published)

    Publication No.: US20070226695A1

    Publication date: 2007-09-27

    Application No.: US11649098

    Filing date: 2007-01-03

    IPC classification: G06F9/44

    CPC classification: G06Q10/06

    Abstract: Systems, methods, and other embodiments associated with post-crawl auditing are described. One system embodiment includes an audit logic that can be controlled to apply an audit rule to crawl data. The crawl data may be acquired by a crawl logic that provides the crawl data to an index logic. The crawl logic may be configured to crawl documents stored in different locations in an enterprise. The crawl logic may also be configured to crawl documents having different formats. The index logic may be configured to create an index that supports searching for documents in the enterprise. The audit logic may process the crawl data independent of the operation of the index logic.

    Progressive relaxation across tiers
    87.
    Patent application
    Status: Granted

    Publication No.: US20070208712A1

    Publication date: 2007-09-06

    Application No.: US11649010

    Filing date: 2007-01-03

    IPC classification: G06F17/30

    CPC classification: G06F17/30672

    Abstract: Systems, methods, and other embodiments associated with progressive relaxation across tiers of a search system are described. One system embodiment includes a first tier of computing components that stores search data that can be used to locate documents. The system embodiment may also include a second tier of computing components that also stores search data that can be used to locate documents. The system may also include a query logic that receives a query and locates a set of items relevant to the query by selectively and progressively relaxing a search across the search data stored at different levels until a threshold is met.
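
    One possible reading of progressive relaxation is sketched below in Python. The abstract does not specify the relaxation steps or the order in which tiers are visited, so the three match levels (exact phrase, all terms, any term), the tier-by-tier loop, and the naive substring matching are all assumptions made for illustration.

```python
from typing import Dict, List


def search(query: str, tiers: List[Dict[str, str]], threshold: int) -> List[str]:
    """Relax the match criterion step by step across tiers until enough hits exist."""
    phrase = query.lower()
    terms = phrase.split()
    # Strictest criterion first; each later level is a relaxation (assumed scheme).
    levels = [
        lambda text: phrase in text,                  # exact phrase
        lambda text: all(t in text for t in terms),   # every term, any order
        lambda text: any(t in text for t in terms),   # at least one term
    ]
    found: List[str] = []
    for matches in levels:
        for tier in tiers:
            for doc_id, text in tier.items():
                if doc_id not in found and matches(text.lower()):
                    found.append(doc_id)
            if len(found) >= threshold:
                return found  # threshold met: stop relaxing
    return found


tier1 = {"d1": "red apple pie", "d2": "apple red tart"}
tier2 = {"d3": "green apple", "d4": "red apple juice"}
three_hits = search("red apple", [tier1, tier2], 3)
one_hit = search("red apple", [tier1, tier2], 1)
```

    With a threshold of 3, the exact-phrase pass yields two documents and the all-terms pass on the first tier supplies the third, so the loosest level is never reached.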

    Techniques for rewriting XML queries directed to relational database constructs
    88.
    Granted patent
    Status: Granted

    Publication No.: US07120645B2

    Publication date: 2006-10-10

    Application No.: US10428878

    Filing date: 2003-05-01

    IPC classification: G06F17/30

    Abstract: Techniques for executing database commands include receiving a database command that includes an XML component operation that operates on an XML construct that is based on a first set of one or more relational database constructs. It is determined whether the XML component operation can be transformed to a relational database operation on a particular set of one or more relational database constructs of the first set, which does not involve the XML component operation. If it is determined that the XML component operation can be transformed, then the XML component operation is rewritten as a particular relational database operation that operates on the particular set and that does not involve the XML component operation. The particular relational database operation is evaluated. In another aspect, techniques include determining a primitive set of XML generation operations and replacing non-primitive XML generation operations with one or more operations from the primitive set.
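
    The effect of such a rewrite can be shown with a small comparison in Python and SQLite. The `emp` table, the `xml_view` generator, and the path `name` are hypothetical; the sketch only demonstrates that extracting an element from XML generated over a table returns the same result as selecting the underlying column directly, which is what makes the rewrite legitimate.

```python
import sqlite3
import xml.etree.ElementTree as ET

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (id INTEGER, name TEXT)")
conn.executemany("INSERT INTO emp VALUES (?, ?)", [(1, "Ann"), (2, "Bob")])


def xml_view():
    """XML construct built from relational rows: one <emp> element per row."""
    for emp_id, name in conn.execute("SELECT id, name FROM emp ORDER BY id"):
        elem = ET.Element("emp")
        ET.SubElement(elem, "id").text = str(emp_id)
        ET.SubElement(elem, "name").text = name
        yield elem


# Unrewritten plan: materialize the XML, then apply the XML component operation.
unrewritten = [elem.findtext("name") for elem in xml_view()]

# Rewritten plan: a purely relational operation on the underlying column.
rewritten = [row[0] for row in conn.execute("SELECT name FROM emp ORDER BY id")]
```

    The rewritten plan avoids building and parsing XML altogether.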

    Techniques for retaining hierarchical information in mapping between XML documents and relational data
    89.
    Granted patent
    Status: Granted

    Publication No.: US07103611B2

    Publication date: 2006-09-05

    Application No.: US10428443

    Filing date: 2003-05-01

    IPC classification: G06F17/30

    Abstract: Techniques for managing XML data in an SQL compliant DBMS include receiving an SQL statement. The SQL statement includes a particular operator that operates on a first instance of XML type that represents a first set of XML elements. During execution of the SQL statement, the particular operator is evaluated by generating an ordered collection of instances of XML type. Each different instance in the ordered collection is based on a different XML element from the first set; and there is an instance in the ordered collection for every XML element from either the first set or from the first set and its descendants. When descendants are included, each entry in the ordered collection indicates a level in the XML tree. In another aspect, an aggregate operator in the SQL statement operates on a collection of instances, with associated levels, to generate a single instance of XML type.
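
    The two aspects, flattening a tree into an ordered collection with levels and aggregating that collection back into one instance, can be sketched in Python as follows. The function names and the choice to carry only tag names are assumptions for illustration; the sketch is not the SQL operator described in the abstract.

```python
import xml.etree.ElementTree as ET
from typing import Iterator, List, Optional, Tuple


def sequence_with_levels(elem: ET.Element, level: int = 1) -> Iterator[Tuple[int, str]]:
    """Yield (level, tag) for an element and its descendants in document order."""
    yield level, elem.tag
    for child in elem:
        yield from sequence_with_levels(child, level + 1)


def aggregate(rows: List[Tuple[int, str]]) -> Optional[ET.Element]:
    """Rebuild a single tree from an ordered collection of (level, tag) entries."""
    root: Optional[ET.Element] = None
    stack: List[ET.Element] = []  # stack[i] is the open element at level i + 1
    for level, tag in rows:
        node = ET.Element(tag)
        del stack[level - 1:]      # close elements at this level or deeper
        if stack:
            stack[-1].append(node)
        else:
            root = node
        stack.append(node)
    return root


tree = ET.fromstring("<a><b><c/></b><d/></a>")
rows = list(sequence_with_levels(tree))
rebuilt = aggregate(rows)
```

    The level stored with each entry is what allows the hierarchy to be recovered from a flat, ordered list.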
