Generating a schema-specific load structure to load data into a relational database based on determining whether the schema-specific load structure already exists
    101.
    发明授权
    Generating a schema-specific load structure to load data into a relational database based on determining whether the schema-specific load structure already exists 有权
    基于确定模式特定的负载结构是否已经存在,生成模式特定的负载结构以将数据加载到关系数据库中

    公开(公告)号:US07490093B2

    公开(公告)日:2009-02-10

    申请号:US10718823

    申请日:2003-11-21

    IPC分类号: G06F17/00 G06F17/30

    摘要: Described is a method and system for storing data into a database, where a determination is made if schema metadata that is used to load the data into the database already exists, and where the existing schema metadata is used to load the data into the database if the schema metadata already exists. If the appropriate schema metadata does not exist, then it is generated and cached so that a later load operation for the same schema type will not need to re-generate this information. In this way, the cost to generate the schema metadata is amortized over multiple load operations to load data of the same schema type. The approach is protocol neutral so that multiple different protocol-based loads can operate with the same schema metadata information and load structures.

    摘要翻译: 描述了一种用于将数据存储到数据库中的方法和系统,其中确定用于将数据加载到数据库中的模式元数据是否已经存在,并且使用现有模式元数据将数据加载到数据库中,如果 模式元数据已经存在。 如果相应的模式元数据不存在,那么它将被生成和缓存,以便相同模式类型的后续加载操作将不需要重新生成此信息。 以这种方式,生成架构元数据的成本将通过多个加载操作进行分摊,以加载相同模式类型的数据。 该方法是协议中性的,以便多个不同的基于协议的负载可以使用相同的模式元数据信息和负载结构进行操作。

    Using sibling-count in XML indexes to optimize single-path queries
    102.
    发明申请
    Using sibling-count in XML indexes to optimize single-path queries 有权
    在XML索引中使用sibling-count来优化单路径查询

    公开(公告)号:US20090037369A1

    公开(公告)日:2009-02-05

    申请号:US11888415

    申请日:2007-07-31

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30935 G06F17/30442

    摘要: A method and apparatus are provided for using sibling-counts in XML indices to optimize single-path queries. Using a b-tree XML index with a SQL query logarithmically reduces the number of disk accesses by passing over index entries where it is determined that a match will not be found. However, because certain index entries are passed over, it is impossible to ascertain if a path expression occurs more than once in the XML index, as certain queries sometimes require. This hurdle can be overcome by maintaining a sibling count with each node entry in the XML index. Because the sibling count is stored with the index entry, the index will reveal whether the matching node is single or has other siblings. In additional to re-writing the original query for optimization by use of an XML index, it will be re-written to check for a single-path condition in the index.

    摘要翻译: 提供了一种方法和装置,用于在XML索引中使用同级计数来优化单路径查询。 在SQL查询中使用b-tree XML索引对数地减少了通过传递索引条目的磁盘访问次数,确定不会找到匹配项。 但是,由于某些索引条目被传递,因此某些查询有时需要,无法确定路径表达式是否在XML索引中多次出现。 通过维护XML索引中每个节点条目的兄弟数量,可以克服这个障碍。 由于兄弟计数与索引条目一起存储,索引将显示匹配节点是单个节点还是具有其他兄弟节点。 除了通过使用XML索引重新编写原始查询进行优化外,还将重新编写索引中的单路径条件。

    Mechanism to enable evolving XML schema
    103.
    发明授权
    Mechanism to enable evolving XML schema 有权
    启用演进XML模式的机制

    公开(公告)号:US07395271B2

    公开(公告)日:2008-07-01

    申请号:US10648497

    申请日:2003-08-25

    IPC分类号: G06F17/30

    摘要: A method and system for evolving XML-schema-based data to conform to an evolved XML schema is disclosed. Based on an existing XML schema and an instance document that is based on the existing XML schema, an XML-schema-independent form of the instance document is generated. Based on a set of specified transformations and the XML-schema-independent form of the instance document, an evolved instance document is generated. The evolved instance document conforms to an evolved XML schema that incorporates changes to the existing XML schema. Techniques described herein are flexible enough to accommodate a wide variety of evolutions to XML schemas.

    摘要翻译: 公开了一种用于演进基于XML模式的数据以符合演进的XML模式的方法和系统。 基于现有XML模式和基于现有XML模式的实例文档,生成实例文档的与XML模式无关的形式。 基于一组指定的转换和实例文档的与XML模式无关的形式,生成了演进的实例文档。 演进的实例文档符合演进的XML模式,其中包含对现有XML模式的更改。 本文描述的技术足够灵活,以适应各种演进到XML模式。

    TECHNIQUES FOR EFFICIENT LOADING OF BINARY XML DATA
    105.
    发明申请
    TECHNIQUES FOR EFFICIENT LOADING OF BINARY XML DATA 有权
    有效加载二进制XML数据的技术

    公开(公告)号:US20080098001A1

    公开(公告)日:2008-04-24

    申请号:US11743563

    申请日:2007-05-02

    IPC分类号: G06F17/30

    摘要: Various techniques are described hereafter for improving the efficiency of binary XML encoding and loading operations. In particular, techniques are described for incrementally encoding XML in response to amount-based requests. After encoding enough binary XML to satisfy an amount-based request, the encoder stops encoding the XML until a subsequent request is received. The incremental encoding may take place on the client-side or the server-side. Techniques are also described for reducing the character set conversion operations by having a parser convert tokens in text XML into one character set while converting non-token text in the text XML into another character set. Techniques are also described for generating self-contained binary XML documents, and for improving remap operations by providing a binary XML document on a chunk-by-chunk basis.

    摘要翻译: 以下描述了用于提高二进制XML编码和加载操作的效率的各种技术。 特别地,描述了用于响应于基于量的请求逐渐编码XML的技术。 在编码足够的二进制XML以满足基于数量的请求之后,编码器停止对XML的编码,直到接收到后续请求。 增量编码可能发生在客户端或服务器端。 还描述了通过使解析器将文本XML中的令牌转换为一个字符集同时将文本XML中的非令牌文本转换为另一个字符集来减少字符集转换操作的技术。 还描述了用于生成自包含二进制XML文档以及通过以块为单位提供二进制XML文档来改进重映射操作的技术。

    Managing compound XML documents in a repository

    公开(公告)号:US20080091703A1

    公开(公告)日:2008-04-17

    申请号:US11807938

    申请日:2007-05-29

    申请人: Ravi Murthy

    发明人: Ravi Murthy

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30923

    摘要: A declarative mechanism is used to manage large documents within a repository. The large documents are sectioned into subdocuments that are linked together by a parent document. The combination of the parent document and subdocument is referred to as a compound document. There are multiple options for configuring rules to break up a source document into a compound document and naming the subdocuments. The compound documents may be queried using statements that treat the compound document as a single XML document, or the parent document of a subdocument may be queried and treated independently. Access control and versioning can be applied at the finer granularity of the subdocument.

    Managing compound XML documents in a repository
    107.
    发明申请
    Managing compound XML documents in a repository 有权
    在存储库中管理复合XML文档

    公开(公告)号:US20080091693A1

    公开(公告)日:2008-04-17

    申请号:US11807939

    申请日:2007-05-29

    申请人: Ravi Murthy

    发明人: Ravi Murthy

    IPC分类号: G06F17/30

    摘要: A declarative mechanism is used to manage large documents within a repository. The large documents are sectioned into subdocuments that are linked together by a parent document. The combination of the parent document and subdocument is referred to as a compound document. There are multiple options for configuring rules to break up a source document into a compound document and naming the subdocuments. The compound documents may be queried using statements that treat the compound document as a single XML document, or the parent document of a subdocument may be queried and treated independently. Access control and versioning can be applied at the finer granularity of the subdocument.

    摘要翻译: 声明机制用于管理存储库中的大型文档。 大文档被分段到由父文档链接在一起的子文档中。 父文档和子文档的组合被称为复合文档。 有多种选项用于配置将源文档分解为复合文档并命名子文档的规则。 可以使用将复合文档视为单个XML文档的语句来查询复合文档,或者可以单独查询和处理子文档的父文档。 访问控制和版本控制可以在子文档的更细粒度上应用。

    Technique to estimate the cost of streaming evaluation of XPaths
    108.
    发明申请
    Technique to estimate the cost of streaming evaluation of XPaths 有权
    估算XPath流评估成本的技术

    公开(公告)号:US20080091623A1

    公开(公告)日:2008-04-17

    申请号:US11716505

    申请日:2007-03-08

    IPC分类号: G06Q10/00

    CPC分类号: G06F17/30935 G06Q30/0283

    摘要: A method and apparatus for estimating the cost of streaming evaluation of XPaths is provided. Aggregate statistics are maintained by the database server upon initiation of a database function by the database administrator about the nodes of the XML document. Based upon these statistics and the complexity of the particular XPath query, an estimate of the cost of the query, in time and computing resources required, is computed.

    摘要翻译: 提供了一种用于估计XPath的流评估成本的方法和装置。 在数据库管理员启动数据库功能时,数据库服务器会保留有关XML文档节点的统计统计信息。 基于这些统计信息和特定XPath查询的复杂性,计算查询的时间和计算资源所需成本的估计。

    Mechanism for improving performance on XML over XML data using path subsetting
    109.
    发明申请
    Mechanism for improving performance on XML over XML data using path subsetting 有权
    使用路径子集提高XML数据的性能的机制

    公开(公告)号:US20060184551A1

    公开(公告)日:2006-08-17

    申请号:US11401613

    申请日:2006-04-10

    IPC分类号: G06F17/00 G06F7/00

    CPC分类号: G06F17/30938 G06F17/30321

    摘要: Techniques are provided for indexing XML documents using path subsetting. According to one embodiment, a PATH table created for storing one row for each indexed node of the XML documents using user-defined criteria. The user-defined criteria are used to determine which nodes of XML documents to included in The PATH TABLE. The PATH table row for a node includes (1) information for locating the XML document that contains the node, (2) information that identifies the path of the node, and (3) information that identifies the position of the node within the hierarchical structure of the XML document that contains the node. Use of the user defined criteria is transparent to any query improves DML indexes overhead costs.

    摘要翻译: 提供了使用路径子集对XML文档进行索引的技术。 根据一个实施例,创建用于使用用户定义的标准为XML文档的每个索引节点存储一行的PATH表。 用户定义的标准用于确定要包含在PATH TABLE中的XML文档的哪些节点。 节点的PATH表行包括(1)用于定位包含节点的XML文档的信息,(2)标识节点路径的信息,以及(3)标识分层结构中节点位置的信息 的包含节点的XML文档。 使用用户定义的条件对任何查询都是透明的,改善了DML索引的开销成本。

    Mechanisms for storing content and properties of hierarchically organized resources
    110.
    发明授权
    Mechanisms for storing content and properties of hierarchically organized resources 有权
    存储层次组织资源的内容和属性的机制

    公开(公告)号:US07047253B1

    公开(公告)日:2006-05-16

    申请号:US10256524

    申请日:2002-09-27

    IPC分类号: G06R17/30

    摘要: Various techniques are provided for facilitating the management of hierarchical data within a relational database system. One such technique involves separating the storage structures used to store data that captures the information about the hierarchy (the “hierarchy structures”), from the storage structures used to store the content of the resources that belong to the hierarchy (the “content structures”). Techniques are also provided for allowing users to customize the metadata attributes associated with resources that belong to the information hierarchy. One technique involves registering XML schemas that specify the metadata attributes desired by a user. Another technique involves storing attributes that do not correspond to any declared field in a “catch-all” column within the resource table. Techniques are provided for determining how to store resources as they are added to the database. According to one technique, the database server searches the data of the resource to find content-type information. If content-type information is found, then the database server consults a content-type to content-structure mapping to determine where to store the content of the resource.

    摘要翻译: 提供了各种技术以便于关系数据库系统内的分级数据的管理。 一种这样的技术包括从用于存储属于层级的资源的内容的存储结构(“内容结构”)中分离用于存储捕获关于层级的信息的数据的存储结构(“层次结构”)。 )。 还提供了技术来允许用户定制与属于信息层级的资源相关联的元数据属性。 一种技术涉及注册指定用户所期望的元数据属性的XML模式。 另一种技术涉及存储与资源表中“catch-all”列中任何声明字段不相对应的属性。 提供了技术来确定如何在将资源添加到数据库时存储资源。 根据一种技术,数据库服务器搜索资源的数据以查找内容类型信息。 如果找到内容类型信息,则数据库服务器将查询内容类型到内容 - 结构映射,以确定在哪里存储资源的内容。