Automatic Denormalization For Analytic Query Processing In Large-Scale Clusters
    51.
    Patent application
    Status: In force

    Publication No.: US20140067791A1

    Publication date: 2014-03-06

    Application No.: US13605903

    Filing date: 2012-09-06

    IPC classification: G06F17/30

    Abstract: A system and method for improving performance of join operations is disclosed. The method in one embodiment includes building a graph of vertices and edges, which represent the tables and join conditions for a particular database schema. The method then traverses the graph to find a set of candidate join operations. The method further includes selecting from the set a subset of join operations that makes the best use of a given amount of memory, that is, finding a collection of denormalizations which, when residing in memory, provide the most benefit and the least cost for the given amount of memory. The method then deploys the denormalizations so that current queries in a query processing system can use them instead of actually performing the joins.
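
The selection step in this abstract resembles a budgeted subset choice, so a small illustration may help. The sketch below is not the patented method: it assumes a schema graph given as table pairs, enumerates connected table sets as candidate joins, and then picks a subset with a standard 0/1 knapsack. The function names, the integer size units, and the net-benefit scores are all hypothetical.

```python
from collections import defaultdict


def build_graph(join_conditions):
    """Vertices are tables; each edge is a join condition between two tables."""
    graph = defaultdict(set)
    for left, right in join_conditions:
        graph[left].add(right)
        graph[right].add(left)
    return dict(graph)


def candidate_joins(graph, max_tables=3):
    """Walk the join edges and collect every connected set of 2..max_tables tables."""
    found = set()

    def walk(tables):
        if len(tables) >= 2:
            found.add(tables)
        if len(tables) == max_tables:
            return
        for table in tables:
            for neighbour in graph[table]:
                if neighbour not in tables:
                    walk(tables | {neighbour})

    for start in graph:
        walk(frozenset([start]))
    return found


def select_denormalizations(candidates, memory_budget):
    """0/1 knapsack over (name, size, net_benefit) tuples with integer sizes.

    Returns (total_benefit, chosen_names) for the given memory budget.
    The sizes and benefits are assumed inputs, e.g. from a cost model.
    """
    best = [(0, [])] * (memory_budget + 1)  # best[m]: best choice using at most m units
    for name, size, benefit in candidates:
        for m in range(memory_budget, size - 1, -1):
            total = best[m - size][0] + benefit
            if total > best[m][0]:
                best[m] = (total, best[m - size][1] + [name])
    return best[memory_budget]
```

A knapsack is only one way to read "most benefit and least cost for the given amount of memory"; a real system would also have to account for overlapping candidates that share tables.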

    Efficient way to evaluate uncorrelated path-based row sources with XML storage
    52.
    Granted patent
    Status: In force

    Publication No.: US08239373B2

    Publication date: 2012-08-07

    Application No.: US12467984

    Filing date: 2009-05-18

    IPC classification: G06F17/30

    CPC classification: G06F17/30932

    Abstract: The approaches described herein provide an efficient way to process certain kinds of queries that retrieve XML data stored in an object-relational database. A technique determines whether a query comprises a plurality of iteration expressions that exhibit a particular relationship to one another, and if so, rewrites the query and produces an optimized query execution plan. The query execution plan specifies an XPath-based row source with caching enabled. A very small cache structure is used, and reference locators are stored in the cache that point into binary-encoded data rather than copying XMLType data into the cache. Row data may be cached on the first use of the path-based row source and reused in subsequent requests for rows. Once there is a subsequent request for rows from the optimized path-based row source, the cache may be reorganized to efficiently produce the rows of the row source.

    TECHNIQUES FOR FAST AND SCALABLE XML GENERATION AND AGGREGATION OVER BINARY XML
    53.
    Patent application
    Status: In force

    Publication No.: US20110184969A1

    Publication date: 2011-07-28

    Application No.: US12692549

    Filing date: 2010-01-22

    IPC classification: G06F17/30

    CPC classification: G06F17/30938

    Abstract: Techniques for fast and scalable generation and aggregation of XML data are described. In an example embodiment, an XML query that requests data from XML documents is received. The XML query is evaluated to determine one or more XML results. For each particular XML result, evaluating the XML query comprises: instantiating a particular data structure that represents the particular XML result, where the particular data structure is encoded in accordance with tags specified in the XML query but does not store the tags; and storing, in the particular data structure, one or more locators that respectively point to one or more fragments in the XML documents, where the particular data structure stores the one or more locators but does not store the one or more fragments. On demand, in response to a request indicating the particular XML result, a serialized representation of the particular XML result is generated based at least on the particular data structure.
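
To make the "locators instead of fragments" idea concrete, here is a minimal sketch under simplifying assumptions: documents are plain strings rather than binary XML, a locator is a (document id, start, end) slice, and the result stores indexes into the query's tag list instead of the tag names. `Locator`, `LazyResult`, and `serialize` are illustrative names, not terms from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Locator:
    """Points at a fragment inside a stored document without copying it."""
    doc_id: str
    start: int
    end: int


@dataclass
class LazyResult:
    """One query result: tag positions from the query plus fragment locators."""
    tag_ids: list[int]        # indexes into the query's tag list, not the tags
    locators: list[Locator]   # where each fragment lives, not the fragment text


def serialize(result, query_tags, documents):
    """Build the serialized XML only when a caller asks for it."""
    parts = []
    for tag_id, loc in zip(result.tag_ids, result.locators):
        tag = query_tags[tag_id]
        fragment = documents[loc.doc_id][loc.start:loc.end]
        parts.append(f"<{tag}>{fragment}</{tag}>")
    return "".join(parts)
```

Until `serialize` is called, each result costs only a few integers per fragment, which is the property the abstract emphasizes.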

    METHOD AND APPARATUS TO SUPPORT PRIVILEGES AT MULTIPLE LEVELS OF AUTHENTICATION USING A CONSTRAINING ACL
    54.
    Patent application
    Status: In force

    Publication No.: US20090144804A1

    Publication date: 2009-06-04

    Application No.: US11947235

    Filing date: 2007-11-29

    IPC classification: H04L9/32

    CPC classification: G06F21/6218 G06F2221/2141

    Abstract: Embodiments of the present invention provide systems and techniques for creating, updating, and using an ACL (access control list). A database system may include a constraining ACL which represents a global security policy that is to be applied to all applications that interact with the database. By ensuring that all ACLs inherit from the constraining ACL, the database system can ensure that the global security policy is applied to all applications that interact with the database. During operation, the system may receive a request to create or update an ACL. Before creating or updating the ACL, the system may modify the ACL to ensure that it inherits from the constraining ACL. In an embodiment, the system grants a privilege to a user only if both the ACL and the constraining ACL grant the privilege.
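
The rule in the last sentence of this abstract (a privilege is granted only if both the ACL and the constraining ACL grant it) can be sketched in a few lines. The `ACL` class, `save_acl`, and `is_allowed` below are hypothetical names chosen for illustration; real ACL evaluation would also involve roles, denies, and ordering, none of which the abstract specifies.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class ACL:
    grants: dict[str, set[str]] = field(default_factory=dict)  # user -> privileges
    parent: Optional["ACL"] = None                              # ACL this one inherits from

    def grants_privilege(self, user, privilege):
        return privilege in self.grants.get(user, set())


def save_acl(acl, constraining_acl):
    """On create or update, force the ACL to inherit from the constraining ACL."""
    if acl is not constraining_acl:
        acl.parent = constraining_acl
    return acl


def is_allowed(acl, user, privilege):
    """Grant only if the ACL and the ACL it inherits from both grant the privilege."""
    if not acl.grants_privilege(user, privilege):
        return False
    return acl.parent is None or acl.parent.grants_privilege(user, privilege)
```

Because every saved ACL is rewired to inherit from the constraining ACL, an application cannot widen access beyond the global policy simply by granting more in its own list.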

    Incremental maintenance of an XML index on binary XML data
    55.
    Patent application
    Status: In force

    Publication No.: US20080098020A1

    Publication date: 2008-04-24

    Application No.: US11715603

    Filing date: 2007-03-07

    IPC classification: G06F17/30

    CPC classification: G06F17/30911 G06F17/30569

    Abstract: Techniques are provided for incrementally maintaining an XML index built to access XML data that is encoded in binary XML form. Rather than deleting and reinserting the index entries of all the nodes of a modified XML document, only the index entries of the affected nodes are modified. Consequently, the order key values stored in the index may become inconsistent with the current hierarchical locations of the nodes to which the order key values correspond. Techniques are described for resolving the inconsistencies, and for addressing additional problems that result when the XML index is path-subsetted.

    Efficient partitioning technique while managing large XML documents
    56.
    Patent application
    Status: In force

    Publication No.: US20080091714A1

    Publication date: 2008-04-17

    Application No.: US11716074

    Filing date: 2007-03-08

    IPC classification: G06F17/00

    CPC classification: G06F17/30923

    Abstract: A method is provided to efficiently evaluate an expression to determine the partition key for an XML document stored in a database without the entire XML document first being stored in temporary memory storage. The partition key is determined using streaming evaluation or incrementally using a DOM node tree as a portion of the document is read and stored in the buffer. The XML document is stored in the partition using the read portion of the document stored in the buffer and the remaining portion from the original source.
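
A rough sketch of the streaming variant described here, using Python's standard `xml.etree.ElementTree.XMLPullParser`: the document is read in chunks, each chunk is kept in a buffer, and reading stops as soon as the key element has been parsed. Treating the partition key as the text of a single named element is an assumption for illustration, as are the function name and the chunk size.

```python
import xml.etree.ElementTree as ET


def find_partition_key(source, key_tag, chunk_size=4096):
    """Read `source` chunk by chunk until the element `key_tag` has closed.

    Returns (key_text, buffered_prefix). The rest of the document is left
    unread in `source`, so it never has to be held in memory as a whole.
    """
    parser = ET.XMLPullParser(events=("end",))
    buffered = []
    while True:
        chunk = source.read(chunk_size)
        if not chunk:
            return None, "".join(buffered)  # key element never appeared
        buffered.append(chunk)
        parser.feed(chunk)
        for _event, elem in parser.read_events():
            if elem.tag == key_tag:
                return elem.text, "".join(buffered)
```

The caller can then write the buffered prefix followed by whatever remains in `source` to the chosen partition, which mirrors the final sentence of the abstract.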

    Handling uniqueness constraints in a database system with versioned data
    57.
    Patent application
    Status: In force

    Publication No.: US20070043783A1

    Publication date: 2007-02-22

    Application No.: US11295310

    Filing date: 2005-12-05

    IPC classification: G06F17/30

    Abstract: A version history of each of a plurality of resources is stored in a table. The version history for each resource comprises a set of one or more values for the resource, wherein each value in the version history of a resource indicates the value that the resource had in a particular version of the resource. In response to a request to perform an operation that causes a given resource that is subject to a uniqueness constraint to have a particular value, it is determined whether the uniqueness constraint is violated based on whether any version of any resource other than the given resource has the particular value. The uniqueness constraint is determined to be violated if any version of any resource other than the given resource has the particular value. The uniqueness constraint is not violated by existence of a version of the given resource that has the particular value.
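
The check described in this abstract reduces to a short predicate. The sketch below assumes the version history is held as an in-memory mapping from resource id to the list of values across that resource's versions; the function name and data layout are illustrative, not taken from the patent.

```python
def violates_uniqueness(version_history, resource_id, new_value):
    """Return True if any version of any *other* resource already has new_value.

    version_history maps a resource id to the values that resource held in
    each of its versions. Earlier versions of the same resource are ignored.
    """
    return any(
        new_value in values
        for other_id, values in version_history.items()
        if other_id != resource_id
    )


history = {"doc1": ["a", "b"], "doc2": ["c"]}
reuse_own_value = violates_uniqueness(history, "doc1", "a")    # own old version: allowed
take_others_value = violates_uniqueness(history, "doc2", "a")  # doc1 once held "a": violation
```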

    In-place evolution of XML schemas
    58.
    Patent application
    Status: In force

    Publication No.: US20050050105A1

    Publication date: 2005-03-03

    Application No.: US10648749

    Filing date: 2003-08-25

    IPC classification: G06F12/00 G06F17/30

    CPC classification: G06F17/30297 G06F17/30908

    Abstract: A method and system for the in-place evolution of XML schemas is disclosed. To automatically evolve an existing XML schema, a schema evolver receives both an existing XML schema and an XML document as input. The XML document indicates changes to be made to the existing XML schema. Based on the existing XML schema and the XML document, the schema evolver evolves the existing XML schema into a new XML schema that incorporates the changes indicated in the XML document. According to one aspect, the schema evolver generates one or more SQL statements based on the new XML schema. The SQL statements, when executed by a database server, cause the database server to evolve database structures that were based on the formerly existing XML schema so that the database structures conform to the new XML schema. This is accomplished "in place," without copying the data in the database structures.

    Direct loading of semistructured data
    59.
    Granted patent
    Status: In force

    Publication No.: US07814047B2

    Publication date: 2010-10-12

    Application No.: US10648600

    Filing date: 2003-08-25

    IPC classification: G06F17/30

    CPC classification: G06F17/30917

    Abstract: Techniques and systems are disclosed for directly storing semistructured data in a database. According to one aspect, a client application reads data that comprises instances of a parent type. The client application invokes routines associated with the parent type. An array is created for storing instances of the parent type. These routines invoke routines associated with a child type of the parent type. An array is created for storing instances of the child type. The arrays are populated with values specified in the data. According to one aspect, some columns of the arrays may be populated with other values to be stored in hidden columns of database tables. The client application converts the arrays into a data stream that conforms to the format of the database's data blocks. The client application then streams the data to a database server, which writes the data directly into one or more data blocks in the database.

    Efficient loading of data into a relational database
    60.
    Patent application
    Status: In force

    Publication No.: US20050050074A1

    Publication date: 2005-03-03

    Application No.: US10718823

    Filing date: 2003-11-21

    IPC classification: G06F17/00

    Abstract: Described is a method and system for storing data into a database, where a determination is made whether schema metadata that is used to load the data into the database already exists, and where the existing schema metadata is used to load the data into the database if it does. If the appropriate schema metadata does not exist, then it is generated and cached so that a later load operation for the same schema type will not need to regenerate this information. In this way, the cost to generate the schema metadata is amortized over multiple load operations to load data of the same schema type. The approach is protocol neutral so that multiple different protocol-based loads can operate with the same schema metadata information and load structures.
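
The amortization argument in this abstract is essentially a generate-once cache keyed by schema type. The following is a minimal sketch under that reading; `SchemaMetadataCache`, the `generate` callback, the `"columns"` metadata field, and the list standing in for the database are all hypothetical.

```python
class SchemaMetadataCache:
    """Generate schema metadata on first use and reuse it for later loads."""

    def __init__(self, generate):
        self._generate = generate  # assumed expensive: schema_type -> metadata dict
        self._cache = {}
        self.generations = 0       # how many times metadata was actually built

    def get(self, schema_type):
        if schema_type not in self._cache:
            self._cache[schema_type] = self._generate(schema_type)
            self.generations += 1
        return self._cache[schema_type]


def load(rows, schema_type, cache, database):
    """Load rows using cached metadata; any protocol front end can call this."""
    metadata = cache.get(schema_type)
    for row in rows:
        database.append(dict(zip(metadata["columns"], row)))
```

Because `load` takes the cache as a parameter, several protocol handlers can share one cache instance, which is one way to read the "protocol neutral" claim.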