摘要:
Methods and apparatus are provided for mapping XML source documents to target documents using schema embeddings. According to one aspect of the invention, one or more edges in the one or more source schemas are mapped to one or more paths in at least one target schema. The disclosed mapping techniques ensure that (i) one or more source documents that conform to one or more of the source schemas can be recovered from one or more target documents that conform to the at least one target schema, if a mapping exists between the one or more of the source schemas and the at least one target schema; (ii) queries on one or more source documents that conform to one or more of the source schemas in a given query language can be answered on one or more target documents that conform to the at least one target schema; and (iii) the one or more target documents conform to a target schema.
摘要:
Methods and apparatus are provided for identifying constraint violation repairs in data that is comprised of a plurality of records, where each record has a plurality of cells. A database is processed, based on a plurality of constraints that data in the database must satisfy. At least one constraint violation to be resolved is identified based on a cost of repair and the corresponding records to be resolved and equivalent cells are identified in the data that violate the identified at least one constraint violation. A value for each of the equivalent cells can optionally be determined, and the determined value can be assigned to each of the equivalent cells. The at least one constraint violation selected for resolution may be, for example, the constraint violation with a lowest cost. The cost of repairing a constraint is based on a distance metric between the attributes values.
摘要:
Methods and apparatus are provided for incremental update of an XML tree defined from a recursive XML view of a relational database. A method comprises the steps of detecting at least one change to the relational database; providing one or more queries to the relational database to map the change to the relational database into changes to the XML tree, wherein at least one component of a definition of the one or more queries is executed a plurality of times in traversing a path through the XML tree; and applying the mapped change to the XML tree. A bud-cut method and a reduction approach are presented.
摘要:
Methods and apparatus are provided for improved schema mapping of source documents to target documents. A list of matches are generated between at least one source table and at least one target table. One or more of the matches are annotated with a logical condition providing a context in which the match applies. Matches can be annotated with a logical condition, for example, by generating a set of candidate view conditions, C, to be applied to the one or more source tables. A schema match algorithm can generate the list of matches. Candidate logical conditions can be identified, for example, by (i) creating a set of views for categorical attributes in the tables and adding a view for each partitioning of the attribute values; (ii) using a classifier built on target attribute values; or (iii) evaluating internal features of a source table.
摘要:
A grammar for mapping a first grouping of XML data into a second grouping of XML data and a method for accomplishing same to incorporate the first grouping into the second grouping. The grammar includes a first rule for computing a first child element attribute and a second rule for computing a second parent element attribute. The first rule and second rule vary according to a production of an element type of the first grouping. The element types include PCDATA, disjunctive, conjunctive and Kleene star, each having a unique rule set for defining inherited and synthesized attributes of the parent and child elements. The method includes the step of executing a mapping of a first grouping having at least one parent element and a set of corresponding child elements into a second grouping in accordance with the grammar rules based on the production of the element type.
摘要:
Methods and apparatus are provided for propagating functional dependencies with conditions. Propagation covers are computed using an SPC view of a dataset, wherein the SPC view comprises selection, projection and Cartesian product operations. Selection operations are processed to extract equivalence classes. Cartesian product operations are processed to obtain a renamed set of the plurality of conditional functional dependencies, that have attributes appearing in the SPC view. Domain constraints from the equivalence classes are applied to the renamed set to remove attributes not in the SPC view. Projection operations are processed using a reduction by resolution procedure to identify inferences that can be propagated to the SPC view from the conditional functional dependencies having attributes that do not appear in the SPC view. Domain constraints of the equivalence classes are converted to conditional functional dependencies; and a minimal cover of the SPC view is determined.
摘要:
Methods and apparatus are provided for rewriting view queries into equivalent queries on the source document. According to one aspect of the invention, methods are provided for processing a view query on a database view. The method comprises the steps of translating the view query to a mixed finite state automata representation of a document query on one or more documents underlying the database view; and evaluating the document query on the one or more documents to obtain a result to the view query. The view query may be, for example, a regular XPath query.
摘要:
Methods and apparatus are provided for incremental update of an XML tree defined from a recursive XML view of a relational database. A method comprises the steps of detecting at least one change to the relational database; providing one or more queries to the relational database to map the change to the relational database into changes to the XML tree, wherein at least one component of a definition of the one or more queries is executed a plurality of times in traversing a path through the XML tree; and applying the mapped change to the XML tree. A bud-cut method and a reduction approach are presented.
摘要:
Method and apparatus for validating propagation of XML constraints to functional dependencies when transforming XML to relational data. The method includes steps of accepting variables indicative of XML-based data, determining if one of the variables is unique based on checking the validity of XML keys defining XML constraints and determining if one or more fields in said relational data do not have a null value. The variables are selected from a set of XML keys (Σ), a transformation Rule (R) and a Functional Dependency (φ). One determining step includes substeps of viewing a transformation Rule as a Table Tree and traversing nodes in the Table Tree. The nodes are traversed until an XML key is found at a particular node and then said one of said plurality of variables (in one embodiment identified as x) is determined to be unique when compared to the context of said XML key.
摘要:
Methods and apparatus are provided for detecting data inconsistencies. Methods are disclosed for determining whether a set of conditional functional dependencies are consistent; determining a minimal cover of a set of conditional functional dependencies and detecting a violation of one or more conditional functional dependencies in a set of conditional functional dependencies. The conditional functional dependencies comprise one or more constraints that data in a database must satisfy including at least one pattern with data values.