Abstract:
Methods and systems for determining schema element types are shown that include pooling potential annotations for an element of an unlabeled schema from a plurality of heterogeneous sources, scoring the pool of potential annotations according to relevancy using instance information from the plurality of heterogeneous sources to produce a relevancy score, and annotating the element of the unlabeled schema using the most relevant potential annotations.
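The pool, score, and annotate steps described above can be sketched as follows. This is only an illustration under stated assumptions: each source is modeled as a dictionary from a candidate label to example instance values, and relevancy is taken to be the fraction of the column's values that appear among a label's pooled instances. The abstract does not specify the scoring function, and the names `pool_annotations`, `score_annotations`, and `annotate` are hypothetical.

```python
from collections import defaultdict


def pool_annotations(sources):
    """Merge candidate labels from all sources, keeping each label's known instances."""
    pool = defaultdict(set)
    for source in sources:
        for label, instances in source.items():
            pool[label] |= {value.lower() for value in instances}
    return pool


def score_annotations(pool, column_values):
    """Assumed relevancy score: share of column values found among a label's instances."""
    values = {value.lower() for value in column_values}
    if not values:
        return {}
    return {label: len(values & instances) / len(values)
            for label, instances in pool.items()}


def annotate(sources, column_values, top_k=1):
    """Return the top_k (label, score) pairs; ties are broken alphabetically."""
    scores = score_annotations(pool_annotations(sources), column_values)
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:top_k]


# Two hypothetical heterogeneous sources and one unlabeled column.
sources = [
    {"city": ["Paris", "Berlin"], "country": ["France"]},
    {"city": ["Rome"], "capital": ["Paris", "Rome", "Berlin"]},
]
best = annotate(sources, ["Paris", "Rome", "Berlin"], top_k=2)
```

Pooling before scoring lets a label that is only partly covered by any single source (here, `city`) still reach full coverage once the sources are combined.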
Abstract:
Aspects of the present invention provide a tool for extracting schema from a spreadsheet. In an embodiment, a set of data that is stored in an uncataloged tabular format, such as a spreadsheet, is retrieved. The structure of the retrieved set of data is surveyed to determine its dataset schema. Then, data elements within the dataset schema are analyzed to obtain information regarding the data elements. Based on the dataset schema and the element information, an interface can be constructed that allows remote access to the set of data.
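As a rough illustration of the survey and analysis steps, the sketch below treats the first row of CSV text as the schema and then inspects each column to infer a type and whether it contains blanks. The CSV input, the type names, and the function names are assumptions made for this example, not details taken from the abstract.

```python
import csv
import io


def _parses(cast, values):
    """Return True if every value can be converted with the given cast."""
    try:
        for value in values:
            cast(value)
    except ValueError:
        return False
    return True


def infer_type(values):
    """Guess a column type from its non-blank cells."""
    present = [value.strip() for value in values if value.strip()]
    if not present:
        return "unknown"
    if _parses(int, present):
        return "integer"
    if _parses(float, present):
        return "number"
    return "text"


def extract_schema(csv_text):
    """Survey the header row, then analyze each column's cells."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    if not rows:
        return []
    header, data = rows[0], rows[1:]
    schema = []
    for index, name in enumerate(header):
        # Short rows are padded with blanks so every column has one cell per row.
        column = [row[index] if index < len(row) else "" for row in data]
        schema.append({
            "name": name.strip(),
            "type": infer_type(column),
            "nullable": any(not cell.strip() for cell in column),
        })
    return schema


schema = extract_schema("id,price,label\n1,2.5,a\n2,,b\n")
```

A description like this could then drive a generated access interface, for example by exposing one typed field per schema entry.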
Abstract:
Methods, apparatus and systems, including computer program products, are provided for reducing an error rate when mapping entities between a first ontology and a second ontology. One or more of a general language dictionary and an industry-specific dictionary are provided. Natural language processing of the first ontology is performed to identify one or more candidate relationship entities in the first ontology. Each candidate relationship entity includes a compound name having two or more semantic labels, and each candidate relationship entity has a name that exists in neither the general language dictionary nor the industry-specific dictionary. Each of the one or more candidate relationship entities in the first ontology is mapped to one or more entities in the second ontology using one or more configurable computer-implemented mapping algorithms.
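The candidate test described above (a compound name with two or more labels that appears in neither dictionary) might look like the following minimal sketch. It assumes names are written in camelCase or with underscore, hyphen, or space separators, and that dictionary lookup is done on the lowercased, joined name; both are assumptions, and `split_labels` and `is_candidate` are hypothetical helpers.

```python
import re


def split_labels(name):
    """Split a compound entity name into lowercase labels."""
    labels = []
    for part in re.split(r"[_\-\s]+", name):
        # Acronym runs, capitalized or lowercase words, and digit runs.
        labels.extend(re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|\d+", part))
    return [label.lower() for label in labels]


def is_candidate(name, general_dict, industry_dict):
    """True if the name has 2+ labels and is in neither dictionary."""
    labels = split_labels(name)
    joined = "".join(labels)
    known = joined in general_dict or joined in industry_dict
    return len(labels) >= 2 and not known


# Tiny hypothetical dictionaries for illustration only.
general = {"notebook", "employee"}
industry = {"backorder"}
candidates = [name for name in ["worksFor", "Employee", "note_book", "back_order", "hasPart"]
              if is_candidate(name, general, industry)]
```

Names that pass this filter would then be handed to whichever mapping algorithms are configured for the second ontology.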
Abstract:
Techniques for computing a solution to a query formulated against a knowledge base (KB) are provided. The techniques include: receiving a query formulated against a knowledge base, wherein the knowledge base comprises a set of one or more axioms and each axiom is annotated with a specific probability value indicating a degree of certainty assigned to it; ignoring each probability value of the one or more axioms and computing a solution to the query; computing each of one or more justifications for the query solution, wherein each justification is a minimal set of one or more axioms in the knowledge base that entails the query solution; and using each probability value of the one or more axioms in each justification to compute a net probability of the inferred query solution.
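The final step, combining axiom probabilities across justifications, can be illustrated with a short sketch. The abstract does not state how the net probability is computed; the version below assumes the axioms are statistically independent and applies inclusion-exclusion over the justifications, so that the result is the probability that at least one justification holds in full. The justifications themselves are taken as given, since finding them requires a reasoner.

```python
from itertools import combinations
from math import prod


def net_probability(justifications, prob):
    """Probability that at least one justification holds, assuming independent axioms.

    justifications: list of sets of axiom identifiers (each a minimal entailing set).
    prob: mapping from axiom identifier to its annotated probability.
    """
    total = 0.0
    for size in range(1, len(justifications) + 1):
        sign = 1 if size % 2 else -1
        for subset in combinations(justifications, size):
            # All justifications in the subset hold iff every axiom in their union holds.
            axioms = frozenset().union(*subset)
            total += sign * prod(prob[axiom] for axiom in axioms)
    return total


prob = {"a1": 0.9, "a2": 0.8, "a3": 0.5}
disjoint = net_probability([{"a1", "a2"}, {"a3"}], prob)
overlapping = net_probability([{"a1", "a2"}, {"a1", "a3"}], prob)
```

Taking the union of axioms before multiplying avoids double-counting an axiom that is shared by several justifications, as with `a1` in the second call. Inclusion-exclusion grows exponentially with the number of justifications, so this form suits only small justification sets.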
Abstract:
A system and method for dynamically selecting logical operations using external criteria include providing tools for defining a selection criteria set having parameters including dimensions and valid values, wherein the selection criteria set includes a plurality of different arrangements and values of the parameters. The selection criteria set and collections of its instances are persistently maintained. A trigger is provided at runtime to query the persistently maintained instances and automatically select a logical operation to employ based upon a context of the query.
Abstract:
Methods and apparatus for authoring and executing an individualized language business rule are provided. In one embodiment, a method comprises creating at least one individualized language resource, creating at least one individualized language rule referencing at least one of said individualized language resources, and transforming said at least one individualized language rule into a computer-executable format.