Method and system for indexing and serializing data
    31.
    发明授权
    Method and system for indexing and serializing data 失效
    索引和序列化数据的方法和系统

    公开(公告)号:US07752192B2

    公开(公告)日:2010-07-06

    申请号:US11681486

    申请日:2007-03-02

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30911

    摘要: The present invention provides a computer implemented method, an apparatus, and a computer usable program product for indexing data. A controller identifies a set of data to be indexed, wherein a set of data structure trees represents the set of data. The controller merges the set of data structure trees to form a unified tree, wherein the unified tree contains a node for each unit of data in the set of data. The controller assigns an identifier to the node for each unit of data in the set of data that describes the node within the unified tree. The controller then serializes the unified tree to form a set of sequential series that represents the set of data structure trees, wherein the set of sequential series forms an index for the set of data.

    摘要翻译: 本发明提供了一种用于索引数据的计算机实现的方法,装置和计算机可用程序产品。 控制器识别要索引的一组数据,其中一组数据结构树表示该组数据。 控制器将数据结构树组合成一个统一的树,其中统一树包含一组数据中每个数据单元的节点。 控制器为描述统一树中节点的数据集中的每个数据单元向节点分配一个标识符。 然后,控制器对统一树进行序列化以形成一组代表数据结构树的顺序序列,其中,该顺序序列集合形成该组数据的索引。

    Querying Data and an Associated Ontology in a Database Management System
    32.
    发明申请
    Querying Data and an Associated Ontology in a Database Management System 审中-公开
    在数据库管理系统中查询数据和关联本体

    公开(公告)号:US20100145986A1

    公开(公告)日:2010-06-10

    申请号:US12711682

    申请日:2010-02-24

    IPC分类号: G06F17/30

    摘要: A method, apparatus, and computer program for querying data and an associated ontology in a database. An ontology is associated with data in database. Responsive to receiving a query from a requestor, relational data in the database is identified using the query to form identified relational data. Ontological knowledge in the ontology is identified using the identified relational data and the ontology. A result is returned to the requestor.

    摘要翻译: 一种用于在数据库中查询数据和相关本体的方法,装置和计算机程序。 本体与数据库中的数据相关联。 响应于从请求者接收查询,使用查询来识别数据库中的关系数据以形成所识别的关系数据。 本体的本体知识使用所识别的关系数据和本体来识别。 结果返回给请求者。

    Method for supporting ontology-related semantic queries in DBMSs with XML support
    33.
    发明授权
    Method for supporting ontology-related semantic queries in DBMSs with XML support 失效
    用XML支持在DBMS中支持本体相关的语义查询的方法

    公开(公告)号:US07730098B2

    公开(公告)日:2010-06-01

    申请号:US11681319

    申请日:2007-03-02

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30734 G06F17/30404

    摘要: A method for supporting semantic matching queries in a database management system (DBMS) by extracting and storing the transitive/subsumption relationships from a given ontology data in a DBMS with native XML support. These transitive relationships are transformed into a set of XML documents that are natural mappings of the hierarchical structure of the transitive relationships. A table function construct expresses semantic matching queries in a declarative manner. The semantic matching queried are automatically rewritten or translated into standard SQL/XML search operators such as XQuery, XPath and XMLExists, and executed by the SQL/XML DBMS on the given instance data and the extracted transitive relationships data.

    摘要翻译: 一种通过从具有本地XML支持的DBMS中的给定本体数据中提取和存储传递/包含关系来在数据库管理系统(DBMS)中支持语义匹配查询的方法。 这些传递关系被转换成一组XML文档,它们是传递关系的层次结构的自然映射。 表函数构造以声明方式表达语义匹配查询。 查询的语义匹配自动重写或转换为标准SQL / XML搜索运算符,如XQuery,XPath和XMLExists,并由SQL / XML DBMS在给定的实例数据和提取的传递关系数据上执行。

    System and method of mining time-changing data streams using a dynamic rule classifier having low granularity
    34.
    发明授权
    System and method of mining time-changing data streams using a dynamic rule classifier having low granularity 失效
    使用具有低粒度的动态规则分类器挖掘时变数据流的系统和方法

    公开(公告)号:US07720785B2

    公开(公告)日:2010-05-18

    申请号:US12121942

    申请日:2008-05-16

    IPC分类号: G06N5/02

    CPC分类号: G06N5/025

    摘要: A dynamic rule classifier for mining a data stream includes at least one window for viewing data contained in the data stream and a set of rules for mining the data. Rules are added and the set of rules are updated by algorithms when an drift in a concept within the data occurs, causing unacceptable drops in classification accuracy. The dynamic rule classifier is also implemented as a method and a computer program product.

    摘要翻译: 用于挖掘数据流的动态规则分类器包括用于查看数据流中包含的数据的至少一个窗口和用于挖掘数据的一组规则。 添加规则,并且当数据中的概念中的漂移发生时,通过算法更新规则集合,导致分类准确性的不可接受的下降。 动态规则分类器也被实现为一种方法和一种计算机程序产品。

    Querying data and an associated ontology in a database management system
    35.
    发明授权
    Querying data and an associated ontology in a database management system 失效
    在数据库管理系统中查询数据和关联的本体

    公开(公告)号:US07693812B2

    公开(公告)日:2010-04-06

    申请号:US11623941

    申请日:2007-01-17

    IPC分类号: G06F17/00

    摘要: A method, apparatus, and computer program for querying data and an associated ontology in a database. An ontology is associated with data in database. Responsive to receiving a query from a requestor, relational data in the database is identified using the query to form identified relational data. Ontological knowledge in the ontology is identified using the identified relational data and the ontology. A result is returned to the requestor.

    摘要翻译: 一种用于在数据库中查询数据和相关本体的方法,装置和计算机程序。 本体与数据库中的数据相关联。 响应于从请求者接收查询,使用查询来识别数据库中的关系数据以形成所识别的关系数据。 本体的本体知识使用所识别的关系数据和本体来识别。 结果返回给请求者。

    System and method for load shedding in data mining and knowledge discovery from stream data
    36.
    发明授权
    System and method for load shedding in data mining and knowledge discovery from stream data 有权
    数据挖掘中的负载脱落和流数据的知识发现的系统和方法

    公开(公告)号:US07493346B2

    公开(公告)日:2009-02-17

    申请号:US11058944

    申请日:2005-02-16

    IPC分类号: G06F12/00 G06F17/30 G06F9/46

    CPC分类号: G06K9/6297 H04L43/028

    摘要: Load shedding schemes for mining data streams. A scoring function is used to rank the importance of stream elements, and those elements with high importance are investigated. In the context of not knowing the exact feature values of a data stream, the use of a Markov model is proposed herein for predicting the feature distribution of a data stream. Based on the predicted feature distribution, one can make classification decisions to maximize the expected benefits. In addition, there is proposed herein the employment of a quality of decision (QoD) metric to measure the level of uncertainty in decisions and to guide load shedding. A load shedding scheme such as presented herein assigns available resources to multiple data streams to maximize the quality of classification decisions. Furthermore, such a load shedding scheme is able to learn and adapt to changing data characteristics in the data streams.

    摘要翻译: 挖掘数据流的加载脱落方案。 使用评分函数对流元素的重要性进行排序,并调查那些具有重要意义的元素。 在不知道数据流的精确特征值的上下文中,本文提出了使用马尔可夫模型来预测数据流的特征分布。 基于预测的特征分布,可以进行分类决定,以最大限度地提高预期效益。 此外,在此提出采用质量决策(QoD)度量来衡量决策中的不确定性水平并指导负荷脱落。 诸如此处呈现的负载脱落方案将可用资源分配给多个数据流以最大化分类决定的质量。 此外,这种负载脱落方案能够学习和适应数据流中不断变化的数据特性。

    QUERYING DATA AND AN ASSOCIATED ONTOLOGY IN A DATABASE MANAGEMENT SYSTEM
    37.
    发明申请
    QUERYING DATA AND AN ASSOCIATED ONTOLOGY IN A DATABASE MANAGEMENT SYSTEM 失效
    在数据库管理系统中查询数据和相关的本体

    公开(公告)号:US20080172353A1

    公开(公告)日:2008-07-17

    申请号:US11623941

    申请日:2007-01-17

    IPC分类号: G06N5/02 G06F17/30

    摘要: A method, apparatus, and computer program for querying data and an associated ontology in a database. An ontology is associated with data in database. Responsive to receiving a query from a requester, relational data in the database is identified using the query to form identified relational data. Ontological knowledge in the ontology is identified using the identified relational data and the ontology. A result is returned to the requester.

    摘要翻译: 一种用于在数据库中查询数据和相关本体的方法,装置和计算机程序。 本体与数据库中的数据相关联。 响应于从请求者接收查询,使用查询来识别数据库中的关系数据以形成所识别的关系数据。 本体的本体知识使用所识别的关系数据和本体来识别。 结果返回给请求者。

    SYSTEM AND METHOD OF MINING TIME-CHANGING DATA STREAMS USING A DYNAMIC RULE CLASSIFIER HAVING LOW GRANULARITY
    38.
    发明申请
    SYSTEM AND METHOD OF MINING TIME-CHANGING DATA STREAMS USING A DYNAMIC RULE CLASSIFIER HAVING LOW GRANULARITY 审中-公开
    使用具有低精度的动态规则分类器来采集时变数据流的系统和方法

    公开(公告)号:US20070260568A1

    公开(公告)日:2007-11-08

    申请号:US11379692

    申请日:2006-04-21

    IPC分类号: G06N5/02

    CPC分类号: G06N5/025

    摘要: A dynamic rule classifier for mining a data stream includes at least one window for viewing data contained in the data stream and a set of rules for mining the data. Rules are added and the set of rules are updated by algorithms when an drift in a concept within the data occurs, causing unacceptable drops in classification accuracy. The dynamic rule classifier is also implemented as a method and a computer program product.

    摘要翻译: 用于挖掘数据流的动态规则分类器包括用于查看数据流中包含的数据的至少一个窗口和用于挖掘数据的一组规则。 添加规则,并且当数据中的概念中的漂移发生时,通过算法更新规则集合,导致分类准确性的不可接受的下降。 动态规则分类器也被实现为一种方法和一种计算机程序产品。

    Methods and apparatus for mining attribute associations
    39.
    发明授权
    Methods and apparatus for mining attribute associations 失效
    挖掘属性关联的方法和装置

    公开(公告)号:US07243100B2

    公开(公告)日:2007-07-10

    申请号:US10630992

    申请日:2003-07-30

    IPC分类号: G06F17/30 G06F17/00

    摘要: Attribute association discovery techniques that support relational-based data mining are disclosed. In one aspect of the invention, a technique for mining attribute associations in a relational data set comprises the following steps/operations. Multiple items are obtained from the relational data set. Then, attribute associations are discovered using: (i) multi-attribute mining templates formed from at least a portion of the multiple items; and (ii) one or more mining preferences specified by a user. The invention provides a novel architecture for the mining search space so as to exploit the inter-relationships among patterns of different templates. The framework is relational-sensitive and supports interactive and online mining.

    摘要翻译: 公开了支持基于关系的数据挖掘的属性关联发现技术。 在本发明的一个方面,用于挖掘关系数据集中的属性关联的技术包括以下步骤/操作。 从关系数据集获得多个项目。 然后,使用以下方式发现属性关联:(i)由多个项目的至少一部分形成的多属性挖掘模板; 和(ii)用户指定的一个或多个挖掘偏好。 本发明提供了一种用于挖掘搜索空间的新型架构,以便利用不同模板的模式之间的相互关系。 该框架是关系敏感的,支持交互式和在线挖掘。

    Processing queries on hierarchical markup data using shared hierarchical markup trees
    40.
    发明授权
    Processing queries on hierarchical markup data using shared hierarchical markup trees 失效
    使用共享分层标记树处理对分层标记数据的查询

    公开(公告)号:US08635242B2

    公开(公告)日:2014-01-21

    申请号:US11548321

    申请日:2006-10-11

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30929

    摘要: Disclosed are a method, information processing system, and computer readable medium for processing queries. The method includes receiving a data query for a set of hierarchical markup documents. At least one query path expression is extracted from the data query. The query path is processed against at least one shared hierarchical markup document in a plurality of shared hierarchical markup documents. The plurality of shared hierarchical documents is associated with the set of hierarchical markup documents. In response to the shared hierarchical markup document completely matching the query path expression, a query result for the data query is generated. The query result is based on the processing of the query path expression against at least one of the shared hierarchical markup document and the difference hierarchical markup document.

    摘要翻译: 公开了一种用于处理查询的方法,信息处理系统和计算机可读介质。 该方法包括接收一组分层标记文档的数据查询。 从数据查询中提取至少一个查询路径表达式。 针对多个共享分层标记文档中的至少一个共享分层标记文档处理查询路径。 多个共享分层文档与分层标记文档集合相关联。 响应于完全匹配查询路径表达式的共享分层标记文档,生成数据查询的查询结果。 查询结果基于对于共享分层标记文档和差异分层标记文档中的至少一个的查询路径表达的处理。