Orthogonal browsing in object hierarchies
    1.
    Granted patent
    Status: lapsed

    Publication No.: US06816175B1

    Publication date: 2004-11-09

    Application No.: US09396595

    Filing date: 1999-09-15

    IPC classification: G06F15/00

    Abstract: The present invention relates to means and a method, executable by a computer system, for navigation within a tree structure whose leaf nodes represent arbitrary types of objects, i.e. related data treated as a unit. According to the current teaching, a travel point representation step is suggested: after selection of at least one non-leaf node as a travel point, only the path and the non-leaf nodes in said tree structure from said travel point to the root of said tree structure are represented in a tree view area. Moreover, the complete sub-tree of said travel point is represented in said tree view area. In addition or alternatively, after selection of said travel point, a travel box is represented for said travel point, said travel box representing the object identifications of all objects of all leaf nodes in said sub-tree of said travel point.
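
    The travel point view described in this abstract can be illustrated with a minimal sketch. It is not taken from the patent: the `Node` class, the function names, and the choice to treat any childless node as a leaf object are assumptions made for illustration only.

```python
class Node:
    """A tree node; a node without children is treated as a leaf object (assumption)."""

    def __init__(self, name):
        self.name = name
        self.parent = None
        self.children = []

    def add(self, child):
        """Attach a child node and return it so calls can be chained."""
        child.parent = self
        self.children.append(child)
        return child


def path_to_root(travel_point):
    """Names on the path from the root down to the travel point."""
    path = []
    node = travel_point
    while node is not None:
        path.append(node.name)
        node = node.parent
    return list(reversed(path))


def subtree_lines(node, depth):
    """Indented listing of a node and its complete sub-tree."""
    lines = ["  " * depth + node.name]
    for child in node.children:
        lines.extend(subtree_lines(child, depth + 1))
    return lines


def tree_view(travel_point):
    """Show only the ancestors of the travel point plus its full sub-tree."""
    path = path_to_root(travel_point)
    ancestors = ["  " * depth + name for depth, name in enumerate(path[:-1])]
    return ancestors + subtree_lines(travel_point, len(path) - 1)


def travel_box(travel_point):
    """Identifiers of all leaf objects below the travel point, depth-first."""
    if not travel_point.children:
        return [travel_point.name]
    ids = []
    for child in travel_point.children:
        ids.extend(travel_box(child))
    return ids
```

    Calling `tree_view` on a non-leaf node hides the siblings of its ancestors, while `travel_box` flattens the sub-tree into a list of leaf identifiers.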

    Taxonomy generation for document collections
    2.
    Granted patent
    Status: in force

    Publication No.: US06446061B1

    Publication date: 2002-09-03

    Application No.: US09345260

    Filing date: 1999-06-30

    IPC classification: G06F17/30

    Abstract: This mechanism relates to a method in the area of information mining within a multitude of documents stored on computer systems. More particularly, it relates to a computerized method of generating a content taxonomy of a multitude of electronic documents. The technique proposed by the current invention is able to improve, at the same time, the scalability, the coherence, and the selectivity of taxonomy generation. The fundamental approach of the current invention comprises a subset selection step, wherein a subset of a multitude of documents is selected. In a taxonomy generation step, a taxonomy is generated for that selected subset of documents, the taxonomy being a tree-structured taxonomy hierarchy. Moreover, this method comprises a routing selection step assigning each unprocessed document to the taxonomy hierarchy based on largest similarity.

    String pattern analysis
    3.
    Granted patent
    Status: lapsed

    Publication No.: US08171039B2

    Publication date: 2012-05-01

    Application No.: US12351527

    Filing date: 2009-01-09

    IPC classification: G06F7/00 G06F17/30

    Abstract: A method of analyzing a string-pattern includes defining a minimum length (Lmin_1) of substrings (STR_A_B) to be considered; defining a maximum length (Lmax_1) of substrings (STR_A_B) to be considered; with a computer, searching the string-pattern for substrings (STR_A_B) with a length in an interval between the minimum length (Lmin_1) and the maximum length (Lmax_1); counting an occurrence (Occ_A_B) of each substring (STR_A_B) found with a length in the interval between the minimum length (Lmin_1) and the maximum length (Lmax_1); and pruning away a number of the substrings (STR_A_B) that meet one or more criteria. The criteria are selected from the group consisting of (1) being contained inside the maximum substring (STR_A_C) in a subset (SET_A) of substrings (STR_A_B), (2) being shorter than the maximum substring (STR_A_C), (3) occurring with a same frequency as the maximum substring (STR_A_C), and combinations thereof.
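
    The count-and-prune idea in this abstract can be sketched as follows. This is a simplified illustration, not the patented procedure: it compares each substring against every longer counted substring instead of building the subsets (SET_A) and maximum substrings (STR_A_C) the abstract mentions, it counts overlapping occurrences, and the function names are hypothetical.

```python
from collections import Counter


def count_substrings(text, lmin, lmax):
    """Count every substring whose length lies in [lmin, lmax], overlaps included."""
    counts = Counter()
    for length in range(lmin, lmax + 1):
        for start in range(len(text) - length + 1):
            counts[text[start:start + length]] += 1
    return counts


def prune(counts):
    """Drop substrings contained in a longer substring that occurs equally often."""
    kept = {}
    for sub, n in counts.items():
        redundant = any(
            len(other) > len(sub) and sub in other and counts[other] == n
            for other in counts
        )
        if not redundant:
            kept[sub] = n
    return kept
```

    For the input "abcabc" with lengths 2 to 3, the substrings "ab" and "bc" are pruned because they occur exactly as often as the longer "abc" that contains them. The quadratic comparison in `prune` is chosen for readability and would not scale to large inputs.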

    Method and data processing system for providing XML data
    4.
    Granted patent
    Status: lapsed

    Publication No.: US08001212B2

    Publication date: 2011-08-16

    Application No.: US12109518

    Filing date: 2008-04-25

    IPC classification: G06F15/16

    Abstract: A method and systems for providing XML data are disclosed. In accordance with an embodiment of the invention, a second data processing system, which is connected to a first data processing system via a network, receives a first request over the network from the first data processing system. The first request comprises specifications for subsequent transfers of XML data from the second data processing system to the first data processing system. The specifications state, for each type of XML document to be transferred in subsequent transfers to the first data processing system, which excerpts of XML data shall be sent. An acknowledge message, sent to the first data processing system from the second data processing system, indicates the latter's ability to provide the excerpts of XML data for the types of XML documents in the subsequent data transfers.

    Processing a Text Search Query in a Collection of Documents
    5.
    Patent application
    Status: in force

    Publication No.: US20080140639A1

    Publication date: 2008-06-12

    Application No.: US12020196

    Filing date: 2008-01-25

    IPC classification: G06F17/30

    Abstract: System and computer program product for processing a text search query in a collection of documents. A full posting index is generated that has first index terms and a full posting list for each first index term, enumerating occurrences of the first index terms in the documents of the collection. A text search query includes search conditions on search terms. The search conditions are translated into conditions on the first index terms to provide translated conditions. At least one short posting index is generated, which includes second index terms and a short posting list for each second index term, enumerating documents in which the second index terms occur. Filter conditions and complementary conditions are generated to represent the translated conditions. The filter conditions approximate the translated conditions, and are processed using the short posting index. The complementary conditions are processed using the full posting index to provide a query result.
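
    One way to picture the split between filter conditions and complementary conditions is a two-word phrase query. The sketch below is an assumption-laden illustration, not the patented system: the short posting index stores only document ids, the full posting index stores word positions, and the names `build_indexes` and `phrase_query` are hypothetical.

```python
from collections import defaultdict


def build_indexes(docs):
    """Build a full index (term -> doc id -> positions) and a short index (term -> doc ids)."""
    full = defaultdict(lambda: defaultdict(list))
    short = defaultdict(set)
    for doc_id, text in docs.items():
        for pos, term in enumerate(text.lower().split()):
            full[term][doc_id].append(pos)
            short[term].add(doc_id)
    return full, short


def phrase_query(first, second, full, short):
    """Return ids of documents in which `second` immediately follows `first`."""
    # Filter condition (approximate, cheap): both terms occur somewhere in the document.
    candidates = short.get(first, set()) & short.get(second, set())
    hits = []
    for doc_id in sorted(candidates):
        # Complementary condition (exact): positions are adjacent, checked in the full index.
        following = set(full[second][doc_id])
        if any(pos + 1 in following for pos in full[first][doc_id]):
            hits.append(doc_id)
    return hits
```

    The short index narrows the candidate set without touching position lists, so the more expensive adjacency check runs only on documents that can still match.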

    Method and System for Processing a Text Search Query in a Collection of Documents
    6.
    Patent application
    Status: in force

    Publication No.: US20080091666A1

    Publication date: 2008-04-17

    Application No.: US11952627

    Filing date: 2007-12-07

    IPC classification: G06F17/30

    Abstract: According to the present invention, a method and an infrastructure are provided for processing a text search query in a collection of documents (100). To this end, a full posting index (200) is generated, stored and updated for each document added to the collection (100). Said full posting index (200) comprises a set of index terms and a full posting list for each index term of said set, enumerating all occurrences of said index term in all documents of the collection (100). In addition to said full posting index (200), at least one additional posting index (400, 500, 600) is generated, stored and updated for each document added to the collection (100). Said additional posting index (400, 500, 600) is related to a defined document part and comprises a set of index terms and a restricted posting list for each index term of said set, enumerating all occurrences of said index term in said document part of all documents of the collection (100). A text search query comprises search conditions on search terms, which are translated into conditions on the index terms of said full posting index (200). Then, said translated conditions of a given text search query are optimized (a) by identifying all conditions of said translated conditions that are restricted to defined document parts for which an additional posting index is available, and (b) by rewriting said identified conditions with part restriction as pair conditions on index terms of said additional posting index (400, 500, 600) and the corresponding document part. Thus, said pair conditions can be processed by using only said additional posting index (400, 500, 600).

    Method and infrastructure for processing queries in a database
    7.
    Granted patent
    Status: lapsed

    Publication No.: US07299224B2

    Publication date: 2007-11-20

    Application No.: US10926591

    Filing date: 2004-08-26

    IPC classification: G06F17/30

    Abstract: Provided is a method for processing queries in a database in which data records have a parametric object and an extension of a nonparametric data type. A query includes a parametric condition for the parametric object of the data records and a nonparametric condition for the nonparametric extension of the data records. Parametric information of each data record is translated into constructs of the data type of the extension. A parametric result set of data records for the parametric condition is generated. The parametric condition of said query is translated into a filter condition for said constructs of the data type of the extension. The nonparametric condition of said query and said filter condition are employed to generate a nonparametric result set. The parametric result set and the nonparametric result set are joined to obtain a result set.
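
    The following sketch illustrates the general idea under strong simplifying assumptions that are not in the abstract: the nonparametric extension is plain text, parametric fields are encoded as synthetic `field=value` tokens alongside the text, and only equality conditions are supported. The record layout and function names are hypothetical.

```python
def tokens(record):
    """Words of the text extension plus synthetic tokens encoding the parametric fields."""
    words = set(record["text"].lower().split())
    synthetic = {f"{key}={value}" for key, value in record["params"].items()}
    return words | synthetic


def run_query(records, param_cond, word):
    """Evaluate an equality condition on parameters together with a word condition on text."""
    # Parametric result set, evaluated on the parametric object itself.
    parametric = {
        rid for rid, rec in records.items()
        if all(rec["params"].get(key) == value for key, value in param_cond.items())
    }
    # The parametric condition translated into a filter condition on the text side.
    filter_tokens = {f"{key}={value}" for key, value in param_cond.items()}
    # Nonparametric result set, already narrowed by the filter condition.
    nonparametric = {
        rid for rid, rec in records.items()
        if word in tokens(rec) and filter_tokens <= tokens(rec)
    }
    # Join of both result sets.
    return sorted(parametric & nonparametric)
```

    Because the filter here is exact, the final join removes nothing in this toy version; with an approximate filter it would discard the remaining false positives.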

    Method and infrastructure for processing queries in a database
    8.
    Patent application
    Status: lapsed

    Publication No.: US20050138024A1

    Publication date: 2005-06-23

    Application No.: US10926591

    Filing date: 2004-08-26

    IPC classification: G06F17/30

    Abstract: According to the present invention, a method and an infrastructure are provided for processing queries in a database (1) of data records, each comprising at least one parametric object with parametric information and at least one extension of a nonparametric datatype, the query comprising at least one parametric condition for the parametric object of the data records and at least one nonparametric condition for the nonparametric extension of the data records. First, at least parts of the parametric information of each data record are translated into constructs of the datatype of the extension. Processing a query comprises evaluation of a parametric result set (2) of data records for the parametric condition. In order to evaluate a nonparametric result set (5) of data records for the nonparametric condition, the parametric condition of said query is translated into at least one filter condition for said constructs of the datatype of the extension. Then, both the nonparametric condition of said query and said filter condition are considered when evaluating a nonparametric result set (5). Finally, the parametric result set (2) and the nonparametric result set (5) are joined to obtain a result set (4) for the query as a whole.

    Persisting of a low latency in-memory database

    Publication No.: US11086850B2

    Publication date: 2021-08-10

    Application No.: US13442900

    Filing date: 2012-04-10

    IPC classification: G06F16/23

    Abstract: Processing is provided for operating an in-memory database, wherein transaction data is stored by a persistence buffer in a FIFO queue, and an update processor subsequently: waits for a trigger; extracts the last transactional data associated with a single transaction of the in-memory database from the FIFO memory queue; determines if the transaction data includes updates to data fields in the in-memory database which were already processed; and if not, then stores the extracted transaction data to a store queue, remembering the fields updated in the in-memory database, or otherwise updates the store queue with the extracted transaction data. The process continues until the extracting is complete, and the content of the store queue is periodically written into a persistent storage device.
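
    The coalescing behaviour of the update processor can be sketched roughly as below. The abstract leaves the extraction order open, so this sketch assumes transactions are drained oldest first and that a later update to an already seen field overwrites the queued value. The trigger and the periodic write to persistent storage are omitted, and the function name is hypothetical.

```python
from collections import deque


def coalesce(fifo):
    """Drain a FIFO of transactions into a store queue holding one entry per field."""
    store_queue = []  # entries of the form [field, value], in first-seen order
    slot = {}         # remembers which store-queue slot holds each field
    while fifo:
        txn = fifo.popleft()
        for field, value in txn.items():
            if field in slot:
                # Field was already processed: update the queued entry in place.
                store_queue[slot[field]][1] = value
            else:
                # First update to this field: append it and remember its slot.
                slot[field] = len(store_queue)
                store_queue.append([field, value])
    return [tuple(entry) for entry in store_queue]
```

    Each transaction is modelled as a dictionary mapping field names to new values; the result contains one write per field rather than one write per transaction.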

    Method and system for processing a text search query in a collection of documents
    10.
    Granted patent
    Status: in force

    Publication No.: US07882107B2

    Publication date: 2011-02-01

    Application No.: US11952627

    Filing date: 2007-12-07

    IPC classification: G06F7/00 G06F17/30

    Abstract: A method, system and computer program product implementing the method are provided to process a text search query in a collection of documents. A full posting index is generated for the documents in the collection. The full posting index comprises one or more first index terms and a full posting list for each first index term, enumerating the occurrences of the first index term in the documents. In addition to the full posting index, at least one additional posting index is generated for the documents. The additional posting index is related to a defined document part and comprises one or more second index terms and a restricted posting list for each second index term, enumerating all occurrences of the second index term in the document part of the documents of the collection. The text search query is performed using the additional posting index.