发明申请
US20050114298A1 System and method for indexing weighted-sequences in large databases
有权
用于索引大数据库中加权序列的系统和方法
- 专利标题: System and method for indexing weighted-sequences in large databases
- 专利标题(中): 用于索引大数据库中加权序列的系统和方法
-
申请号: US10723229申请日: 2003-11-26
-
公开(公告)号: US20050114298A1公开(公告)日: 2005-05-26
- 发明人: Wei Fan , Chang-Shing Perng , Haixun Wang , Philip Yu
- 申请人: Wei Fan , Chang-Shing Perng , Haixun Wang , Philip Yu
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
The present invention provides an index structure for managing weighted-sequences in large databases. A weighted-sequence is defined as a two-dimensional structure in which each element in the sequence is associated with a weight. A series of network events, for instance, is a weighted-sequence because each event is associated with a timestamp. Querying a large sequence database by events' occurrence patterns is a first step towards understanding the temporal causal relationships among the events. The index structure proposed herein enables the efficient retrieval from the database of all subsequences (contiguous and non-contiguous) that match a given query sequence both by events and by weights. The index structure also takes into consideration the nonuniform frequency distribution of events in the sequence data.
公开/授权文献
信息查询