发明授权
US07418455B2 System and method for indexing weighted-sequences in large databases
有权
用于索引大数据库中加权序列的系统和方法
- 专利标题: System and method for indexing weighted-sequences in large databases
- 专利标题(中): 用于索引大数据库中加权序列的系统和方法
-
申请号: US10723229申请日: 2003-11-26
-
公开(公告)号: US07418455B2公开(公告)日: 2008-08-26
- 发明人: Wei Fan , Chang-Shing Perng , Haixun Wang , Philip Shi-Lung Yu
- 申请人: Wei Fan , Chang-Shing Perng , Haixun Wang , Philip Shi-Lung Yu
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 代理机构: F. Chau & Associates, LLC
- 主分类号: G06F7/00
- IPC分类号: G06F7/00 ; G06F17/00
摘要:
The present invention provides an index structure for managing weighted-sequences in large databases. A weighted-sequence is defined as a two-dimensional structure in which each element in the sequence is associated with a weight. A series of network events, for instance, is a weighted-sequence because each event is associated with a timestamp. Querying a large sequence database by events' occurrence patterns is a first step towards understanding the temporal causal relationships among the events. The index structure proposed herein enables the efficient retrieval from the database of all subsequences (contiguous and non-contiguous) that match a given query sequence both by events and by weights. The index structure also takes into consideration the nonuniform frequency distribution of events in the sequence data.
公开/授权文献
信息查询