发明授权
- 专利标题: String pattern analysis
- 专利标题(中): 字符串模式分析
-
申请号: US12351527申请日: 2009-01-09
-
公开(公告)号: US08171039B2公开(公告)日: 2012-05-01
- 发明人: Andreas Arning , Roland Seiffert
- 申请人: Andreas Arning , Roland Seiffert
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 代理机构: Van Cott, Bagley, Cornwall & McCarthy PC
- 代理商 Steven L. Nichols
- 优先权: EP08100342 20080111
- 主分类号: G06F7/00
- IPC分类号: G06F7/00 ; G06F17/30
摘要:
A method of analyzing a string-pattern includes defining a minimum length (Lmin—1) of substrings (STR_A_B) to be considered; defining a maximum length (Lmax—1) of substrings (STR_A_B) to be considered; with a computer, searching the string-pattern for substrings (STR_A_B) with a length in an interval between the minimum length (Lmin—1) and the maximum length (Lmax—1); counting an occurrence (Occ_A_B) of each substring (STR_A_B) found with a length in the interval between the minimum length (Lmin—1) and the maximum length (Lmax—1); and pruning away a number of the substrings (STR_A_B) that meet one or more criteria. The criteria are selected from the group consisting of (1) being contained inside the maximum substring (STR_A_C) in a subset (SET_A) of substrings (STR_A_B), (2) being shorter than the maximum substring (STR_A_C), (3) occurring with a same frequency as the maximum substring (STR_A_C), and combinations thereof.
公开/授权文献
- US20090182744A1 STRING PATTERN ANALYSIS 公开/授权日:2009-07-16
信息查询