Granted invention patent
US07730013B2 System and method for searching dates efficiently in a collection of web documents
Expired
- Patent title: System and method for searching dates efficiently in a collection of web documents
- Application number: US11259664
- Filing date: 2005-10-25
- Publication number: US07730013B2
- Publication date: 2010-06-01
- Inventors: Stephen Dill, Madhukar R. Korupolu
- Applicants: Stephen Dill, Madhukar R. Korupolu
- Applicant address: Armonk, NY, US
- Patentee: International Business Machines Corporation
- Current patentee: International Business Machines Corporation
- Current patentee address: Armonk, NY, US
- Agency: Shimokaji & Associates, P.C.
- Agent: Samuel A. Kassatly
- Main classification: G06F17/30
- IPC classification: G06F17/30; G06F17/00
Abstract:
A date querying system processes free-form text in documents to identify and locate some or all of the dates in the documents using extended regular expression matching to capture various date formats. The system packages a canonicalized format of each identified date to support various types of queries such as, for example, specific date querying, hierarchical date querying, range date querying, proximity queries comprising a date and any keywords, and any combination of types of queries. The system scans a document to identify the various format dates occurring in the document, disambiguates the resulting occurrences of dates, and canonicalizes the dates according to one or more predetermined formats.
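The scan, disambiguate, and canonicalize steps described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. The three patterns, the month-first default for ambiguous slash dates, the ISO `YYYY-MM-DD` canonical form, and all function names (`extract_dates`, `matches_prefix`, `in_range`, `near_keyword`) are assumptions chosen for illustration, and plain `re` patterns stand in for the "extended regular expression matching" the abstract mentions.

```python
import re
from datetime import date

MONTHS = {name: i for i, name in enumerate(
    ["jan", "feb", "mar", "apr", "may", "jun",
     "jul", "aug", "sep", "oct", "nov", "dec"], start=1)}

# Hypothetical patterns for three common date formats.
PATTERNS = [
    # 2005-10-25
    ("iso", re.compile(r"\b(?P<y>\d{4})-(?P<m>\d{1,2})-(?P<d>\d{1,2})\b")),
    # 10/25/2005 or 25/10/2005
    ("slash", re.compile(r"\b(?P<a>\d{1,2})/(?P<b>\d{1,2})/(?P<y>\d{4})\b")),
    # October 25, 2005 or Oct 25 2005
    ("named", re.compile(
        r"\b(?P<mon>jan|feb|mar|apr|may|jun|jul|aug|sep|oct|nov|dec)[a-z]*\.?"
        r"\s+(?P<d>\d{1,2}),?\s+(?P<y>\d{4})\b", re.IGNORECASE)),
]

def extract_dates(text):
    """Return (offset, canonical ISO date) pairs found in text, in text order."""
    found = []
    for kind, pattern in PATTERNS:
        for m in pattern.finditer(text):
            g = m.groupdict()
            year = int(g["y"])
            if kind == "iso":
                month, day = int(g["m"]), int(g["d"])
            elif kind == "named":
                month, day = MONTHS[g["mon"].lower()], int(g["d"])
            else:
                # Disambiguation (assumed rule): a first field above 12 must
                # be a day; otherwise default to month-first.
                a, b = int(g["a"]), int(g["b"])
                month, day = (b, a) if a > 12 else (a, b)
            try:
                canonical = date(year, month, day).isoformat()
            except ValueError:
                continue  # not a real calendar date, e.g. 13/13/2005
            found.append((m.start(), canonical))
    return sorted(found)

def matches_prefix(dates, prefix):
    """Hierarchical query: '2010' or '2010-06' matches every date below it."""
    return [d for _, d in dates if d.startswith(prefix)]

def in_range(dates, start, end):
    """Range query: ISO strings with four-digit years sort chronologically."""
    return [d for _, d in dates if start <= d <= end]

def near_keyword(text, dates, keyword, window=40):
    """Proximity query: dates starting within `window` characters of keyword."""
    hits = []
    for m in re.finditer(re.escape(keyword), text, re.IGNORECASE):
        hits += [d for off, d in dates if abs(off - m.start()) <= window]
    return hits
```

Storing every date in one canonical form is what lets a specific-date query be an equality test, a hierarchical query a prefix test, and a range query a string comparison. A production system would need many more patterns and a stricter disambiguation policy than the month-first default assumed here.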