Granted invention patent
- Title: Method and system for extracting web query interfaces
- Title (Chinese): Web查询界面提取方法和系统
- Application number: US10913721
- Filing date: 2004-08-06
- Publication number: US07552116B2
- Publication date: 2009-06-23
- Inventors: Kevin Chen-Chuan Chang, Zhen Zhang, Bin He
- Applicants: Kevin Chen-Chuan Chang, Zhen Zhang, Bin He
- Applicant address: Urbana, IL, US
- Assignee: The Board of Trustees of the University of Illinois
- Current assignee: The Board of Trustees of the University of Illinois
- Current assignee address: Urbana, IL, US
- Agent: Duane Morris LLP
- Main classification: G06F17/30
- IPC classification: G06F17/30; G06F17/27
Abstract:
A computer program product being embodied on a computer readable medium for extracting semantic information about a plurality of documents being accessible via a computer network, the computer program product including computer-executable instructions for: generating a plurality of tokens from at least one of the documents, each token being indicative of a displayed item and a corresponding position; and, constructing at least one parse tree indicative of a semantic structure of the at least one document from the tokens dependently upon a grammar being indicative of presentation conventions.
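The two steps named in the abstract (tokens that carry a displayed item and its position, then a parse tree built from a grammar of presentation conventions) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the `Token` and `Node` types, the `left_of` spatial test, the pixel coordinates, and the single production "a text label placed to the left of a text box forms a query condition".

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Token:
    """A displayed item plus its position (hypothetical bounding box)."""
    kind: str    # assumed token types: "text" or "textbox"
    value: str   # displayed string or form-field name
    left: int
    top: int
    right: int
    bottom: int


@dataclass
class Node:
    """A parse-tree node: a grammar symbol and its children."""
    symbol: str
    children: List[object] = field(default_factory=list)


def left_of(a: Token, b: Token, max_gap: int = 30) -> bool:
    """True if a ends shortly before b starts and they share a row."""
    gap = b.left - a.right
    overlap = min(a.bottom, b.bottom) - max(a.top, b.top)
    return 0 <= gap <= max_gap and overlap > 0


def parse(tokens: List[Token]) -> Node:
    """Apply one assumed production: Condition -> text LEFT-OF textbox."""
    root = Node("QueryInterface")
    used = set()
    for i, label in enumerate(tokens):
        if label.kind != "text" or i in used:
            continue
        for j, box in enumerate(tokens):
            if box.kind == "textbox" and j not in used and left_of(label, box):
                root.children.append(Node("Condition", [label, box]))
                used.update({i, j})
                break
    return root


# Hypothetical tokens for a two-row search form.
tokens = [
    Token("text", "Author", 10, 10, 60, 30),
    Token("textbox", "author_field", 70, 10, 200, 30),
    Token("text", "Title", 10, 40, 50, 60),
    Token("textbox", "title_field", 70, 40, 200, 60),
]
tree = parse(tokens)
pairs = [(c.children[0].value, c.children[1].value) for c in tree.children]
print(pairs)
```

In this sketch each label is paired with the text box on its own row, because the spatial test requires both a small horizontal gap and vertical overlap. A fuller grammar would presumably need many more productions and a way to resolve competing interpretations, which this example does not attempt.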
Publication/grant documents
- US20060031202A1 Method and system for extracting web query interfaces (publication date: 2006-02-09)