Invention Patent Application
- Patent title: INTERACTIVELY CRAWLING DATA RECORDS ON WEB PAGES
- Patent title (Chinese): 互联网络数据记录在网页上
- Application number: US11456753
- Filing date: 2006-07-11
- Publication number: US20080016087A1
- Publication date: 2008-01-17
- Inventors: Benyu Zhang, Chenxi Lin, Hua-Jun Zeng, Jian Wang, Ke Tang, Zheng Chen
- Applicants: Benyu Zhang, Chenxi Lin, Hua-Jun Zeng, Jian Wang, Ke Tang, Zheng Chen
- Applicant address: US WA Redmond
- Assignee: One Microsoft Way
- Current assignee: One Microsoft Way
- Current assignee address: US WA Redmond
- Main classification: G06F7/00
- IPC classification: G06F7/00
Abstract:
The invention provides a method of interactively crawling data records on a web page. Users may select various data records of interest on a web page to generate templates to search for similar data items on the same web page or on different web pages. A tree matching algorithm may be used to compare and extract data matching the generated template.
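The abstract does not say which tree matching algorithm is used. As an illustration only, the sketch below assumes the classic Simple Tree Matching approach: the user-selected record becomes a template tree, and candidate subtrees are scored by how many nodes they share with it in the same order. The names `Node`, `simple_tree_match`, `find_similar`, and the `threshold` value are hypothetical and do not come from the application.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Node:
    """A minimal DOM-like node: a tag name plus ordered children (hypothetical type)."""
    tag: str
    children: list[Node] = field(default_factory=list)


def size(node: Node) -> int:
    """Count the nodes in the subtree rooted at `node`."""
    return 1 + sum(size(child) for child in node.children)


def simple_tree_match(a: Node, b: Node) -> int:
    """Return the number of matched node pairs in a maximum
    order-preserving matching between trees `a` and `b`."""
    if a.tag != b.tag:
        return 0
    m, n = len(a.children), len(b.children)
    # dp[i][j]: best score using the first i children of a and the first j of b.
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            dp[i][j] = max(
                dp[i - 1][j],
                dp[i][j - 1],
                dp[i - 1][j - 1]
                + simple_tree_match(a.children[i - 1], b.children[j - 1]),
            )
    return dp[m][n] + 1  # +1 for the matched roots


def find_similar(template: Node, candidates: list[Node],
                 threshold: float = 0.8) -> list[Node]:
    """Keep candidates whose normalized match score reaches `threshold`
    (the threshold is an assumed tuning parameter)."""
    results = []
    for cand in candidates:
        score = simple_tree_match(template, cand) / max(size(template), size(cand))
        if score >= threshold:
            results.append(cand)
    return results
```

For example, a template built from one selected list item could be passed to `find_similar` together with every sibling subtree on the page; lowering `threshold` admits records that carry extra or missing fields.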