发明授权
- 专利标题: Comparatively crawling web page data records relative to a template
- 专利标题(中): 相对于模板抓取网页数据记录
-
申请号: US11456753申请日: 2006-07-11
-
公开(公告)号: US07555480B2公开(公告)日: 2009-06-30
- 发明人: Benyu Zhang , Chenxi Lin , Hua-Jun Zeng , Jian Wang , Ke Tang , Zheng Chen
- 申请人: Benyu Zhang , Chenxi Lin , Hua-Jun Zeng , Jian Wang , Ke Tang , Zheng Chen
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Corporation
- 当前专利权人: Microsoft Corporation
- 当前专利权人地址: US WA Redmond
- 代理机构: Lee & Hayes, PLLC
- 主分类号: G06F17/30
- IPC分类号: G06F17/30 ; G06F17/00 ; G06F12/00 ; G06F3/048
摘要:
The invention provides a method of interactively crawling data records on a web page. Users may select various data records of interest on a web page to generate templates to search for similar data items on the same web page or on different web pages. A tree matching algorithm may be used to compare and extract data matching the generated template.
公开/授权文献
- US20080016087A1 INTERACTIVELY CRAWLING DATA RECORDS ON WEB PAGES 公开/授权日:2008-01-17
信息查询