发明授权
- 专利标题: Enabling a web-crawling robot to collect information from web sites that tailor information content to the capabilities of accessing devices
- 专利标题(中): 启用网络抓取机器人从网站收集信息,将信息内容定制到访问设备的功能
-
申请号: US10751767申请日: 2004-01-05
-
公开(公告)号: US07536445B2公开(公告)日: 2009-05-19
- 发明人: Takafumi Kinoshita
- 申请人: Takafumi Kinoshita
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 代理机构: Hoffman Warnick LLC
- 代理商 Robert Straight
- 优先权: JP2003-047983 20030225
- 主分类号: G06F15/16
- IPC分类号: G06F15/16
摘要:
A web-crawling robot retrieves information from a web server that tailors information content to the capability of an accessing device. A link deriving unit in a proxy server for relaying data exchanged between the robot and the site analyzes a response from the site to the robot, and acquires information on a user agent corresponding to a particular kind of content of a link destination. On the basis of the information, a user agent information editing unit in the proxy server adds user agent information to the content retrieval request from the web-crawling robot to the site so as to disguise it as a content retrieval request issued from a given user agent, thereby acquiring a response corresponding to capabilities of the user agent.
公开/授权文献
信息查询