发明授权
US07702814B2 System and method for downloading hypertext markup language formatted web pages
失效
用于下载超文本标记语言格式的网页的系统和方法
- 专利标题: System and method for downloading hypertext markup language formatted web pages
- 专利标题(中): 用于下载超文本标记语言格式的网页的系统和方法
-
申请号: US11756593申请日: 2007-05-31
-
公开(公告)号: US07702814B2公开(公告)日: 2010-04-20
- 发明人: Chung-I Lee , Chien-Fa Yeh , Chiu-Hua Lu , Zhi-Qiang Jiang
- 申请人: Chung-I Lee , Chien-Fa Yeh , Chiu-Hua Lu , Zhi-Qiang Jiang
- 申请人地址: CN Shenzhen, Guangdong Province TW Tu-Cheng, Taipei Hsien
- 专利权人: Hong Fu Jin Precision Industry (ShenZhen) Co., Ltd.,Hon Hai Precision Industry Co., Ltd.
- 当前专利权人: Hong Fu Jin Precision Industry (ShenZhen) Co., Ltd.,Hon Hai Precision Industry Co., Ltd.
- 当前专利权人地址: CN Shenzhen, Guangdong Province TW Tu-Cheng, Taipei Hsien
- 代理商 Wei Te Chung
- 优先权: CN200610062196 20060818
- 主分类号: G06F15/16
- IPC分类号: G06F15/16
摘要:
A method for downloading HTML formatted Web pages is provided. The method includes the steps of writing a URL of a Web page to be downloaded to an XQuery script; analyzing the XQuery script to obtain the URL of the HTML Web page and saving the downloaded Web page in a database as the local Web page; analyzing the contents of the local Web page to obtain target contents; converting the relative URLs of all image files to the absolute URLs; downloading all the image files according to the absolute URLs; replacing the absolute URLs of the image files with an local image file path; converting the relative URLs of the embedded links to the absolute URLs of the embedded links; saving all the converted absolute URLs in the database, creating identifiers; replacing the converted absolute URLs of the embedded links with an embedded link local path. A related system is also disclosed.
公开/授权文献
信息查询