发明授权
- 专利标题: Detecting separator lines in a web page
- 专利标题(中): 检测网页中的分隔线
-
申请号: US13812421申请日: 2010-07-30
-
公开(公告)号: US08867837B2公开(公告)日: 2014-10-21
- 发明人: Hui-Man Hou , Li-Wei Zheng , Jian-Ming Jin , Jian Fan , Suk Hwan Lim
- 申请人: Hui-Man Hou , Li-Wei Zheng , Jian-Ming Jin , Jian Fan , Suk Hwan Lim
- 申请人地址: US TX Houston
- 专利权人: Hewlett-Packard Development Company, L.P.
- 当前专利权人: Hewlett-Packard Development Company, L.P.
- 当前专利权人地址: US TX Houston
- 国际申请: PCT/CN2010/001156 WO 20100730
- 国际公布: WO2012/012915 WO 20120202
- 主分类号: G06K9/34
- IPC分类号: G06K9/34 ; C07D309/28 ; G06K9/00
摘要:
A system and method of detecting separator lines in a web page may include determining coordinates of visible web elements on a web page, generating an edge image of the web page based on the coordinates of the web elements, filtering edges belonging to non-separator line elements within the edge image, detecting horizontal lines within the edge image, detecting vertical lines within the edge image, and filtering short lines within the edge image. A system for detecting separator lines in a web page may include a memory device, and a processor communicatively coupled to the memory, in which the processor determines coordinates of visible web elements on a web page, generates an edge image of the web page based on the coordinates of the web elements, filters edges belonging to non-separator line elements within the edge image, detects horizontal lines within the edge image, detects vertical lines within the edge image, and filters short lines within the edge image.
公开/授权文献
- US20130163873A1 Detecting Separator Lines in a Web Page 公开/授权日:2013-06-27