Detecting Separator Lines in a Web Page
    7.
    发明申请
    Detecting Separator Lines in a Web Page 有权
    检测网页中的分隔线

    公开(公告)号:US20130163873A1

    公开(公告)日:2013-06-27

    申请号:US13812421

    申请日:2010-07-30

    IPC分类号: G06K9/00

    CPC分类号: G06K9/00463 C07D309/28

    摘要: A system and method of detecting separator lines in a web page may include determining coordinates of visible web elements on a web page, generating an edge image of the web page based on the coordinates of the web elements, filtering edges belonging to non-separator line elements within the edge image, detecting horizontal lines within the edge image, detecting vertical lines within the edge image, and filtering short lines within the edge image. A system for detecting separator lines in a web page may include a memory device, and a processor communicatively coupled to the memory, in which the processor determines coordinates of visible web elements on a web page, generates an edge image of the web page based on the coordinates of the web elements, filters edges belonging to non-separator line elements within the edge image, detects horizontal lines within the edge image, detects vertical lines within the edge image, and filters short lines within the edge image.

    摘要翻译: 检测网页中的分隔线的系统和方法可以包括确定网页上的可视网页元素的坐标,基于网页元素的坐标生成网页的边缘图像,过滤属于非分隔线的边 边缘图像内的元素,检测边缘图像内的水平线,检测边缘图像内的垂直线,以及过滤边缘图像内的短线。 用于检测网页中的分隔线的系统可以包括存储器设备和通信地耦合到存储器的处理器,其中处理器确定网页上的可视网页元素的坐标,基于网页生成网页的边缘图像 网页元素的坐标,属于边缘图像内的非分隔线元素的滤镜边缘,检测边缘图像内的水平线,检测边缘图像内的垂直线,并对边缘图像内的短线进行滤波。