发明申请
US20120089903A1 SELECTIVE CONTENT EXTRACTION 有权
选择性内容提取

SELECTIVE CONTENT EXTRACTION
摘要:
A method for extracting web content includes detecting, within a web page, a hierarchical structure that includes a plurality of nodes. Potential article nodes from the plurality of nodes are identified. The identified potential article node with a highest rank in the hierarchical structure is identified as an article node. Content is extracted from the article node.
公开/授权文献
信息查询
0/0