Invention application
US20110029528A1 CITATION RECORD EXTRACTION SYSTEM AND METHOD, AND PROGRAM PRODUCT (in force)

Abstract:
A citation record extraction system is provided. An HTML rendering engine receives a publication list web page and parses it to obtain layout information for the page. A web page sequence builder generates a web page characteristic sequence for the page according to the layout information. A web page repeated pattern analyzer analyzes the repeated patterns present in the characteristic sequence, screens out non-citation records, and obtains the citation records of the publication list web page.
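
The three stages named in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the `Block` type stands in for the layout information a rendering engine would produce, the `tag@left` symbol is an assumed way to form a characteristic sequence, and the four-digit-year test is an assumed screening rule. All names here are hypothetical.

```python
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Block:
    """One rendered block; a hypothetical stand-in for the layout information."""
    tag: str   # HTML element name
    left: int  # left edge in pixels
    text: str  # visible text


def build_sequence(blocks):
    """Map each block to a symbol so that visually similar blocks share one."""
    return [f"{b.tag}@{b.left}" for b in blocks]


def extract_citations(blocks):
    """Keep blocks carrying the most repeated symbol, then drop non-citations."""
    sequence = build_sequence(blocks)
    if not sequence:
        return []
    symbol, count = Counter(sequence).most_common(1)[0]
    if count < 2:  # nothing repeats, so no record pattern was found
        return []
    candidates = [b for b, s in zip(blocks, sequence) if s == symbol]
    # Assumed screening rule: a citation mentions a four-digit year.
    year = re.compile(r"\b(19|20)\d{2}\b")
    return [b.text for b in candidates if year.search(b.text)]


blocks = [
    Block("h1", 20, "Publications"),
    Block("li", 40, "A. Author. Title one. Conf A, 2008."),
    Block("li", 40, "B. Author. Title two. Journal B, 2009."),
    Block("li", 40, "Back to top"),
    Block("p", 20, "Last updated in 2010"),
]
citations = extract_citations(blocks)
print(citations)
```

In this toy input the three `li` blocks form the repeated pattern, and the "Back to top" entry is screened out because it matches the pattern but does not look like a citation. A real system would derive the blocks from an actual rendering engine and would likely use a richer pattern analysis than a single most-frequent symbol.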