Invention application
US20110029528A1 CITATION RECORD EXTRACTION SYSTEM AND METHOD, AND PROGRAM PRODUCT
Granted
- Patent title: CITATION RECORD EXTRACTION SYSTEM AND METHOD, AND PROGRAM PRODUCT
- Application number: US12834757
- Filing date: 2010-07-12
- Publication number: US20110029528A1
- Publication date: 2011-02-03
- Inventors: Hahn-Ming Lee, Jan-Ming Ho, Shui-Shi Chen, Kai-Hsiang Yang, Ruei-Yuan Wang, Jerome Yeh
- Applicants: Hahn-Ming Lee, Jan-Ming Ho, Shui-Shi Chen, Kai-Hsiang Yang, Ruei-Yuan Wang, Jerome Yeh
- Applicant address: TW Taipei City
- Assignee: NATIONAL TAIWAN UNIVERSITY OF SCIENCE & TECHNOLOGY
- Current assignee: NATIONAL TAIWAN UNIVERSITY OF SCIENCE & TECHNOLOGY
- Current assignee address: TW Taipei City
- Priority: TWTW98126042 20090803
- Main classification: G06F17/30
- IPC classification: G06F17/30
Abstract:
A citation record extraction system is provided. An HTML rendering engine receives a publication list web page and parses it to obtain layout information of the web page. A web page sequence builder generates a web page characteristic sequence for the web page according to the layout information. A web page repeated pattern analyzer analyzes the repeated patterns present in the web page characteristic sequence, screens out non-citation records, and obtains the citation records of the publication list web page.
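The pipeline described in the abstract can be illustrated with a rough sketch. This is not the patented implementation: it assumes that the sequence of opening HTML tag names can stand in for the layout information, that the citation records correspond to the contiguous pattern covering the most positions in that sequence, and it omits the screening of non-citation records entirely. All names (`TagSequenceBuilder`, `build_sequence`, `find_occurrences`, `most_repeated_pattern`, `SAMPLE_HTML`) are hypothetical.

```python
from html.parser import HTMLParser

# Hypothetical publication list page used only for illustration.
SAMPLE_HTML = """
<html><body><h1>Publications</h1><ul>
<li><i>Title A</i> <a href="a.pdf">pdf</a></li>
<li><i>Title B</i> <a href="b.pdf">pdf</a></li>
<li><i>Title C</i> <a href="c.pdf">pdf</a></li>
</ul><p>Contact</p></body></html>
"""


class TagSequenceBuilder(HTMLParser):
    """Records opening-tag names in document order.

    Assumption: tag names are a crude stand-in for real layout information.
    """

    def __init__(self):
        super().__init__()
        self.sequence = []

    def handle_starttag(self, tag, attrs):
        self.sequence.append(tag)


def build_sequence(html):
    """Turn an HTML string into a characteristic sequence of tag names."""
    builder = TagSequenceBuilder()
    builder.feed(html)
    builder.close()
    return builder.sequence


def find_occurrences(sequence, pattern):
    """Return start indices of non-overlapping matches, scanning left to right."""
    pattern = tuple(pattern)
    n = len(pattern)
    if n == 0:
        return []
    starts, i = [], 0
    while i + n <= len(sequence):
        if tuple(sequence[i:i + n]) == pattern:
            starts.append(i)
            i += n  # skip past the match so occurrences never overlap
        else:
            i += 1
    return starts


def most_repeated_pattern(sequence, min_len=2, max_len=6):
    """Pick the repeated contiguous pattern that covers the most positions.

    Ties go to the shorter, earlier pattern. Returns () if nothing repeats.
    """
    best, best_cover = (), 0
    for n in range(min_len, max_len + 1):
        # dict.fromkeys keeps first-seen order, so results are deterministic.
        candidates = dict.fromkeys(
            tuple(sequence[i:i + n]) for i in range(len(sequence) - n + 1)
        )
        for pattern in candidates:
            count = len(find_occurrences(sequence, pattern))
            cover = count * n
            if count >= 2 and cover > best_cover:
                best, best_cover = pattern, cover
    return best


if __name__ == "__main__":
    seq = build_sequence(SAMPLE_HTML)
    pattern = most_repeated_pattern(seq)
    print(pattern, find_occurrences(seq, pattern))
```

Each start index returned by `find_occurrences` marks one candidate record in the sequence. A fuller system would still need to map those positions back to page content and filter out repeated blocks that are not citations, such as navigation menus.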
Published/granted documents
- US08429520B2 Citation record extraction system and method (publication/grant date: 2013-04-23)