Granted invention patent
US07013309B2 Method and apparatus for extracting anchorable information units from complex PDF documents (no longer in force)

Abstract:
A method is described for extracting Anchorable Information Units (AIUs) from a Portable Document Format (PDF) file, which may be created either with an editor or by scanning documents. The method includes parsing the PDF document into textual portions and non-text portions, and extracting structure from both the textual portions and the non-text portions. The method further includes determining the text within the textual portions and the text within the non-text portions, and hyperlinking a plurality of keywords within the textual portions and non-text portions to a related document.
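
The final hyperlinking step can be illustrated with a minimal Python sketch. This is not the patented implementation: it assumes parsing and text determination have already happened, so each portion arrives with its text (for a non-text portion, presumably from an OCR step). The names `Block`, `AIU`, and `find_aius`, and the keyword-to-target mapping, are hypothetical and chosen only for illustration.

```python
import re
from dataclasses import dataclass


@dataclass
class Block:
    """One parsed portion of a page (hypothetical structure)."""
    kind: str  # "text" or "non-text" (e.g. a scanned image region)
    text: str  # text determined for the portion; assumed to be available already


@dataclass
class AIU:
    """A keyword occurrence in a block together with its link target."""
    block_index: int
    start: int
    end: int
    keyword: str
    target: str


def find_aius(blocks, links):
    """Return one AIU per whole-word, case-insensitive keyword match."""
    aius = []
    for i, block in enumerate(blocks):
        for keyword, target in links.items():
            # Escape the keyword so it is matched literally, not as a regex.
            pattern = r"\b" + re.escape(keyword) + r"\b"
            for m in re.finditer(pattern, block.text, flags=re.IGNORECASE):
                aius.append(AIU(i, m.start(), m.end(), keyword, target))
    return aius


# Example input: one textual portion and one non-text portion.
blocks = [
    Block("text", "The pump feeds the valve."),
    Block("non-text", "VALVE assembly"),
]
aius = find_aius(blocks, {"valve": "valve_manual.pdf"})
```

Each resulting record carries the block index and character span of a keyword occurrence, which is the information a viewer would need to render the keyword as a link to the related document.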