Granted invention patent
- Patent title: Page classifier engine
- Patent title (Chinese): 页面分类引擎
- Application number: US11949586
- Filing date: 2007-12-03
- Publication number: US08392816B2
- Publication date: 2013-03-05
- Inventors: Bogdan Radakovic, Aleksandar Uzelac, Bodin Dresevic, Oren Trutner
- Applicants: Bogdan Radakovic, Aleksandar Uzelac, Bodin Dresevic, Oren Trutner
- Applicant address: Redmond, WA, US
- Assignee: Microsoft Corporation
- Current assignee: Microsoft Corporation
- Current assignee address: Redmond, WA, US
- Agency: Shook Hardy & Bacon L.L.P.
- Main classification: G06F17/21
- IPC classification: G06F17/21
Abstract:
Embodiments of the present invention relate to classifying pages of an electronic document, such as scanned book pages. OCR software is applied to the contents of the electronic document, revealing semantic information about that content. Software-based features are applied to the semantic information to determine the page type. Page types may include table of contents (TOC), table of figures (TOF), bibliography, index, or other types of pages commonly found in a book, magazine, or other publication. Once determined, the page type is stored and used by other software engines.
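The pipeline the abstract describes (OCR text, then features, then a stored page type) can be illustrated with a minimal Python sketch. The abstract does not disclose the actual features or decision logic, so everything here is an assumption for illustration only: the feature names, the regular expressions, the 0.5 thresholds, the rule order, and the function names `extract_features`, `classify_page`, and `classify_document` are all hypothetical. OCR output is assumed to arrive as a list of text lines per page.

```python
import re

def extract_features(lines):
    """Compute per-page ratios from OCR text lines.

    These features are illustrative guesses, not the patent's features.
    """
    n = max(len(lines), 1)  # avoid division by zero on an empty page

    def ratio(pattern, match_start=False):
        # Fraction of lines on which the pattern is found.
        find = re.match if match_start else re.search
        return sum(bool(find(pattern, ln, re.I)) for ln in lines) / n

    return {
        # Line ends with a number (a likely page reference).
        "ends_with_number": ratio(r"\d+\s*$"),
        # Line contains dot leaders, as in "Chapter 1 ...... 5".
        "dot_leaders": ratio(r"\.{3,}"),
        # Line starts with "Figure 3" or "Fig. 3".
        "figure_refs": ratio(r"\s*(fig\.|figure)\s*\d+", match_start=True),
        # Line contains a plausible publication year (1500-2099).
        "years": ratio(r"\b(1[5-9]|20)\d{2}\b"),
        # Line ends with a comma-separated page list, as in "cache, 7, 19".
        "comma_pages": ratio(r",\s*\d+(\s*,\s*\d+)*\s*$"),
    }

def classify_page(lines):
    """Map features to a page type using hypothetical threshold rules."""
    f = extract_features(lines)
    if f["figure_refs"] >= 0.5 and f["ends_with_number"] >= 0.5:
        return "tof"
    if f["dot_leaders"] >= 0.5 and f["ends_with_number"] >= 0.5:
        return "toc"
    if f["comma_pages"] >= 0.5:
        return "index"
    if f["years"] >= 0.5:
        return "bibliography"
    return "other"

def classify_document(pages):
    """Return {page_number: page_type} so later stages can look up the stored type."""
    return {num: classify_page(lines) for num, lines in enumerate(pages, start=1)}
```

As a usage example, `classify_document([["Chapter 1 Introduction ........ 1", "Chapter 2 Methods ........ 15"], ["It was a bright cold day in April."]])` maps page 1 to `"toc"` and page 2 to `"other"`. Hand-written rules were chosen here only to keep the sketch short; a trained classifier over the same kind of features would be an equally plausible reading of the abstract.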
Published/granted documents
- US20090144605A1 PAGE CLASSIFIER ENGINE (publication/grant date: 2009-06-04)