发明授权
- 专利标题: Document page segmentation in optical character recognition
- 专利标题(中): 光学字符识别中的文档页面分割
-
申请号: US12720943申请日: 2010-03-10
-
公开(公告)号: US08509534B2公开(公告)日: 2013-08-13
- 发明人: Sasa Galic , Bogdan Radakovic , Nikola Todic
- 申请人: Sasa Galic , Bogdan Radakovic , Nikola Todic
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Corporation
- 当前专利权人: Microsoft Corporation
- 当前专利权人地址: US WA Redmond
- 主分类号: G06K9/36
- IPC分类号: G06K9/36
摘要:
Page segmentation in an optical character recognition process is performed to detect textual objects and/or image objects. Textual objects in an input gray scale image are detected by selecting candidates for native lines which are sets of horizontally neighboring connected components (i.e., subsets of image pixels where each pixel from the set is connected with all remaining pixels from the set) having similar vertical statistics defined by values of baseline (the line upon which most text characters “sit”) and mean line (the line under which most of the characters “hang”). Binary classification is performed on the native line candidates to classify them as textual or non-textual through examination of any embedded regularity. Image objects are indirectly detected by detecting the image's background using the detected text to define the background. Once the background is detected, what remains (i.e., the non-background) is an image object.
公开/授权文献
信息查询