Document page segmentation in optical character recognition

发明授权

US08509534B2 Document page segmentation in optical character recognition 有权

标题翻译：光学字符识别中的文档页面分割

请登陆查看更多内容

专利标题： Document page segmentation in optical character recognition
专利标题（中）： 光学字符识别中的文档页面分割
申请号： US12720943

申请日： 2010-03-10
公开(公告)号： US08509534B2

公开(公告)日： 2013-08-13
发明人: Sasa Galic , Bogdan Radakovic , Nikola Todic
申请人： Sasa Galic , Bogdan Radakovic , Nikola Todic
申请人地址： US WA Redmond
专利权人： Microsoft Corporation
当前专利权人： Microsoft Corporation
当前专利权人地址： US WA Redmond
主分类号： G06K9/36
IPC分类号： G06K9/36

Document page segmentation in optical character recognition

摘要：

Page segmentation in an optical character recognition process is performed to detect textual objects and/or image objects. Textual objects in an input gray scale image are detected by selecting candidates for native lines which are sets of horizontally neighboring connected components (i.e., subsets of image pixels where each pixel from the set is connected with all remaining pixels from the set) having similar vertical statistics defined by values of baseline (the line upon which most text characters “sit”) and mean line (the line under which most of the characters “hang”). Binary classification is performed on the native line candidates to classify them as textual or non-textual through examination of any embedded regularity. Image objects are indirectly detected by detecting the image's background using the detected text to define the background. Once the background is detected, what remains (i.e., the non-background) is an image object.

摘要（中）：

执行光学字符识别处理中的页面分割以检测文本对象和/或图像对象。通过选择作为水平相邻连接分量的集合（即，来自集合的每个像素与集合中的每个像素与集合中的所有剩余像素连接的图像像素的集合），选择具有相似垂直方向的本机线的候选，来检测输入灰度图像中的文本对象由基准值（大多数文本字符“坐”的行）和平均线（大多数字符“挂起”的行）定义的统计信息。对本地候选人执行二进制分类，以便通过审查任何嵌入规律性将其分类为文本或非文本。通过使用检测到的文本检测图像的背景以定义背景来间接检测图像对象。一旦检测到背景，剩余的（即非背景）是图像对象。

公开/授权文献

US20110222769A1 DOCUMENT PAGE SEGMENTATION IN OPTICAL CHARACTER RECOGNITION 公开/授权日：2011-09-15

信息查询

Espacenet