Granted invention patent
US08571270B2 Segmentation of a word bitmap into individual characters or glyphs during an OCR process
In force
- Title: Segmentation of a word bitmap into individual characters or glyphs during an OCR process
- Title (Chinese): 在OCR过程中将单词位图分割成单个字符或字形
- Application number: US12776576
- Filing date: 2010-05-10
- Publication number: US08571270B2
- Publication date: 2013-10-29
- Inventor: Djordje Nijemcevic
- Applicant: Djordje Nijemcevic
- Applicant address: US WA Redmond
- Assignee: Microsoft Corporation
- Current assignee: Microsoft Corporation
- Current assignee address: US WA Redmond
- Agent: Mayer & Williams, PC
- Main classification: G06K9/34
- IPC classification: G06K9/34
Abstract:
An image processing apparatus is provided that includes a character chopper component, which segments words into individual characters in a bitmap of a textual image undergoing an OCR process. The character chopper component is configured to produce a set of (possibly curved) chop-lines that divide the bitmap of any given word into its individual character or glyph candidates. Cases where an input bitmap contains two separate words are handled by marking the place where those words should be split. The character segmentation algorithm computes the set of vertically oriented, curved chop-lines by considering glyph and background colors in a given word bitmap. The set is afterwards filtered using various heuristics, in order to preserve the lines that do separate a word's glyphs and minimize the number of those that do not.
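The abstract does not say how the curved chop-lines are computed. As a purely illustrative sketch, and not the patented method, one way to find a curved, vertically oriented line between glyphs is a seam-style dynamic program over a binarized word bitmap: the path may drift by at most one column per row and is scored by how many glyph pixels it crosses. The names `min_ink_path` and `best_chop_line`, the 0/1 pixel encoding, and the one-column drift limit are all assumptions made for this example.

```python
from typing import List, Tuple

Bitmap = List[List[int]]  # assumed encoding: 1 = glyph (ink) pixel, 0 = background


def min_ink_path(bitmap: Bitmap, start_col: int) -> Tuple[float, List[int]]:
    """Cheapest top-to-bottom path that starts at start_col in the top row.

    Returns (ink_pixels_crossed, path), where path[r] is the column at row r.
    The path may shift by at most one column between consecutive rows.
    """
    rows, cols = len(bitmap), len(bitmap[0])
    inf = float("inf")
    cost = [[inf] * cols for _ in range(rows)]   # best cost to reach (r, c)
    back = [[-1] * cols for _ in range(rows)]    # column used in the previous row
    cost[0][start_col] = bitmap[0][start_col]
    for r in range(1, rows):
        for c in range(cols):
            for pc in (c, c - 1, c + 1):         # straight move is tried first
                if 0 <= pc < cols and cost[r - 1][pc] + bitmap[r][c] < cost[r][c]:
                    cost[r][c] = cost[r - 1][pc] + bitmap[r][c]
                    back[r][c] = pc
    end = min(range(cols), key=lambda c: cost[rows - 1][c])
    path = [end]
    for r in range(rows - 1, 0, -1):             # walk the back-pointers upward
        path.append(back[r][path[-1]])
    path.reverse()
    return cost[rows - 1][end], path


def best_chop_line(bitmap: Bitmap) -> Tuple[float, List[int]]:
    """Try every start column and keep the path that crosses the least ink."""
    candidates = [min_ink_path(bitmap, c) for c in range(len(bitmap[0]))]
    return min(candidates, key=lambda t: t[0])


# Two glyphs separated by a slanted gap: no single column is free of ink,
# so a straight vertical cut fails, but a curved path crosses none.
word = [
    [1, 0, 1, 1, 1],
    [1, 1, 0, 1, 1],
    [1, 1, 1, 0, 1],
]
print(best_chop_line(word))  # (0, [1, 2, 3])
```

A fuller pipeline along the lines of the abstract would generate many such candidate lines and then filter them with heuristics; this sketch covers only the generation of a single candidate.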