发明授权
US08571270B2 Segmentation of a word bitmap into individual characters or glyphs during an OCR process 有权
在OCR过程中将单词位图分割成单个字符或字形

  • 专利标题: Segmentation of a word bitmap into individual characters or glyphs during an OCR process
  • 专利标题(中): 在OCR过程中将单词位图分割成单个字符或字形
  • 申请号: US12776576
    申请日: 2010-05-10
  • 公开(公告)号: US08571270B2
    公开(公告)日: 2013-10-29
  • 发明人: Djordje Nijemcevic
  • 申请人: Djordje Nijemcevic
  • 申请人地址: US WA Redmond
  • 专利权人: Microsoft Corporation
  • 当前专利权人: Microsoft Corporation
  • 当前专利权人地址: US WA Redmond
  • 代理机构: Mayer & Williams, PC
  • 主分类号: G06K9/34
  • IPC分类号: G06K9/34
Segmentation of a word bitmap into individual characters or glyphs during an OCR process
摘要:
An image processing apparatus is provided that includes a character chopper component that segments words into individual characters in a bitmap of a textual image undergoing an OCR process. The Character chopper component is configured to produce a set of (possibly curved) chop-lines which divide a bitmap of any given word into its individual character or glyph candidates. Cases where an input bitmap contains two separate words are handled by marking a place where those words should be split. The character segmentation algorithm computes the set of vertically oriented, curved chop-lines by considering glyph and background colors in a given word bitmap. The set is filtered afterwards using various heuristics, in order to preserve those lines that indeed do separate a word's glyphs and minimize the number of those that do not.
信息查询
0/0