Granted invention patent
US08494280B2 Automated method for extracting highlighted regions in scanned source
Status: In force
- Patent title: Automated method for extracting highlighted regions in scanned source
- Patent title (Chinese): 扫描源中突出显示区域的自动提取方法
- Application number: US11414053
- Filing date: 2006-04-27
- Publication number: US08494280B2
- Publication date: 2013-07-23
- Inventors: Ramesh Nagarajan, Michael R. Campanelli, Isaiah Simmons
- Applicants: Ramesh Nagarajan, Michael R. Campanelli, Isaiah Simmons
- Applicant address: Norwalk, CT, US
- Assignee: Xerox Corporation
- Current assignee: Xerox Corporation
- Current assignee address: Norwalk, CT, US
- Agents: Luis M. Ortiz; Kermit D. Lopez; Melissa Silverstein
- Main classification: G06K9/46
- IPC classification: G06K9/46
Abstract:
An automated method for extracting highlighted regions in scanned text documents includes color masking of highlight regions, extracting text from the highlighted regions, optically recognizing the characters in the extracted text, and inserting the recognized characters into a new document, so that highlighted text in scanned images can be identified easily. Using a two-layer multi-mask compression technology configured in a scanned export image path, edges and text regions can be extracted; together with the mask coordinates and associated mask colors, all highlighted text can be identified and extracted. Optical Character Recognition (OCR) can then be utilized for appropriate summarization of the different extracted highlighted texts.
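
The first two steps named in the abstract (color masking of highlight regions, then pulling the text pixels out of each masked region) can be illustrated with a small sketch. This is not the patented two-layer multi-mask compression path; it is a simplified stand-in that assumes highlighter ink shows up as bright, saturated pixels and text as dark pixels. The function names and all thresholds are hypothetical, and the OCR step is left out because it would require an external engine.

```python
import colorsys
from collections import deque

def is_highlight(rgb, min_sat=0.4, min_val=0.6):
    """Assumed rule: bright, saturated pixels count as highlighter ink."""
    r, g, b = (c / 255.0 for c in rgb)
    _, s, v = colorsys.rgb_to_hsv(r, g, b)
    return s >= min_sat and v >= min_val

def is_text(rgb, max_val=0.35):
    """Assumed rule: dark pixels count as text strokes."""
    return max(rgb) / 255.0 <= max_val

def highlight_regions(image):
    """Return inclusive (top, left, bottom, right) boxes, one per
    4-connected group of highlight pixels (the 'mask coordinates')."""
    height, width = len(image), len(image[0])
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y in range(height):
        for x in range(width):
            if seen[y][x] or not is_highlight(image[y][x]):
                continue
            top, left, bottom, right = y, x, y, x
            queue = deque([(y, x)])
            seen[y][x] = True
            while queue:  # breadth-first flood fill over highlight pixels
                cy, cx = queue.popleft()
                top, bottom = min(top, cy), max(bottom, cy)
                left, right = min(left, cx), max(right, cx)
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and not seen[ny][nx] and is_highlight(image[ny][nx])):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes

def text_pixels_in(image, box):
    """Collect (row, col) of dark pixels inside one highlight box."""
    top, left, bottom, right = box
    return [(y, x)
            for y in range(top, bottom + 1)
            for x in range(left, right + 1)
            if is_text(image[y][x])]

# Toy page: white background, one yellow highlight containing one dark pixel,
# plus one dark pixel outside the highlight that should be ignored.
WHITE, YELLOW, BLACK = (255, 255, 255), (255, 255, 0), (0, 0, 0)
page = [[WHITE] * 8 for _ in range(6)]
for row in range(1, 4):
    for col in range(2, 7):
        page[row][col] = YELLOW
page[2][4] = BLACK   # text inside the highlight
page[5][0] = BLACK   # text outside the highlight

boxes = highlight_regions(page)
highlighted_text = [text_pixels_in(page, box) for box in boxes]
```

In a fuller pipeline, each box would be cropped from the scan and handed to an OCR engine, and the recognized strings would be written into the new document. Keeping the mask color alongside each box, as the abstract describes, would allow text to be grouped by highlighter color.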