发明申请
- 专利标题: METHODS AND SYSTEMS FOR EFFICIENT AUTOMATED SYMBOL RECOGNITION
- 专利标题(中): 有效自动符号识别的方法和系统
-
申请号: US14508492申请日: 2014-10-07
-
公开(公告)号: US20150213330A1公开(公告)日: 2015-07-30
- 发明人: Yuri Chulinin
- 申请人: ABBYY Development LLC
- 优先权: RU2014103152 20140130
- 主分类号: G06K9/62
- IPC分类号: G06K9/62 ; G06K9/82 ; G06F17/28 ; G06K9/78
摘要:
The current document is directed to methods and systems for identifying symbols corresponding to symbol images in a scanned-document image or other text-containing image, with the symbols corresponding to Chinese or Japanese characters, to Korean morpho-syllabic blocks, or to symbols of other languages that use a large number of symbols for writing and printing. In one implementation, the methods and systems to which the current document is directed carry out an initial processing step on one or more scanned images to identify a subset of the total number of symbols frequently used in the scanned document image or images. One or more lists of graphemes for the language of the text are then ordered in most-likely-occurring to least-likely-occurring order to facilitate a second optical-character-recognition step in which symbol images extracted from the one or more scanned-document images are associated with one or more graphemes most likely to correspond to the scanned symbol image.
公开/授权文献
信息查询