Invention Grant
- Patent Title: Document image processing apparatus
- Patent Title (中): 文件图像处理装置
-
Application No.: US11972477Application Date: 2008-01-10
-
Publication No.: US08160402B2Publication Date: 2012-04-17
- Inventor: Bo Wu , Jianjun Dou , Ning Le , Yadong Wu , Jing Jia
- Applicant: Bo Wu , Jianjun Dou , Ning Le , Yadong Wu , Jing Jia
- Applicant Address: JP Osaka
- Assignee: Sharp Kabushiki Kaisha
- Current Assignee: Sharp Kabushiki Kaisha
- Current Assignee Address: JP Osaka
- Agency: Birch, Stewart, Kolasch & Birch, LLP
- Priority: CN200710129607 20070723
- Main IPC: G06K9/03
- IPC: G06K9/03 ; G06K9/18

Abstract:
An image of a character string composed of M pieces of characters is clipped from a document image, and the image is divided character by character, and image features of each character image are extracted. On the basis of the image features, N (N>1, integer) pieces of character images in descending order of degree of similarity are selected as candidate characters from a character image feature dictionary which stores the image features of character image in units of character, and the first index matrix of M×N cells is prepared. A candidate character string composed of a plurality of candidate characters constituting the first column of the first index matrix, is subjected to a lexical analysis according to a predetermined language model, whereby a second index matrix adjusted into a character string which makes sense is prepared to be utilized for searching.
Public/Granted literature
Information query