Abstract:
An apparatus and a method are provided for generating a classifier that detects a specific object in an image. The apparatus includes: a region dividing section that divides, from a sample image, at least one square region whose side length is equal to or shorter than the shorter side of the sample image; a feature extracting section that extracts an image feature from at least some of the square regions divided by the region dividing section; and a training section that performs training based on the extracted image feature to generate the classifier. The apparatus and method make it possible to make full use of the recognizable regions of objects with variable aspect ratios, and to improve the speed and accuracy of recognition against complex backgrounds.
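The region dividing step can be illustrated with a minimal sketch. The function name `square_regions`, the sliding-window layout, and the `stride` parameter are assumptions made for illustration; the abstract only requires that each square's side not exceed the shorter side of the sample image.

```python
def square_regions(width, height, side, stride):
    """Yield (x, y, side) squares that fit entirely inside a width x height image.

    Hypothetical helper: the sliding-window layout and stride are assumptions.
    """
    if side > min(width, height):
        raise ValueError("side must not exceed the shorter side of the image")
    for y in range(0, height - side + 1, stride):
        for x in range(0, width - side + 1, stride):
            yield (x, y, side)


# A 6x4 sample image admits squares of side 4 at three horizontal offsets.
regions = list(square_regions(width=6, height=4, side=4, stride=1))
print(regions)
```

Features would then be extracted from some or all of these squares and passed to whatever training procedure is in use.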
Abstract:
A method for processing a document image includes: performing horizontal and vertical text line extraction on the document image; providing an overlapping matrix, in which the value of each element indicates an overlapping relation between a horizontal text line and a vertical text line; merging the overlapping matrix in the vertical and horizontal directions; determining one or more text overlapping regions in the document image based on the values of the elements of the merged overlapping matrix; counting the total number of strokes or pixel points in the horizontal text lines and in the vertical text lines, respectively, within one of the text overlapping regions; and determining that the orientation of the text overlapping region is horizontal if the total for the horizontal text lines is larger than that for the vertical text lines, and vertical otherwise.
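Two of the steps above lend themselves to a short sketch: building the overlapping matrix and voting on the orientation of one region. Representing text lines as axis-aligned boxes `(x0, y0, x1, y1)` and the function names are assumptions for illustration; the merging step is not shown because the abstract does not specify it.

```python
def boxes_overlap(a, b):
    """True if two axis-aligned boxes (x0, y0, x1, y1) share any area."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def overlap_matrix(h_boxes, v_boxes):
    """Element [i][j] is 1 when horizontal line i overlaps vertical line j."""
    return [[1 if boxes_overlap(h, v) else 0 for v in v_boxes] for h in h_boxes]


def region_orientation(h_counts, v_counts):
    """Compare total stroke (or pixel) counts; ties fall to 'vertical'."""
    return "horizontal" if sum(h_counts) > sum(v_counts) else "vertical"


h_boxes = [(0, 0, 100, 10), (0, 20, 100, 30)]   # candidate horizontal lines
v_boxes = [(5, 0, 15, 50), (200, 0, 210, 50)]   # candidate vertical lines
matrix = overlap_matrix(h_boxes, v_boxes)
orientation = region_orientation([120, 95], [40, 60, 30])
print(matrix, orientation)
```

The tie-breaking toward vertical follows the "otherwise" clause of the abstract.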
Abstract:
Ruled lines are extracted from an image and fitted into a real 2-D space. Correspondence between the fitted cells and the template cells of a ruled line template is determined. For each pair of corresponding cells, the position of each pixel in the template cell is mapped to a real position in the real 2-D space based on an affine transformation between the cells. A pixel value is generated for that pixel in the template cell from the pixel values of a plurality of pixels in the image whose positions are adjacent to the real position. A synthesized image corresponding to the image is generated by merging the ruled lines of the ruled line template with the pixels in the template cells having the generated pixel values. A form template is obtained based on the synthesized images corresponding to a plurality of images.
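The per-pixel mapping can be sketched as an affine transform followed by interpolation over neighbouring pixels. Bilinear interpolation is an assumption here (the abstract only says the value is derived from adjacent pixels), and the 2x3 matrix layout and function names are hypothetical.

```python
import math


def affine_map(matrix, x, y):
    """Map template-cell coordinates to real image coordinates.

    matrix is ((a, b, tx), (c, d, ty)), an assumed 2x3 affine layout.
    """
    (a, b, tx), (c, d, ty) = matrix
    return a * x + b * y + tx, c * x + d * y + ty


def bilinear_sample(image, x, y):
    """Interpolate a value at real position (x, y) from its four neighbours.

    image is a list of rows; positions outside the image are clamped.
    """
    h, w = len(image), len(image[0])
    x = min(max(x, 0.0), w - 1.0)
    y = min(max(y, 0.0), h - 1.0)
    x0, y0 = int(math.floor(x)), int(math.floor(y))
    x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
    fx, fy = x - x0, y - y0
    top = image[y0][x0] * (1 - fx) + image[y0][x1] * fx
    bottom = image[y1][x0] * (1 - fx) + image[y1][x1] * fx
    return top * (1 - fy) + bottom * fy


image = [[0, 10], [20, 30]]
cell_to_image = ((0.5, 0.0, 0.0), (0.0, 0.5, 0.0))  # illustrative scaling only
rx, ry = affine_map(cell_to_image, 1, 1)            # template pixel (1, 1)
value = bilinear_sample(image, rx, ry)
print(rx, ry, value)
```

In practice one affine matrix would be estimated per pair of corresponding cells, for example from the cell corners.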
Abstract:
The present embodiments disclose a method and a device for identifying the direction of characters in an image block. The method includes: performing optical character recognition processing on the image block, taking each of various directions in turn as the assumed character direction, to obtain sub image blocks, the recognized characters corresponding to the sub image blocks, and their correctness measures in each assumed character direction; searching, among the sub image blocks in assumed character directions that differ by 180°, for a minimum matching pair of sub image blocks; adjusting the sub image blocks in the minimum matching pair found, to eliminate the effect on the identification result of differing numbers of sub image blocks in the various assumed character directions; calculating an accumulative correctness measure in each assumed character direction based on the adjusted sub image blocks; and identifying the direction of characters in the image block according to the accumulative correctness measures.
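The final selection step can be sketched in simplified form. Note the substitution: instead of the matching-pair adjustment described above, this sketch averages the correctness measures per direction so that directions with more sub image blocks are not favoured. The function name and the input layout are assumptions.

```python
def identify_direction(measures_by_direction):
    """Return (best_direction, scores) using the mean correctness per direction.

    Simplification: averaging stands in for the matching-pair adjustment.
    """
    scores = {
        direction: sum(measures) / len(measures)
        for direction, measures in measures_by_direction.items()
        if measures
    }
    if not scores:
        raise ValueError("no correctness measures supplied")
    return max(scores, key=scores.get), scores


# Hypothetical OCR correctness measures for each assumed direction (degrees).
measures = {
    0: [0.9, 0.8, 0.95],
    90: [0.2, 0.3],
    180: [0.4, 0.5, 0.3, 0.45],
    270: [0.1],
}
best, scores = identify_direction(measures)
print(best)
```

A plain sum would favour the 180° direction's extra block here less than the averaged score suggests in other data sets, which is the bias the adjustment step in the abstract is meant to remove.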
Abstract:
An image processing method generally includes: obtaining a vanishing point on a curved surface in a two-dimensional image; extracting, using the vanishing point, all the straight line segments between a top contour line and a bottom contour line of the curved surface; removing the perspective distortion to obtain parallel straight line segments; obtaining the lengths of the straight line segments, and obtaining from the lengths the true width of each straight line segment in three-dimensional space and the depth increment of the straight line segments; obtaining the expanded width of each straight line segment according to the true width and the depth increment; obtaining the total expanded width of the curved surface so as to transform it into a flat surface; and transforming the image contents on the curved surface onto the flat surface.
Abstract:
A method and an apparatus for processing an image including a character are disclosed. The method may include: searching a set of characters for one or more characters having the highest shape similarity to a character in the set (hereinafter the first character), the characters found forming a similar character list of the first character; searching the set of characters for one or more characters having the highest shape similarity to each character in the similar character list of the first character, to form a similar character list for each of those characters; and selecting from the similar character lists one or more characters having a high mutual similarity to each other, as a character cluster.
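The clustering idea can be sketched as follows. The similarity table, the list length `k`, and the reading of "high mutual similarity" as "each character appears in the other's similar character list" are all assumptions for illustration.

```python
def similar_list(char, similarity, k):
    """The k characters most similar in shape to char (hypothetical scores)."""
    others = [c for c in similarity if c != char]
    others.sort(key=lambda c: similarity[char][c], reverse=True)
    return others[:k]


def character_cluster(first, similarity, k):
    """Grow a cluster of characters that all appear in each other's lists."""
    candidates = similar_list(first, similarity, k)
    lists = {c: similar_list(c, similarity, k) for c in [first] + candidates}
    cluster = [first]
    for c in candidates:
        # Keep c only if it is mutually similar to every current member.
        if all(m in lists[c] and c in lists[m] for m in cluster):
            cluster.append(c)
    return cluster


# Illustrative, made-up shape similarities between five characters.
similarity = {
    "O": {"0": 0.9, "Q": 0.8, "1": 0.1, "l": 0.1},
    "0": {"O": 0.9, "Q": 0.7, "1": 0.2, "l": 0.1},
    "Q": {"O": 0.8, "0": 0.7, "1": 0.1, "l": 0.1},
    "1": {"O": 0.1, "0": 0.2, "Q": 0.1, "l": 0.9},
    "l": {"O": 0.1, "0": 0.1, "Q": 0.1, "1": 0.9},
}
cluster_o = character_cluster("O", similarity, k=2)
cluster_1 = character_cluster("1", similarity, k=2)
print(cluster_o, cluster_1)
```

Requiring membership in both directions filters out characters such as "0" for the first character "1", which appear in one list only because the list has a fixed length.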
Abstract:
A method includes: locating text areas in an image and recognizing the text contents in the text areas through optical character recognition (OCR); selecting a first class of pending keywords from the recognized text contents to search for webpages; extracting a second class of pending keywords from the retrieved webpages; and determining one or more keywords corresponding to the image from at least the second class of pending keywords. In this embodiment, OCR and webpage searching are combined: the webpages are retrieved based on the first class of pending keywords recognized and selected through OCR, which ensures convergence of the keywords, and the second class of pending keywords is then selected from the retrieved webpages, which ensures correctness of the keywords.
Abstract:
Video frames that contain text areas are selected from given video frames by removing redundant frames and non-text frames; the text areas in the selected frames are located by removing false strokes; and the text lines in the text areas are extracted and binarized.