摘要:
Locating text areas in an image and recognizing text contents in the text areas through optical character recognition, OCR; selecting a first class of pending keywords from the recognized text contents to search for webpages; extracting a second class of pending keywords from the retrieved webpages; and determining one or more keywords corresponding to the image from at least the second class of pending keywords. With the embodiment, both OCR and webpage searching can be combined so that the webpages can be retrieved based upon the first class of pending keywords recognized and selected through OCR to ensure convergence of the keywords and then the second class of pending keywords can be selected from the retrieved webpages to ensure correctness of the keywords.
摘要:
Video frames that contain text areas are selected from given video frames by removing redundant frames and non-text frames, the text areas in the selected frames are located by removing false strokes, and text lines in the text areas are extracted and binarized.
摘要:
Video frames that contain text areas are selected from given video frames by removing redundant frames and non-text frames, the text areas in the selected frames are located by removing false strokes, and text lines in the text areas are extracted and binarized.
摘要:
Video frames that contain text areas are selected from given video frames by removing redundant frames and non-text frames, the text areas in the selected frames are located by removing false strokes, and text lines in the text areas are extracted and binarized.
摘要:
A method and apparatus for generating a degraded character image at various levels of degradation automatically is presented in this invention. The method comprises rendering the character image on a scene plane; translating and rotating the scene plane according to various parameters; determining a projection region of the character image on an image plane according to various parameters; generating a pixel region mask; and generating a final degraded image by super-sampling. Thus various degraded character images are generated on various conditions of degradation. The generated synthetic characters can be used for performance evaluation and training data augmentation in optical character recognition (OCR).
摘要:
Video frames that contain text areas are selected from given video frames by removing redundant frames and non-text frames, the text areas in the selected frames are located by removing false strokes, and text lines in the text areas are extracted and binarized.
摘要:
A data medium handling apparatus and a data medium handling method suitable for use for handling of documents, for example, in a financial organ. The data medium handling apparatus (30) for recognizing, based on an image (19) read from a data medium on which information is described in an arbitrary format, the information, is constructed such that it comprises means (2) for extracting characteristics unique to the data medium including the format from the read image data (19) and specifying, from the characteristics, a position at which information to be recognized is present, and image recognition means (3) for recognizing the image (19) at the position specified by the is preceding means (2) to discriminate the information, so that the data medium handling apparatus (30) can handle documents having various formats such as private slips.
摘要:
A data medium handling apparatus and a data medium handling method suitable for use for handling of documents, for example, in a financial organ. The data medium handling apparatus (30) for recognizing, based on an image (19) read from a data medium on which information is described in an arbitrary format, the information, is constructed such that it comprises means (2) for extracting characteristics unique to the data medium including the format from the read image data (19) and specifying, from the characteristics, a position at which information to be recognized is present, and image recognition means (3) for recognizing the image (19) at the position specified by the preceding means (2) to discriminate the information, so that the data medium handling apparatus (30) can handle documents having various formats such as private slips.
摘要:
A character box extracting unit extracts a line forming a character box. Then, the character box intersection calculating unit calculates the intersection of the character box with a character pattern. An intersection corresponding unit associates intersections with each other based on the directional property of character lines, distance between the character lines, etc. An in-box character extracting unit extracts a virtual image according to the association information between the intersections. A character size evaluating unit obtains from an optional character string an average character size of a character including the virtual image, and extracts a true character pattern by removing a redundant virtual image based on the average character size. A character structure analyzing and evaluating unit obtains from a prepared table a true image corresponding to the virtual image and extracts a true character pattern, thereby correctly extracting the pattern from the image in which the line crosses the pattern.11
摘要:
A method and apparatus for assigning a temporary label to each connected area in an image by scanning the image by using a window which has a size of two pixels in the vertical direction and of a plurality of pixels in the horizontal direction. A set of values of pixels contained in the above window is obtained and one of predetermined temporary label assignment rules corresponding to the obtained set of pixel values is selected. A temporary label is assigned to each pixel contained in the window, based on the above one of the temporary label assignment rules determined as above, and on temporary labels of pixels in the second group in the window at the above each location. In addition, the temporary labels are converted to true labels, by scanning the image pixel within the at least one circumscribing area only, where each circumscribing area is predetermined so that the at least one circumscribing area contains all pixels which do not belong to a background area in the image.