Abstract:
A document image search apparatus generates a text by performing character recognition on a document image and determines a re-process scope. The apparatus then generates a candidate character lattice from the re-recognition result of the re-process scope, generates character strings from the candidate character lattice, and adds the character strings to the text. Finally, the apparatus performs an index search using the text with the character strings added.
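The step from candidate character lattice to added character strings can be illustrated with a minimal sketch. It assumes a simplified lattice in which each character position holds a list of (character, score) alternatives; the abstract does not specify the lattice structure, the scoring, or any limit on the number of strings, so `expand_lattice`, `max_strings`, and the product-of-scores ranking are hypothetical choices made for illustration only.

```python
from itertools import product

def expand_lattice(lattice, max_strings=100):
    """Enumerate candidate strings from a simplified character lattice.

    `lattice` is a list of positions; each position is a list of
    (character, score) alternatives. Each string is scored by the
    product of its character scores (an assumption) and the strings
    are returned best-first, ties broken alphabetically.
    """
    scored = []
    for path in product(*lattice):
        text = "".join(ch for ch, _ in path)
        score = 1.0
        for _, s in path:
            score *= s
        scored.append((score, text))
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [text for _, text in scored[:max_strings]]

# Two positions re-recognized inside the re-process scope.
lattice = [
    [("c", 0.9), ("e", 0.1)],
    [("l", 0.6), ("1", 0.4)],
]
candidates = expand_lattice(lattice)

# Append the candidate strings to the OCR text before indexing.
ocr_text = "first pass text"
augmented_text = ocr_text + " " + " ".join(candidates)
```

Adding several alternatives rather than only the top one lets an index search still match a query when the first-pass recognition chose the wrong character.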
Abstract:
A pointing information extraction unit extracts pointing information, indicating a pointing position and a pointing time on a slide, from a slide file used in a lecture and a video file of the lecture in which a pointing device is used. A word information generation unit analyzes the text extracted from the slide file to generate a word information file indicating each word and its position. A word pointing information generation unit estimates the word closest to the pointing position on the slide and generates a word pointing information file with the pointing time assigned. A fill-in-the-blank word extraction unit extracts, from the word pointing information, each word whose pointing time is equal to or longer than a predetermined time and stores it in a fill-in-the-blank word file. A fill-in-the-blank test question is then generated by setting the fill-in-the-blank words in the slide information as blank regions.
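The pipeline above can be sketched roughly as follows. The data layout (dictionaries with `x`, `y`, `seconds`), the use of Euclidean distance to word centres, and the accumulation of pointing time across separate pointing events are all assumptions; the abstract only says that the closest word is estimated and that a time threshold is applied.

```python
import math
from collections import defaultdict

def nearest_word(words, x, y):
    """Return the text of the word whose centre is closest to (x, y)."""
    return min(words, key=lambda w: math.hypot(w["x"] - x, w["y"] - y))["text"]

def blank_words(words, pointings, min_seconds):
    """Sum pointing time per nearest word and keep words at or above the threshold."""
    dwell = defaultdict(float)
    for p in pointings:
        dwell[nearest_word(words, p["x"], p["y"])] += p["seconds"]
    return sorted(w for w, t in dwell.items() if t >= min_seconds)

def make_question(sentence, blanks):
    """Replace each selected word in the sentence with underscores."""
    for w in blanks:
        sentence = sentence.replace(w, "_" * len(w))
    return sentence

words = [
    {"text": "lattice", "x": 10, "y": 10},
    {"text": "index", "x": 100, "y": 10},
]
pointings = [
    {"x": 12, "y": 11, "seconds": 2.0},
    {"x": 9, "y": 8, "seconds": 1.5},
    {"x": 98, "y": 12, "seconds": 0.5},
]
blanks = blank_words(words, pointings, min_seconds=3.0)
question = make_question("The lattice feeds the index.", blanks)
```

Here "lattice" collects 3.5 seconds of pointing and becomes a blank, while "index" stays below the threshold and is left visible.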
Abstract:
A grayscale character dictionary generation apparatus comprises: a first synthetic grayscale degraded character image generation unit that generates first synthetic grayscale degraded character images from input binary character images; a clustering unit that divides each category of the first synthetic grayscale degraded character images into a plurality of clusters; a template generation unit that generates a template for each of the clusters; a transformation matrix generation unit that generates a transformation matrix for each of the templates; and a second synthetic grayscale degraded character dictionary generation unit that uses the transformation matrices to obtain the character features of every grayscale degraded character in each cluster and constructs an eigenspace for each category of synthetic grayscale degraded characters, the eigenspaces constituting the second synthetic grayscale character dictionary.
Abstract:
A character recognition apparatus and method for recognizing characters in an image. The apparatus comprises a text line extraction unit that extracts a plurality of text lines from an input image; a feature recognition unit that recognizes one or more features of each text line; a synthetic pattern generation unit that generates synthetic character images for each text line from the recognized features and the original character images; a synthetic dictionary generation unit that generates a synthetic dictionary for each text line from the synthetic character images; and a text line recognition unit that recognizes the characters in each text line using the synthetic dictionary.
Abstract:
Video frames that contain text areas are selected from the given video frames by removing redundant frames and non-text frames. The text areas in the selected frames are located by removing false strokes, and the text lines in those areas are extracted and binarized.
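As an illustration of the first stage only, the sketch below drops redundant frames by comparing each frame with the last frame that was kept. The abstract does not say how redundancy is measured; the mean absolute grayscale difference and the `threshold` value are assumptions, and the non-text frame removal, false stroke removal, and binarization steps are not covered.

```python
def mean_abs_diff(a, b):
    """Mean absolute pixel difference between two equal-size grayscale frames."""
    total = 0
    count = 0
    for row_a, row_b in zip(a, b):
        for pa, pb in zip(row_a, row_b):
            total += abs(pa - pb)
            count += 1
    return total / count

def drop_redundant(frames, threshold=10.0):
    """Return the indices of frames that differ enough from the last kept frame."""
    kept = []
    for i, frame in enumerate(frames):
        if not kept or mean_abs_diff(frames[kept[-1]], frame) >= threshold:
            kept.append(i)
    return kept

# Tiny 2x2 grayscale frames: frames 1 and 3 are near-copies of 0 and 2.
frames = [
    [[0, 0], [0, 0]],
    [[1, 0], [0, 2]],
    [[200, 200], [0, 0]],
    [[200, 201], [0, 0]],
]
kept = drop_redundant(frames)
```

Comparing against the last kept frame, rather than the immediately preceding one, avoids keeping a long run of frames that each drift only slightly from their neighbour.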
Abstract:
The present invention provides an apparatus, a method, and a program therefor that automatically establish synchronous correspondence between images. Given OHP files and a moving image of a lecture or the like that uses those OHP files, the apparatus calculates the resemblances between the OHP files and the OHP image regions of the frame images constituting the moving image, and performs correspondence processing between the OHP files and the frame images based on the calculated resemblances.
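A minimal sketch of resemblance-based correspondence is given below. It assumes the OHP image region has already been cropped from each frame and scaled to the slide size, and it uses a simple pixel-difference similarity; the abstract does not define the resemblance measure, so `resemblance` and `match_frames` are hypothetical helpers.

```python
def resemblance(a, b):
    """Similarity in [0, 1] between two equal-size grayscale images."""
    diffs = [abs(pa - pb) for ra, rb in zip(a, b) for pa, pb in zip(ra, rb)]
    return 1.0 - sum(diffs) / (255.0 * len(diffs))

def match_frames(slides, frame_regions):
    """For each frame region, return the index of the most similar slide."""
    return [
        max(range(len(slides)), key=lambda i: resemblance(slides[i], region))
        for region in frame_regions
    ]

# Two 2x2 "slides" and three noisy frame regions showing them.
slides = [
    [[0, 0], [255, 255]],
    [[255, 255], [0, 0]],
]
frame_regions = [
    [[10, 5], [250, 240]],
    [[245, 250], [20, 0]],
    [[240, 255], [0, 10]],
]
matches = match_frames(slides, frame_regions)
```

The resulting list gives, for every frame in time order, the slide it most resembles, which is the information needed to synchronize slide changes with the video.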
Abstract:
A method and apparatus for automatically generating degraded character images at various levels of degradation are presented. The method comprises rendering the character image on a scene plane; translating and rotating the scene plane according to various parameters; determining a projection region of the character image on an image plane according to various parameters; generating a pixel region mask; and generating a final degraded image by super-sampling. Degraded character images are thus generated under various degradation conditions. The generated synthetic characters can be used for performance evaluation and training data augmentation in optical character recognition (OCR).
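The final super-sampling step can be sketched in isolation. The example assumes the character has already been projected onto a high-resolution binary grid and that each output pixel is the average of a `factor` by `factor` block of samples; the abstract does not describe the sampling pattern or the pixel region mask, and the scene-plane rendering and projection steps are omitted here.

```python
def supersample(binary, factor):
    """Downsample a high-resolution binary image to grayscale.

    Each output pixel is the mean of a factor x factor block of
    0/1 samples, scaled to the range 0..255.
    """
    height = len(binary) // factor
    width = len(binary[0]) // factor
    out = []
    for r in range(height):
        row = []
        for c in range(width):
            block = [
                binary[r * factor + i][c * factor + j]
                for i in range(factor)
                for j in range(factor)
            ]
            row.append(round(255 * sum(block) / len(block)))
        out.append(row)
    return out

# 4x4 binary stroke pattern reduced to a 2x2 grayscale image.
high_res = [
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 1],
    [1, 1, 1, 1],
]
low_res = supersample(high_res, factor=2)
```

Partially covered blocks produce intermediate gray levels, which is what gives the low-resolution character the blurred edges typical of a degraded image.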