摘要:
A method and apparatus for generating a degraded character image at various levels of degradation automatically is presented in this invention. The method comprises rendering the character image on a scene plane; translating and rotating the scene plane according to various parameters; determining a projection region of the character image on an image plane according to various parameters; generating a pixel region mask; and generating a final degraded image by super-sampling. Thus various degraded character images are generated on various conditions of degradation. The generated synthetic characters can be used for performance evaluation and training data augmentation in optical character recognition (OCR).
摘要:
Precise grayscale character segmentation apparatus and method. The precise grayscale character segmentation apparatus comprises an adjustment and segmentation unit for adjusting and segmenting an inputted low resolution text line image undergone coarse segmentation, so as to generate an adjusted character image; a character image binarization unit for generating a binary character image from the character image inputted therein; a noise removal unit for removing noise information in the binary character image generated by the binarization unit; and a final character image segmentation unit for generating a precisely segmented character image from the binary character image from which noise has been removed.
摘要:
A method and apparatus for generating a degraded dictionary automatically is presented in this invention. Herein, a degraded pattern generating means generates a plurality of degraded patterns from an original character image, based on a plurality of degradation parameters. A degraded dictionary generating means generates a plurality of degraded dictionaries corresponding to the plurality of degradation parameters, based on the plurality of degradation patterns. Finally, a dictionary matching means selects one of the plurality of dictionaries which matches the degradation level of a test sample set best, as the final degraded dictionary. In this invention, various degraded patterns can be generated by means of simple scaling and blurring process for establishing degraded dictionaries. Therefore, the invention can be implemented simply and easily. The method and apparatus of the invention can not only be used in character recognition field, but also can be used in other fields such as speech recognition and face recognition.
摘要:
A grayscale character dictionary generation apparatus, comprising a first synthetic grayscale degraded character image generation unit for generating first synthetic grayscale degraded character images using binary character images inputted therein; a clustering unit for dividing each category of the first synthetic grayscale degraded character images generated by the first synthetic grayscale degraded character image generation unit into a plurality of clusters; a template generation unit for generating template for each of the clusters; a transformation matrix generation unit for generating transformation matrix in relation to each of the templates; and a second synthetic grayscale degraded character dictionary generation unit for obtaining character feature of every grayscale degraded character of each of the clusters using the transformation matrix, and for constructing eigenspace of each category of the synthetic grayscale degraded character, which is the second synthetic grayscale character dictionary.
摘要:
Character recognition apparatus and method for recognizing characters in an image, of which the character recognition apparatus comprises a text line extraction unit for extracting a plurality of text lines from an input image, a feature recognition unit for recognizing one or more features of each of the text lines, a synthetic pattern generation unit for generating synthetic character images for each of the text lines by using the features recognized by the feature recognition unit and the original character images, a synthetic dictionary generation unit for generating a synthetic dictionary for each of the text lines by using the synthetic character images, and a text line recognition unit for recognizing characters in each of the text lines by using the synthetic dictionary.
摘要:
A method and apparatus for generating a degraded character image at various levels of degradation automatically is presented in this invention. The method comprises rendering the character image on a scene plane; translating and rotating the scene plane according to various parameters; determining a projection region of the character image on an image plane according to various parameters; generating a pixel region mask; and generating a final degraded image by super-sampling. Thus various degraded character images are generated on various conditions of degradation. The generated synthetic characters can be used for performance evaluation and training data augmentation in optical character recognition (OCR).
摘要:
Precise grayscale character segmentation apparatus and method. The precise grayscale character segmentation apparatus comprises an adjustment and segmentation unit for adjusting and segmenting an inputted low resolution text line image undergone coarse segmentation, so as to generate an adjusted character image; a character image binarization unit for generating a binary character image from the character image inputted therein; a noise removal unit for removing noise information in the binary character image generated by the binarization unit; and a final character image segmentation unit for generating a precisely segmented character image from the binary character image from which noise has been removed.
摘要:
A grayscale character dictionary generation apparatus, comprising a first synthetic grayscale degraded character image generation unit for generating first synthetic grayscale degraded character images using binary character images inputted therein; a clustering unit for dividing each category of the first synthetic grayscale degraded character images generated by the first synthetic grayscale degraded character image generation unit into a plurality of clusters; a template generation unit for generating template for each of the clusters; a transformation matrix generation unit for generating transformation matrix in relation to each of the templates; and a second synthetic grayscale degraded character dictionary generation unit for obtaining character feature of every grayscale degraded character of each of the clusters using the transformation matrix, and for constructing eigenspace of each category of the synthetic grayscale degraded character, which is the second synthetic grayscale character dictionary.
摘要:
A method and apparatus for generating a degraded dictionary automatically is presented in this invention. Herein, a degraded pattern generating means generates a plurality of degraded patterns from an original character image, based on a plurality of degradation parameters. A degraded dictionary generating means generates a plurality of degraded dictionaries corresponding to the plurality of degradation parameters, based on the plurality of degradation patterns. Finally, a dictionary matching means selects one of the plurality of dictionaries which matches the degradation level of a test sample set best, as the final degraded dictionary. In this invention, various degraded patterns can be generated by means of simple scaling and blurring process for establishing degraded dictionaries. Therefore, the invention can be implemented simply and easily. The method and apparatus of the invention can not only be used in character recognition field, but also can be used in other fields such as speech recognition and face recognition.
摘要:
Video frames that contain text areas are selected from given video frames by removing redundant frames and non-text frames, the text areas in the selected frames are located by removing false strokes, and text lines in the text areas are extracted and binarized.