Degraded character image generation method and apparatus

    公开(公告)号:US20060056697A1

    公开(公告)日:2006-03-16

    申请号:US11200202

    申请日:2005-08-10

    IPC分类号: G06K9/18

    摘要: A method and apparatus for generating a degraded character image at various levels of degradation automatically is presented in this invention. The method comprises rendering the character image on a scene plane; translating and rotating the scene plane according to various parameters; determining a projection region of the character image on an image plane according to various parameters; generating a pixel region mask; and generating a final degraded image by super-sampling. Thus various degraded character images are generated on various conditions of degradation. The generated synthetic characters can be used for performance evaluation and training data augmentation in optical character recognition (OCR).

    Precise grayscale character segmentation apparatus and method
    2.
    发明授权
    Precise grayscale character segmentation apparatus and method 有权
    精确的灰度字符分割装置和方法

    公开(公告)号:US07715628B2

    公开(公告)日:2010-05-11

    申请号:US11356449

    申请日:2006-02-17

    IPC分类号: G06K9/34

    摘要: Precise grayscale character segmentation apparatus and method. The precise grayscale character segmentation apparatus comprises an adjustment and segmentation unit for adjusting and segmenting an inputted low resolution text line image undergone coarse segmentation, so as to generate an adjusted character image; a character image binarization unit for generating a binary character image from the character image inputted therein; a noise removal unit for removing noise information in the binary character image generated by the binarization unit; and a final character image segmentation unit for generating a precisely segmented character image from the binary character image from which noise has been removed.

    摘要翻译: 精确的灰度字符分割装置和方法。 精确的灰度字符分割装置包括调整和分割单元,用于调整和分割经粗分割的输入低分辨率文本行图像,以便产生经调整的字符图像; 字符图像二值化单元,用于从输入的字符图像生成二进制字符图像; 噪声去除单元,用于去除由二值化单元生成的二进制字符图像中的噪声信息; 以及最终字符图像分割单元,用于从已经去除噪声的二进制字符图像生成精确分割的字符图像。

    Degraded dictionary generation method and apparatus
    3.
    发明授权
    Degraded dictionary generation method and apparatus 有权
    降级字典生成方法和装置

    公开(公告)号:US07480408B2

    公开(公告)日:2009-01-20

    申请号:US11200194

    申请日:2005-08-10

    IPC分类号: G06K9/34 G06K9/46 G06K9/68

    CPC分类号: G06K9/6255

    摘要: A method and apparatus for generating a degraded dictionary automatically is presented in this invention. Herein, a degraded pattern generating means generates a plurality of degraded patterns from an original character image, based on a plurality of degradation parameters. A degraded dictionary generating means generates a plurality of degraded dictionaries corresponding to the plurality of degradation parameters, based on the plurality of degradation patterns. Finally, a dictionary matching means selects one of the plurality of dictionaries which matches the degradation level of a test sample set best, as the final degraded dictionary. In this invention, various degraded patterns can be generated by means of simple scaling and blurring process for establishing degraded dictionaries. Therefore, the invention can be implemented simply and easily. The method and apparatus of the invention can not only be used in character recognition field, but also can be used in other fields such as speech recognition and face recognition.

    摘要翻译: 本发明提出了一种自动生成退化字典的方法和装置。 这里,劣化图案生成单元基于多个劣化参数,从原始文字图像生成多个劣化图案。 劣化字典生成装置基于多个劣化模式,生成与多个劣化参数对应的多个劣化字典。 最后,字典匹配装置选择与测试样本集合的劣化级别最佳匹配的多个词典中的一个作为最终退化字典。 在本发明中,可以通过用于建立退化字典的简单缩放和模糊处理来生成各种退化图案。 因此,本发明可以简单且容易地实现。 本发明的方法和装置不仅可以用于字符识别领域,而且可以用于诸如语音识别和面部识别等其他领域。

    Grayscale character dictionary generation apparatus
    4.
    发明授权
    Grayscale character dictionary generation apparatus 有权
    灰度字符字典生成装置

    公开(公告)号:US07532756B2

    公开(公告)日:2009-05-12

    申请号:US11329407

    申请日:2006-01-11

    IPC分类号: G06K9/18 G06K9/62 G06K9/34

    摘要: A grayscale character dictionary generation apparatus, comprising a first synthetic grayscale degraded character image generation unit for generating first synthetic grayscale degraded character images using binary character images inputted therein; a clustering unit for dividing each category of the first synthetic grayscale degraded character images generated by the first synthetic grayscale degraded character image generation unit into a plurality of clusters; a template generation unit for generating template for each of the clusters; a transformation matrix generation unit for generating transformation matrix in relation to each of the templates; and a second synthetic grayscale degraded character dictionary generation unit for obtaining character feature of every grayscale degraded character of each of the clusters using the transformation matrix, and for constructing eigenspace of each category of the synthetic grayscale degraded character, which is the second synthetic grayscale character dictionary.

    摘要翻译: 一种灰度字符字典生成装置,包括:第一合成灰度级退化字符图像生成单元,用于使用输入的二进制字符图像生成第一合成灰度级退化字符图像; 聚类单元,用于将由第一合成灰度级降级字符图像生成单元生成的第一合成灰阶降级字符图像的每个类别划分为多个聚类; 用于为每个簇生成模板的模板生成单元; 变换矩阵生成单元,用于生成关于每个模板的变换矩阵; 以及第二合成灰阶降级字符字典生成单元,用于使用所述变换矩阵来获得每个所述聚类的每个灰度级退化特征的特征,并且用于构建作为所述第二合成灰度字符的所述合成灰度级退化字符的每个类别的本征空间 字典。

    Character recognition apparatus and method for recognizing characters in an image
    5.
    发明申请
    Character recognition apparatus and method for recognizing characters in an image 审中-公开
    用于识别图像中的字符的字符识别装置和方法

    公开(公告)号:US20060062460A1

    公开(公告)日:2006-03-23

    申请号:US11199993

    申请日:2005-08-10

    IPC分类号: G06K9/18

    CPC分类号: G06K9/325 G06K2209/01

    摘要: Character recognition apparatus and method for recognizing characters in an image, of which the character recognition apparatus comprises a text line extraction unit for extracting a plurality of text lines from an input image, a feature recognition unit for recognizing one or more features of each of the text lines, a synthetic pattern generation unit for generating synthetic character images for each of the text lines by using the features recognized by the feature recognition unit and the original character images, a synthetic dictionary generation unit for generating a synthetic dictionary for each of the text lines by using the synthetic character images, and a text line recognition unit for recognizing characters in each of the text lines by using the synthetic dictionary.

    摘要翻译: 用于识别图像中的字符的字符识别装置和方法,其中字符识别装置包括用于从输入图像中提取多条文本行的文本行提取单元,用于识别每一个的一个或多个特征的特征识别单元 文本行,合成图案生成单元,用于通过使用由特征识别单元识别的特征和原始字符图像来生成每个文本行的合成人物图像;合成词典生成单元,用于为每个文本生成合成词典 通过使用合成字符图像的行,以及用于通过使用合成字典来识别每个文本行中的字符的文本行识别单元。

    Degraded character image generation method and apparatus
    6.
    发明授权
    Degraded character image generation method and apparatus 失效
    降级字符图像生成方法和装置

    公开(公告)号:US07480409B2

    公开(公告)日:2009-01-20

    申请号:US11200202

    申请日:2005-08-10

    IPC分类号: G06K9/34 G06K9/32 G06K9/62

    摘要: A method and apparatus for generating a degraded character image at various levels of degradation automatically is presented in this invention. The method comprises rendering the character image on a scene plane; translating and rotating the scene plane according to various parameters; determining a projection region of the character image on an image plane according to various parameters; generating a pixel region mask; and generating a final degraded image by super-sampling. Thus various degraded character images are generated on various conditions of degradation. The generated synthetic characters can be used for performance evaluation and training data augmentation in optical character recognition (OCR).

    摘要翻译: 本发明提供了一种用于在自动降级的各种劣化级别生成降级字符图像的方法和装置。 该方法包括:将场景平面上的人物图像渲染; 根据各种参数平移和旋转场景平面; 根据各种参数确定图像平面上的字符图像的投影区域; 生成像素区域掩模; 并通过超采样生成最终退化图像。 因此,在各种劣化条件下产生各种退化的字符图像。 所生成的合成字符可用于光学字符识别(OCR)中的性能评估和训练数据增加。

    Precise grayscale character segmentation apparatus and method
    7.
    发明申请
    Precise grayscale character segmentation apparatus and method 有权
    精确的灰度字符分割装置和方法

    公开(公告)号:US20060245650A1

    公开(公告)日:2006-11-02

    申请号:US11356449

    申请日:2006-02-17

    IPC分类号: G06K9/34

    摘要: Precise grayscale character segmentation apparatus and method. The precise grayscale character segmentation apparatus comprises an adjustment and segmentation unit for adjusting and segmenting an inputted low resolution text line image undergone coarse segmentation, so as to generate an adjusted character image; a character image binarization unit for generating a binary character image from the character image inputted therein; a noise removal unit for removing noise information in the binary character image generated by the binarization unit; and a final character image segmentation unit for generating a precisely segmented character image from the binary character image from which noise has been removed.

    摘要翻译: 精确的灰度字符分割装置和方法。 精确的灰度字符分割装置包括调整和分割单元,用于调整和分割经粗分割的输入低分辨率文本行图像,以便产生经调整的字符图像; 字符图像二值化单元,用于从输入的字符图像生成二进制字符图像; 噪声去除单元,用于去除由二值化单元生成的二进制字符图像中的噪声信息; 以及最终字符图像分割单元,用于从已经去除噪声的二进制字符图像生成精确分割的字符图像。

    Grayscale character dictionary generation apparatus

    公开(公告)号:US20060171589A1

    公开(公告)日:2006-08-03

    申请号:US11329407

    申请日:2006-01-11

    IPC分类号: G06K9/34 G06K9/18

    摘要: A grayscale character dictionary generation apparatus, comprising a first synthetic grayscale degraded character image generation unit for generating first synthetic grayscale degraded character images using binary character images inputted therein; a clustering unit for dividing each category of the first synthetic grayscale degraded character images generated by the first synthetic grayscale degraded character image generation unit into a plurality of clusters; a template generation unit for generating template for each of the clusters; a transformation matrix generation unit for generating transformation matrix in relation to each of the templates; and a second synthetic grayscale degraded character dictionary generation unit for obtaining character feature of every grayscale degraded character of each of the clusters using the transformation matrix, and for constructing eigenspace of each category of the synthetic grayscale degraded character, which is the second synthetic grayscale character dictionary.

    Degraded dictionary generation method and apparatus

    公开(公告)号:US20060056696A1

    公开(公告)日:2006-03-16

    申请号:US11200194

    申请日:2005-08-10

    IPC分类号: G06K9/18

    CPC分类号: G06K9/6255

    摘要: A method and apparatus for generating a degraded dictionary automatically is presented in this invention. Herein, a degraded pattern generating means generates a plurality of degraded patterns from an original character image, based on a plurality of degradation parameters. A degraded dictionary generating means generates a plurality of degraded dictionaries corresponding to the plurality of degradation parameters, based on the plurality of degradation patterns. Finally, a dictionary matching means selects one of the plurality of dictionaries which matches the degradation level of a test sample set best, as the final degraded dictionary. In this invention, various degraded patterns can be generated by means of simple scaling and blurring process for establishing degraded dictionaries. Therefore, the invention can be implemented simply and easily. The method and apparatus of the invention can not only be used in character recognition field, but also can be used in other fields such as speech recognition and face recognition.

    Video text processing apparatus
    10.
    发明授权
    Video text processing apparatus 有权
    视频文本处理装置

    公开(公告)号:US07929765B2

    公开(公告)日:2011-04-19

    申请号:US12778336

    申请日:2010-05-12

    IPC分类号: G06K9/18

    CPC分类号: G06K9/3266 G06K2209/01

    摘要: Video frames that contain text areas are selected from given video frames by removing redundant frames and non-text frames, the text areas in the selected frames are located by removing false strokes, and text lines in the text areas are extracted and binarized.

    摘要翻译: 通过删除冗余帧和非文本帧,从给定的视频帧中选择包含文本区域的视频帧,通过删除假笔划来选择所选帧中的文本区域,文本区域中的文本行被提取和二值化。