Automatic training of character templates using a transcription and a two-dimensional image source model

Granted invention patent

US5689620A Automatic training of character templates using a transcription and a two-dimensional image source model (Expired)

Patent title: Automatic training of character templates using a transcription and a two-dimensional image source model
Application number: US431223

Filing date: 1995-04-28
Publication number: US5689620A

Publication date: 1997-11-18
Inventors: Gary E. Kopec, Philip Andrew Chou, Leslie T. Niles
Applicants: Gary E. Kopec, Philip Andrew Chou, Leslie T. Niles
Applicant address: Stamford, CT
Patentee: Xerox Corporation
Current patentee: Xerox Corporation
Current patentee address: Stamford, CT
Main classification: G06K9/66
IPC classifications: G06K9/66; G06K9/62; G06T1/40; G06K9/00

Abstract:

A technique for automatically training a set of character templates using unsegmented training samples uses as input a two-dimensional (2D) image of characters, called glyphs, as the source of training samples, a transcription associated with the 2D image as a source of labels for the glyph samples, and an explicit, formal 2D image source model that models as a grammar the structural and functional features of a set of 2D images that may be used as the source of training data. The input transcription may be a literal transcription associated with the 2D input image, or it may be nonliteral, for example containing logical structure tags for document formatting, such as found in markup languages. The technique uses spatial positioning information about the 2D image modeled by the 2D image source model and uses labels in the transcription to determine labeled glyph positions in the 2D image that identify locations of glyph samples. The character templates are produced using the input 2D image and the labeled glyph positions without assigning pixels to glyph samples prior to training. In one implementation, the 2D image source model is a regular grammar having the form of a finite state transition network, and the transcription is also represented as a finite state network. The two networks are merged to produce a transcription-image network, which is used to decode the input 2D image to produce labeled glyph positions that identify training data samples in the 2D image. In one implementation of the template construction process, a pixel scoring technique is used to produce character templates contemporaneously from blocks of training data samples aligned at glyph positions.
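The final step described in the abstract, building templates from training samples aligned at labeled glyph positions, can be illustrated with a deliberately simplified sketch. It assumes the decoding step has already produced a list of (label, x, y) glyph positions, treats each position as the top-left corner of a fixed-size window, and uses a plain per-pixel vote in place of the pixel scoring technique the abstract mentions. The function name `build_templates`, the window convention, and the `threshold` parameter are all hypothetical and do not come from the patent.

```python
from collections import defaultdict


def build_templates(image, labeled_positions, height, width, threshold=0.5):
    """Estimate one binary template per label by per-pixel voting.

    image: list of rows, each a list of 0/1 pixels.
    labeled_positions: list of (label, x, y); (x, y) is assumed to be the
        top-left corner of a height x width window (an illustrative choice).
    A template pixel is set to 1 when more than `threshold` of the aligned
    samples for that label have a foreground pixel at that offset.
    """
    sums = {}                  # label -> accumulated foreground counts
    counts = defaultdict(int)  # label -> number of samples seen
    rows, cols = len(image), len(image[0])

    for label, x, y in labeled_positions:
        acc = sums.setdefault(label, [[0] * width for _ in range(height)])
        for r in range(height):
            for c in range(width):
                yy, xx = y + r, x + c
                # Pixels falling outside the image count as background.
                if 0 <= yy < rows and 0 <= xx < cols:
                    acc[r][c] += image[yy][xx]
        counts[label] += 1

    return {
        label: [[1 if v / counts[label] > threshold else 0 for v in row]
                for row in acc]
        for label, acc in sums.items()
    }
```

Because the windows are read directly from the page image at each glyph position, no pixel is assigned to a particular glyph sample beforehand; overlapping windows simply each contribute to their own label's counts. The voting rule above ignores interactions between neighbouring templates, which a joint scoring method would have to handle.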

Published/granted documents

US4959295A Process of making a photosensitive semi-aqueous developable ceramic coating composition. Publication/grant date: 1990-09-25
