Logical structure and layout based offline character recognition
    2.
    发明申请
    Logical structure and layout based offline character recognition 有权
    基于逻辑结构和布局的离线字符识别

    公开(公告)号:US20070133883A1

    公开(公告)日:2007-06-14

    申请号:US11299873

    申请日:2005-12-12

    IPC分类号: G06K9/62

    CPC分类号: G06K9/80

    摘要: A method and system for implementing character recognition is described herein. An input character is received. The input character is composed of one or more logical structures in a particular layout. The layout of the one or more logical structures is identified. One or more of a plurality of classifiers are selected based on the layout of the one or more logical structures in the input character. The entire character is input into the selected classifiers. The selected classifiers classify the logical structures. The outputs from the selected classifiers are then combined to form an output character vector.

    摘要翻译: 本文描述了用于实现字符识别的方法和系统。 接收到一个输入字符。 输入字符由特定布局中的一个或多个逻辑结构组成。 识别一个或多个逻辑结构的布局。 基于输入字符中的一个或多个逻辑结构的布局来选择多个分类器中的一个或多个。 整个字符被输入到所选择的分类器中。 所选分类器对逻辑结构进行分类。 然后将所选分类器的输出组合以形成输出字符向量。

    Tarp filter
    3.
    发明申请
    Tarp filter 有权
    篷布过滤器

    公开(公告)号:US20060078210A1

    公开(公告)日:2006-04-13

    申请号:US11287671

    申请日:2005-11-28

    IPC分类号: G06K9/36

    CPC分类号: G06T9/004 G06T9/007

    摘要: Systems and methods for performing adaptive filtering are disclosed. The present invention generates probabilities that can be used in an encoder, such as an arithmetic encoder and generates those probabilities in a computationally efficient manner. Probabilities of previously encoded coefficients are employed, effectively, in generating probabilities of the coefficients without regard to directional information. Thus, a large amount of information is adaptively and efficiently used in generating the probabilities. For the coefficients, the probability is computed based at least partly on at least one probability of a previously computed probability of a neighboring coefficient. Then, the coefficients are encoded using those computed probabilities.

    摘要翻译: 公开了用于执行自适应滤波的系统和方法。 本发明产生可以在诸如算术编码器的编码器中使用的概率,并以计算有效的方式生成这些概率。 先前编码的系数的概率被有效地用于在不考虑方向信息的情况下生成系数的概率。 因此,在生成概率时自适应地有效地使用大量的信息。 对于系数,概率至少部分地基于先前计算的相邻系数的概率的至少一个概率来计算。 然后,使用那些计算的概率对系数进行编码。

    Document content and structure conversion
    5.
    发明申请
    Document content and structure conversion 有权
    文件内容和结构转换

    公开(公告)号:US20070192687A1

    公开(公告)日:2007-08-16

    申请号:US11353915

    申请日:2006-02-14

    IPC分类号: G06F17/00 G06F15/00 G06K9/36

    CPC分类号: G06K9/00442

    摘要: A system that can convert content and structure of a document from an original format into a target format irrespective of the functional specifics of the original format. The system can automatically infer the content and structure of a document via a rendered format thereby restoring the programmatic functionality of the original file (or generating programmatic functionality of a desired target format) through the novel conversion/import process. The system can extract the document structure (e.g., layout) together with the content in order to effectuate the conversion. Heuristics (e.g., logic and/or reasoning) can be employed to make decisions with respect to importing the document into a target format and/or formats.

    摘要翻译: 一种可以将文档的内容和结构从原始格式转换为目标格式的系统,而不考虑原始格式的功能细节。 该系统可以通过呈现的格式自动推断文档的内容和结构,从而通过新颖的转换/导入过程恢复原始文件的编程功能(或产生所需目标格式的编程功能)。 系统可以与内容一起提取文档结构(例如布局),以便实现转换。 可以采用启发式(例如,逻辑和/或推理)来做出关于将文档导入目标格式和/或格式的决定。

    Generation Of Documents From Images
    6.
    发明申请
    Generation Of Documents From Images 有权
    从图像生成文档

    公开(公告)号:US20070177183A1

    公开(公告)日:2007-08-02

    申请号:US11275908

    申请日:2006-02-02

    IPC分类号: G06K15/00

    CPC分类号: G06F17/2765

    摘要: A system for generating soft copy (digital) versions of hard copy documents uses images of the hard copy documents. The images may be captured using a device suitable for capturing images, like a camera phone. Once available, the images may be processed to improve their suitability for document generation. The images may then be processed to recognize and generate soft copy versions of the documents represented by the images.

    摘要翻译: 用于生成软拷贝文档的软拷贝(数字)版本的系统使用硬拷贝文件的图像。 可以使用适合于捕获图像的设备来捕获图像,如摄像机电话。 一旦可用,可以处理图像以改善其对文档生成的适用性。 然后可以处理图像以识别并生成由图像表示的文档的软拷贝版本。

    Scalable hash-based character recognition
    7.
    发明申请
    Scalable hash-based character recognition 有权
    可扩展的基于哈希的字符识别

    公开(公告)号:US20060171588A1

    公开(公告)日:2006-08-03

    申请号:US11045792

    申请日:2005-01-28

    IPC分类号: G06K9/18 G06K9/00

    摘要: The subject invention leverages a scalable character glyph hash table to provide an efficient means to identify print characters where the character glyphs are identical over independent presentation. The hash table allows for quick determinations of glyph meta data as, for example, a pre-filter to traditional OCR techniques. The hash table can be trained for a particular environment, user, language, character set (e.g., alphabet), document type, and/or specific document and the like. This permits substantial flexibility and increases in speed in identifying unknown glyphs. The hash table itself can be composed of single or multiple tables that have a specific optimization purpose. In one instance of the subject invention, traditional OCR techniques can be utilized to update the hash tables as needed based on glyph frequency. This keeps the hash tables from growing by limiting updates that reduce its performance, while adding frequently determined glyphs to increase the pre-filter performance.

    摘要翻译: 本发明利用可缩放的字符字形哈希表来提供用于识别字符字形在独立呈现上相同的打印字符的有效手段。 哈希表允许快速确定字形元数据,例如,对传统的OCR技术进行预过滤。 可以针对特定环境,用户,语言,字符集(例如字母表),文档类型和/或特定文档等对哈希表进行训练。 这允许在识别未知字形中的基本灵活性和速度增加。 散列表本身可以由具有特定优化目的的单个或多个表组成。 在本发明的一个实例中,可以使用传统的OCR技术来根据字形频率根据需要来更新哈希表。 这样可以通过限制降低性能的更新来限制哈希表的增长,同时添加经常确定的字形以增加预过滤器的性能。

    Clustering
    9.
    发明申请
    Clustering 有权
    聚类

    公开(公告)号:US20050271281A1

    公开(公告)日:2005-12-08

    申请号:US11198562

    申请日:2005-08-05

    摘要: Systems and methods for performing clustering of a document image are disclosed. A property of an extracted mark from a document is compared to the properties of the existing clusters. If the property of the mark fails to match any of the properties of the existing clusters, the mark is added as a new cluster to the existing cluster. One property that can be utilized is x size and y size, which is the width and height, of the existing clusters. Another property that can be employed is ink size, which refers to the ratio of black pixels to total pixels in a cluster. Yet another property that can be utilized is a reduced mark or image, which is a pixel size reduced version the bitmap of the mark and/or cluster. The above properties can be employed to identify mismatches and reduce the number of bit by bit comparisons performed.

    摘要翻译: 公开了用于执行文档图像的聚类的系统和方法。 将来自文档的提取标记的属性与现有集群的属性进行比较。 如果标记的属性无法匹配现有集群的任何属性,则该标记作为新集群添加到现有集群。 可以使用的一个属性是x size和y size,这是现有集群的宽度和高度。 可以使用的另一个属性是墨水大小,其指的是群集中黑色像素与总像素的比例。 可以使用的另一个属性是缩小的标记或图像,其是像素尺寸缩小版本的标记和/或集群的位图。 可以采用上述特性来识别不匹配并减少进行的逐比较比较。

    Segmentation based content alteration techniques
    10.
    发明申请
    Segmentation based content alteration techniques 有权
    基于分割的内容变更技术

    公开(公告)号:US20050246775A1

    公开(公告)日:2005-11-03

    申请号:US11046996

    申请日:2005-01-31

    摘要: The subject invention provides a unique system and method that facilitates creating HIP challenges (HIPs) that can be readily segmented and solved by human users but that are too difficult for non-human users. More specifically, the system and method utilize a variety of unique alteration techniques that are segmentation-based. For example, the system and method employ thicker arcs or occlusions that do not intersect characters already placed in the HIP. The thickness of the arc can be measured or determined by the thickness of the characters in the HIP. In addition to increasing the thickness, the arcs can be lengthened because longer arcs tend to resemble pieces of characters and may be harder to erode. Usability maps can be generated and used to selectively place clutter or occlusions and to selectively warp characters or the character sequence to facilitate human recognition of the characters.

    摘要翻译: 本发明提供了一种独特的系统和方法,其有助于创建可以容易地由人类用户分割和解决的HIP挑战(HIP),但是对于非人类用户来说太难了。 更具体地说,该系统和方法利用了基于分段的各种独特的改变技术。 例如,系统和方法采用较大的弧或闭合不与HIP中已经放置的字符相交。 电弧的厚度可以通过HIP中字符的厚度来测量或确定。 除了增加厚度之外,弧可以延长,因为较长的弧往往类似于一些字符,并且可能难以侵蚀。 可用性图可以被生成并用于选择性地放置杂乱或闭塞,并且选择性地扭曲字符或字符序列以促进人类对字符的识别。