Logical structure and layout based offline character recognition
    1.
    Patent application
    Status: Granted

    Publication number: US20070133883A1

    Publication date: 2007-06-14

    Application number: US11299873

    Filing date: 2005-12-12

    IPC classification: G06K9/62

    CPC classification: G06K9/80

    Abstract: A method and system for implementing character recognition is described herein. An input character is received. The input character is composed of one or more logical structures in a particular layout. The layout of the one or more logical structures is identified. One or more of a plurality of classifiers are selected based on the layout of the one or more logical structures in the input character. The entire character is input into the selected classifiers. The selected classifiers classify the logical structures. The outputs from the selected classifiers are then combined to form an output character vector.
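
    The flow in this abstract (identify the layout, select classifiers by layout, feed the whole character to each selected classifier, combine the outputs) can be sketched as follows. This is a minimal illustration, not the patented implementation: the blank-row/blank-column layout test, the ink-ratio stand-in classifiers, and every name used (`identify_layout`, `make_region_classifier`, `CLASSIFIERS_BY_LAYOUT`, `recognize`) are assumptions made for the example.

```python
from typing import Callable, Dict, List, Sequence

Bitmap = Sequence[Sequence[int]]            # rows of 0/1 pixels
Classifier = Callable[[Bitmap], List[float]]


def identify_layout(char: Bitmap) -> str:
    """Toy layout detector (assumption): a blank interior column splits the
    character into left/right parts, a blank interior row into top/bottom."""
    height, width = len(char), len(char[0])
    for x in range(1, width - 1):
        if all(char[y][x] == 0 for y in range(height)):
            return "left-right"
    for y in range(1, height - 1):
        if all(pixel == 0 for pixel in char[y]):
            return "top-bottom"
    return "single"


def make_region_classifier(region: str) -> Classifier:
    """Stand-in classifier (hypothetical): receives the entire character but
    scores only its own region, returning [ink ratio, 1 - ink ratio]."""
    def classify(char: Bitmap) -> List[float]:
        height, width = len(char), len(char[0])
        rows, cols = range(height), range(width)
        if region == "left":
            cols = range(width // 2)
        elif region == "right":
            cols = range(width // 2, width)
        elif region == "top":
            rows = range(height // 2)
        elif region == "bottom":
            rows = range(height // 2, height)
        cells = [char[y][x] for y in rows for x in cols]
        ratio = sum(cells) / len(cells)
        return [ratio, 1.0 - ratio]
    return classify


# Which classifiers run for which layout (illustrative mapping).
CLASSIFIERS_BY_LAYOUT: Dict[str, List[Classifier]] = {
    "left-right": [make_region_classifier("left"), make_region_classifier("right")],
    "top-bottom": [make_region_classifier("top"), make_region_classifier("bottom")],
    "single": [make_region_classifier("whole")],
}


def recognize(char: Bitmap) -> List[float]:
    """Select classifiers by layout, give each the whole character, and
    concatenate their outputs into one output character vector."""
    selected = CLASSIFIERS_BY_LAYOUT[identify_layout(char)]
    output: List[float] = []
    for classifier in selected:
        output.extend(classifier(char))
    return output
```

    Concatenation is only one way to combine the classifier outputs; the abstract does not say which combination rule is used.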

    Scalable hash-based character recognition
    2.
    Patent application
    Status: Granted

    Publication number: US20060171588A1

    Publication date: 2006-08-03

    Application number: US11045792

    Filing date: 2005-01-28

    IPC classification: G06K9/18 G06K9/00

    Abstract: The subject invention leverages a scalable character glyph hash table to provide an efficient means to identify print characters where the character glyphs are identical over independent presentation. The hash table allows for quick determinations of glyph meta data as, for example, a pre-filter to traditional OCR techniques. The hash table can be trained for a particular environment, user, language, character set (e.g., alphabet), document type, and/or specific document and the like. This permits substantial flexibility and increases in speed in identifying unknown glyphs. The hash table itself can be composed of single or multiple tables that have a specific optimization purpose. In one instance of the subject invention, traditional OCR techniques can be utilized to update the hash tables as needed based on glyph frequency. This keeps the hash tables from growing by limiting updates that reduce its performance, while adding frequently determined glyphs to increase the pre-filter performance.
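
    The pre-filter idea in this abstract can be sketched as a hash table consulted before a slower OCR routine, with entries added only for glyphs seen often enough. This is a hedged sketch, not the patented design: the SHA-1 key, the `min_count` and `max_entries` parameters, and the names `glyph_key` and `GlyphHashPrefilter` are assumptions, and `ocr_fallback` stands in for whatever traditional OCR engine is available.

```python
import hashlib
from collections import Counter
from typing import Callable, Dict, Sequence

Bitmap = Sequence[Sequence[int]]  # rows of 0/1 pixels


def glyph_key(glyph: Bitmap) -> str:
    """Hash the exact pixels, so only bit-identical glyphs share a key."""
    digest = hashlib.sha1()
    digest.update(f"{len(glyph)}x{len(glyph[0])}:".encode())
    for row in glyph:
        digest.update(bytes(row))
    return digest.hexdigest()


class GlyphHashPrefilter:
    """Hash-table lookup in front of a slower OCR function (hypothetical API)."""

    def __init__(self, ocr_fallback: Callable[[Bitmap], str],
                 min_count: int = 2, max_entries: int = 10_000) -> None:
        self.ocr_fallback = ocr_fallback
        self.min_count = min_count      # glyph frequency needed before caching
        self.max_entries = max_entries  # cap that keeps the table from growing
        self.table: Dict[str, str] = {}
        self.seen: Counter = Counter()

    def recognize(self, glyph: Bitmap) -> str:
        key = glyph_key(glyph)
        if key in self.table:           # fast path: identical glyph known
            return self.table[key]
        label = self.ocr_fallback(glyph)
        self.seen[key] += 1
        # Only frequently determined glyphs are promoted into the table.
        if self.seen[key] >= self.min_count and len(self.table) < self.max_entries:
            self.table[key] = label
        return label
```

    Because the key covers every pixel, this only helps when glyphs repeat exactly, which is the case the abstract describes for print characters rendered identically.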

    Segmentation based content alteration techniques
    3.
    Patent application
    Status: Granted

    Publication number: US20050246775A1

    Publication date: 2005-11-03

    Application number: US11046996

    Filing date: 2005-01-31

    Abstract: The subject invention provides a unique system and method that facilitates creating HIP challenges (HIPs) that can be readily segmented and solved by human users but that are too difficult for non-human users. More specifically, the system and method utilize a variety of unique alteration techniques that are segmentation-based. For example, the system and method employ thicker arcs or occlusions that do not intersect characters already placed in the HIP. The thickness of the arc can be measured or determined by the thickness of the characters in the HIP. In addition to increasing the thickness, the arcs can be lengthened because longer arcs tend to resemble pieces of characters and may be harder to erode. Usability maps can be generated and used to selectively place clutter or occlusions and to selectively warp characters or the character sequence to facilitate human recognition of the characters.

    Unfolded convolution for fast feature extraction
    4.
    Patent application
    Status: Granted

    Publication number: US20070086655A1

    Publication date: 2007-04-19

    Application number: US11250819

    Filing date: 2005-10-14

    IPC classification: G06K9/46 G06K9/64

    CPC classification: G06K9/4628 G06K2209/01

    Abstract: Systems and methods are described that facilitate performing feature extraction across multiple received input features to reduce computational overhead associated with feature processing related to, for instance, optical character recognition. Input feature information can be unfolded and concatenated to generate an aggregated input matrix, which can be convolved with a kernel matrix to produce output feature information for multiple output features concurrently.
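
    The unfold-and-concatenate step in this abstract resembles what is often called im2col: each output position becomes one row holding the patches from every input feature map, so a single matrix product yields all output features at once. The pure-Python sketch below illustrates that general idea under stated assumptions (square `k` x `k` kernels, stride 1, no padding, no kernel flip); the function names are hypothetical and the patent's actual memory layout may differ.

```python
from typing import List

Matrix = List[List[float]]


def unfold(features: List[Matrix], k: int) -> Matrix:
    """One row per output position; each row concatenates the k x k patch
    taken from every input feature map at that position."""
    height, width = len(features[0]), len(features[0][0])
    rows: Matrix = []
    for y in range(height - k + 1):
        for x in range(width - k + 1):
            row: List[float] = []
            for fmap in features:
                for dy in range(k):
                    row.extend(fmap[y + dy][x:x + k])
            rows.append(row)
    return rows


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain matrix product a @ b."""
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*b)] for row in a]


def unfolded_convolution(features: List[Matrix],
                         kernels: List[List[Matrix]], k: int) -> List[Matrix]:
    """kernels[o][i] is the k x k kernel from input feature i to output
    feature o. Returns one output map per output feature."""
    # Flatten each output feature's kernels in the same order unfold() uses.
    columns = [[v for kern in per_output for krow in kern for v in krow]
               for per_output in kernels]
    kernel_matrix = [list(r) for r in zip(*columns)]   # one column per output
    product = matmul(unfold(features, k), kernel_matrix)
    out_width = len(features[0][0]) - k + 1
    maps: List[Matrix] = []
    for o in range(len(kernels)):
        flat = [row[o] for row in product]
        maps.append([flat[i:i + out_width] for i in range(0, len(flat), out_width)])
    return maps
```

    The payoff in practice comes from replacing many small loops with one large matrix multiplication, which optimized linear-algebra routines handle efficiently; the pure-Python `matmul` here is only for illustration.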

    Optimization of cascaded classifiers
    5.
    Patent application
    Status: Under examination (published)

    Publication number: US20070112701A1

    Publication date: 2007-05-17

    Application number: US11204145

    Filing date: 2005-08-15

    IPC classification: G06N3/02

    CPC classification: G06K9/6256 G06N20/00

    Abstract: An optimization system comprises a reception component that receives a cascade of classifiers. The system further includes an optimization component communicatively coupled to the reception component, the optimization component receives input relating to one of speed and accuracy of the cascade of classifiers and optimizes the cascade of classifiers based at least in part upon the received input and confidence scores associated with each classifier within the cascade of classifiers. The optimization component can utilize at least one of a steepest descent algorithm, a dynamic programming algorithm, a simulated annealing algorithm, and a branch and bound variant of a depth first search algorithm in connection with optimizing the cascade of classifiers.
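
    To make the speed/accuracy trade-off concrete, the sketch below models a cascade in which each stage answers only if its confidence score meets a threshold, and then picks the cheapest thresholds that reach a requested accuracy. It deliberately uses an exhaustive grid search rather than any of the algorithms the abstract names (steepest descent, dynamic programming, simulated annealing, branch and bound); the data layout, the cost model, and all function names are assumptions for illustration.

```python
from itertools import product
from typing import List, Optional, Sequence, Tuple

StageOutput = Tuple[str, float]                 # (predicted label, confidence)
Sample = Tuple[Sequence[StageOutput], str]      # (per-stage outputs, true label)


def run_cascade(outputs: Sequence[StageOutput], thresholds: Sequence[float],
                costs: Sequence[float]) -> Tuple[str, float]:
    """Stop at the first stage whose confidence meets its threshold.
    The last stage always answers, so it needs no threshold."""
    spent = 0.0
    for i, (label, confidence) in enumerate(outputs):
        spent += costs[i]
        if i == len(outputs) - 1 or confidence >= thresholds[i]:
            return label, spent
    raise ValueError("cascade needs at least one stage")


def evaluate(samples: List[Sample], thresholds: Sequence[float],
             costs: Sequence[float]) -> Tuple[float, float]:
    """Return (accuracy, average cost per sample) for the given thresholds."""
    correct, total_cost = 0, 0.0
    for outputs, truth in samples:
        label, spent = run_cascade(outputs, thresholds, costs)
        correct += label == truth
        total_cost += spent
    return correct / len(samples), total_cost / len(samples)


def optimize_thresholds(samples: List[Sample], costs: Sequence[float],
                        min_accuracy: float,
                        grid: Sequence[float] = (0.5, 0.7, 0.9)
                        ) -> Optional[Tuple[Tuple[float, ...], float, float]]:
    """Grid search (a stand-in for the optimizers named in the abstract):
    cheapest thresholds whose accuracy is at least min_accuracy.
    Returns (thresholds, average cost, accuracy), or None if none qualify."""
    best = None
    for thresholds in product(grid, repeat=len(costs) - 1):
        accuracy, cost = evaluate(samples, thresholds, costs)
        if accuracy >= min_accuracy and (best is None or cost < best[1]):
            best = (thresholds, cost, accuracy)
    return best
```

    Grid search scales exponentially with the number of stages, which is one reason a real system would prefer the search strategies listed in the abstract.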

    Systems and methods that facilitate improved display of electronic documents
    6.
    Patent application
    Status: Granted

    Publication number: US20060271846A1

    Publication date: 2006-11-30

    Application number: US11135717

    Filing date: 2005-05-24

    IPC classification: G06F15/00

    CPC classification: G06Q10/10

    Abstract: A computer-implemented word processing system comprises an interface component that receives a features vector associated with an electronic document. An analysis component communicatively coupled to the interface component analyzes the features vector and determines a viewing mode in which to display the electronic document. In accordance with one aspect of the subject invention, the viewing mode can be one of a conventional viewing mode and a viewing mode associated with enhanced readability.

    High performance content alteration architecture and techniques
    7.
    Patent application
    Status: Granted

    Publication number: US20050229251A1

    Publication date: 2005-10-13

    Application number: US10815086

    Filing date: 2004-03-31

    Abstract: The present invention provides a unique system and method that facilitates obtaining high performance and more secure HIPs. More specifically, the HIPs can be generated in part by caching pre-rendered characters and/or pre-rendered arcs as bitmaps in binary form and then selecting any number of the characters and/or arcs randomly to form a HIP sequence. The warp field can be pre-computed and converted to integers in binary form and can include a plurality of sub-regions. The warp field can be cached as well. Any one sub-region can be retrieved from the warp field cache and mapped to the HIP sequence to warp the HIP. Thus, the pre-computed warp field can be used to warp multiple HIP sequences. The warping can occur in binary form and at a high resolution to mitigate reverse engineering. Following, the warped HIP sequence can be down-sampled and texture and/or color can be added as well to improve its appearance.

    Tarp filter
    8.
    Patent application
    Status: Granted

    Publication number: US20060078210A1

    Publication date: 2006-04-13

    Application number: US11287671

    Filing date: 2005-11-28

    IPC classification: G06K9/36

    CPC classification: G06T9/004 G06T9/007

    Abstract: Systems and methods for performing adaptive filtering are disclosed. The present invention generates probabilities that can be used in an encoder, such as an arithmetic encoder and generates those probabilities in a computationally efficient manner. Probabilities of previously encoded coefficients are employed, effectively, in generating probabilities of the coefficients without regard to directional information. Thus, a large amount of information is adaptively and efficiently used in generating the probabilities. For the coefficients, the probability is computed based at least partly on at least one probability of a previously computed probability of a neighboring coefficient. Then, the coefficients are encoded using those computed probabilities.

    Document content and structure conversion
    10.
    Patent application
    Status: Granted

    Publication number: US20070192687A1

    Publication date: 2007-08-16

    Application number: US11353915

    Filing date: 2006-02-14

    IPC classification: G06F17/00 G06F15/00 G06K9/36

    CPC classification: G06K9/00442

    Abstract: A system that can convert content and structure of a document from an original format into a target format irrespective of the functional specifics of the original format. The system can automatically infer the content and structure of a document via a rendered format thereby restoring the programmatic functionality of the original file (or generating programmatic functionality of a desired target format) through the novel conversion/import process. The system can extract the document structure (e.g., layout) together with the content in order to effectuate the conversion. Heuristics (e.g., logic and/or reasoning) can be employed to make decisions with respect to importing the document into a target format and/or formats.
