-
公开(公告)号:US20070177183A1
公开(公告)日:2007-08-02
申请号:US11275908
申请日:2006-02-02
IPC分类号: G06K15/00
CPC分类号: G06F17/2765
摘要: A system for generating soft copy (digital) versions of hard copy documents uses images of the hard copy documents. The images may be captured using a device suitable for capturing images, like a camera phone. Once available, the images may be processed to improve their suitability for document generation. The images may then be processed to recognize and generate soft copy versions of the documents represented by the images.
摘要翻译: 用于生成软拷贝文档的软拷贝(数字)版本的系统使用硬拷贝文件的图像。 可以使用适合于捕获图像的设备来捕获图像,如摄像机电话。 一旦可用,可以处理图像以改善其对文档生成的适用性。 然后可以处理图像以识别并生成由图像表示的文档的软拷贝版本。
-
公开(公告)号:US20070133883A1
公开(公告)日:2007-06-14
申请号:US11299873
申请日:2005-12-12
申请人: Kumar Chellapilla , Patrice Simard
发明人: Kumar Chellapilla , Patrice Simard
IPC分类号: G06K9/62
CPC分类号: G06K9/80
摘要: A method and system for implementing character recognition is described herein. An input character is received. The input character is composed of one or more logical structures in a particular layout. The layout of the one or more logical structures is identified. One or more of a plurality of classifiers are selected based on the layout of the one or more logical structures in the input character. The entire character is input into the selected classifiers. The selected classifiers classify the logical structures. The outputs from the selected classifiers are then combined to form an output character vector.
摘要翻译: 本文描述了用于实现字符识别的方法和系统。 接收到一个输入字符。 输入字符由特定布局中的一个或多个逻辑结构组成。 识别一个或多个逻辑结构的布局。 基于输入字符中的一个或多个逻辑结构的布局来选择多个分类器中的一个或多个。 整个字符被输入到所选择的分类器中。 所选分类器对逻辑结构进行分类。 然后将所选分类器的输出组合以形成输出字符向量。
-
公开(公告)号:US20060078210A1
公开(公告)日:2006-04-13
申请号:US11287671
申请日:2005-11-28
IPC分类号: G06K9/36
摘要: Systems and methods for performing adaptive filtering are disclosed. The present invention generates probabilities that can be used in an encoder, such as an arithmetic encoder and generates those probabilities in a computationally efficient manner. Probabilities of previously encoded coefficients are employed, effectively, in generating probabilities of the coefficients without regard to directional information. Thus, a large amount of information is adaptively and efficiently used in generating the probabilities. For the coefficients, the probability is computed based at least partly on at least one probability of a previously computed probability of a neighboring coefficient. Then, the coefficients are encoded using those computed probabilities.
摘要翻译: 公开了用于执行自适应滤波的系统和方法。 本发明产生可以在诸如算术编码器的编码器中使用的概率,并以计算有效的方式生成这些概率。 先前编码的系数的概率被有效地用于在不考虑方向信息的情况下生成系数的概率。 因此,在生成概率时自适应地有效地使用大量的信息。 对于系数,概率至少部分地基于先前计算的相邻系数的概率的至少一个概率来计算。 然后,使用那些计算的概率对系数进行编码。
-
公开(公告)号:US20050180597A1
公开(公告)日:2005-08-18
申请号:US11095815
申请日:2005-03-31
申请人: Patrice Simard , Michael Sinclair
发明人: Patrice Simard , Michael Sinclair
CPC分类号: G06F3/0425 , B43L1/00 , H04N7/142
摘要: An image capturing system is installable in a room separate from a writing surface and a second area. The image capturing system is adapted to take visual images of the writing surface and second area and identify information written thereon.
摘要翻译: 图像捕获系统可安装在与书写表面和第二区域分离的房间中。 图像拍摄系统适于拍摄书写表面和第二区域的视觉图像,并识别写在其上的信息。
-
公开(公告)号:US20070192687A1
公开(公告)日:2007-08-16
申请号:US11353915
申请日:2006-02-14
申请人: Patrice Simard , Radoslav Nickolov
发明人: Patrice Simard , Radoslav Nickolov
CPC分类号: G06K9/00442
摘要: A system that can convert content and structure of a document from an original format into a target format irrespective of the functional specifics of the original format. The system can automatically infer the content and structure of a document via a rendered format thereby restoring the programmatic functionality of the original file (or generating programmatic functionality of a desired target format) through the novel conversion/import process. The system can extract the document structure (e.g., layout) together with the content in order to effectuate the conversion. Heuristics (e.g., logic and/or reasoning) can be employed to make decisions with respect to importing the document into a target format and/or formats.
摘要翻译: 一种可以将文档的内容和结构从原始格式转换为目标格式的系统,而不考虑原始格式的功能细节。 该系统可以通过呈现的格式自动推断文档的内容和结构,从而通过新颖的转换/导入过程恢复原始文件的编程功能(或产生所需目标格式的编程功能)。 系统可以与内容一起提取文档结构(例如布局),以便实现转换。 可以采用启发式(例如,逻辑和/或推理)来做出关于将文档导入目标格式和/或格式的决定。
-
公开(公告)号:US20060171588A1
公开(公告)日:2006-08-03
申请号:US11045792
申请日:2005-01-28
CPC分类号: G06K9/6828 , G06K9/72 , G06K2209/01
摘要: The subject invention leverages a scalable character glyph hash table to provide an efficient means to identify print characters where the character glyphs are identical over independent presentation. The hash table allows for quick determinations of glyph meta data as, for example, a pre-filter to traditional OCR techniques. The hash table can be trained for a particular environment, user, language, character set (e.g., alphabet), document type, and/or specific document and the like. This permits substantial flexibility and increases in speed in identifying unknown glyphs. The hash table itself can be composed of single or multiple tables that have a specific optimization purpose. In one instance of the subject invention, traditional OCR techniques can be utilized to update the hash tables as needed based on glyph frequency. This keeps the hash tables from growing by limiting updates that reduce its performance, while adding frequently determined glyphs to increase the pre-filter performance.
摘要翻译: 本发明利用可缩放的字符字形哈希表来提供用于识别字符字形在独立呈现上相同的打印字符的有效手段。 哈希表允许快速确定字形元数据,例如,对传统的OCR技术进行预过滤。 可以针对特定环境,用户,语言,字符集(例如字母表),文档类型和/或特定文档等对哈希表进行训练。 这允许在识别未知字形中的基本灵活性和速度增加。 散列表本身可以由具有特定优化目的的单个或多个表组成。 在本发明的一个实例中,可以使用传统的OCR技术来根据字形频率根据需要来更新哈希表。 这样可以通过限制降低性能的更新来限制哈希表的增长,同时添加经常确定的字形以增加预过滤器的性能。
-
公开(公告)号:US20060078202A1
公开(公告)日:2006-04-13
申请号:US11281462
申请日:2005-11-18
申请人: Michael Shilman , Zile Wei , Yu Zou , Patrice Simard , Sashi Raghupathy , F. Jones , Charlton Lui , Jian Wang
发明人: Michael Shilman , Zile Wei , Yu Zou , Patrice Simard , Sashi Raghupathy , F. Jones , Charlton Lui , Jian Wang
IPC分类号: G06K9/18
CPC分类号: G06K9/00409 , G06K9/222
摘要: Electronic ink layout analysis systems and methods provide flexibility and efficiency in organizing, analyzing, and processing digital ink. These layout analysis systems and methods allow users substantial freedom in entering electronic ink into a pen-based computer system. Using these systems and methods, a user's input digital ink is not constrained by requirements that a user write in a specific screen orientation, that a user write in one specific orientation on all portions of a page, or that a user write using a specific minimum or maximum sized stroke. Rather, the systems and methods freely allow the user to write anywhere on a given page, in any orientation or size, while still enabling effective and efficient handwriting recognition and other processing of the input digital ink.
-
公开(公告)号:US20050271281A1
公开(公告)日:2005-12-08
申请号:US11198562
申请日:2005-08-05
申请人: Patrice Simard , Henrique Malvar , Erin Renshaw
发明人: Patrice Simard , Henrique Malvar , Erin Renshaw
IPC分类号: G06T7/00 , G06F19/00 , G06K9/20 , G06K9/36 , G06K9/46 , G06K9/62 , G06K9/64 , G06K9/68 , G06T5/00 , G06T11/00
CPC分类号: G06K9/00442 , G06K9/46 , G06K9/6202 , G06K2209/01
摘要: Systems and methods for performing clustering of a document image are disclosed. A property of an extracted mark from a document is compared to the properties of the existing clusters. If the property of the mark fails to match any of the properties of the existing clusters, the mark is added as a new cluster to the existing cluster. One property that can be utilized is x size and y size, which is the width and height, of the existing clusters. Another property that can be employed is ink size, which refers to the ratio of black pixels to total pixels in a cluster. Yet another property that can be utilized is a reduced mark or image, which is a pixel size reduced version the bitmap of the mark and/or cluster. The above properties can be employed to identify mismatches and reduce the number of bit by bit comparisons performed.
摘要翻译: 公开了用于执行文档图像的聚类的系统和方法。 将来自文档的提取标记的属性与现有集群的属性进行比较。 如果标记的属性无法匹配现有集群的任何属性,则该标记作为新集群添加到现有集群。 可以使用的一个属性是x size和y size,这是现有集群的宽度和高度。 可以使用的另一个属性是墨水大小,其指的是群集中黑色像素与总像素的比例。 可以使用的另一个属性是缩小的标记或图像,其是像素尺寸缩小版本的标记和/或集群的位图。 可以采用上述特性来识别不匹配并减少进行的逐比较比较。
-
公开(公告)号:US20050246775A1
公开(公告)日:2005-11-03
申请号:US11046996
申请日:2005-01-31
申请人: Kumar Chellapilla , Patrice Simard
发明人: Kumar Chellapilla , Patrice Simard
IPC分类号: A61F2/32 , F27D1/02 , G06F1/00 , G06F15/00 , G06F21/31 , G06F21/32 , G06Q10/00 , G06Q30/00 , G06T3/00 , H04L9/32
CPC分类号: G06T3/00 , G06F21/31 , G06F21/55 , G06Q10/107 , G06Q30/02
摘要: The subject invention provides a unique system and method that facilitates creating HIP challenges (HIPs) that can be readily segmented and solved by human users but that are too difficult for non-human users. More specifically, the system and method utilize a variety of unique alteration techniques that are segmentation-based. For example, the system and method employ thicker arcs or occlusions that do not intersect characters already placed in the HIP. The thickness of the arc can be measured or determined by the thickness of the characters in the HIP. In addition to increasing the thickness, the arcs can be lengthened because longer arcs tend to resemble pieces of characters and may be harder to erode. Usability maps can be generated and used to selectively place clutter or occlusions and to selectively warp characters or the character sequence to facilitate human recognition of the characters.
摘要翻译: 本发明提供了一种独特的系统和方法,其有助于创建可以容易地由人类用户分割和解决的HIP挑战(HIP),但是对于非人类用户来说太难了。 更具体地说,该系统和方法利用了基于分段的各种独特的改变技术。 例如,系统和方法采用较大的弧或闭合不与HIP中已经放置的字符相交。 电弧的厚度可以通过HIP中字符的厚度来测量或确定。 除了增加厚度之外,弧可以延长,因为较长的弧往往类似于一些字符,并且可能难以侵蚀。 可用性图可以被生成并用于选择性地放置杂乱或闭塞,并且选择性地扭曲字符或字符序列以促进人类对字符的识别。
-
公开(公告)号:US20120158488A1
公开(公告)日:2012-06-21
申请号:US12972417
申请日:2010-12-17
CPC分类号: G06Q30/0243
摘要: Counterfactual analysis can be performed “offline”, or “after the fact”, based on data collected during a trial in which random variations are applied to the output of the system whose parameters are to be the subject of the counterfactual analysis. A weighting factor can be derived and applied to data collected during the trial to emphasize that data obtained when the random variations most closely resembled the output that would be expected if counterfactual parameters were utilized to generate the output. If the counterfactual parameters being considered differ too much from the parameters under which the trial was conducted, the offline counterfactual analysis can estimate a direction and magnitude of the change of the system performance, as opposed to deriving a specific expected system performance value. In economic transactions, the random variations can be considered variations in the price paid by another party, thereby enabling derivation of their marginal cost.
摘要翻译: 反事实分析可以基于在试验期间收集的数据“离线”或“事后”进行,其中随机变量应用于其参数作为反事实分析的对象的系统的输出。 可以导出加权因子并将其应用于在试验期间收集的数据,以强调当随机变量最接近地类似于如果使用反事实参数来产生输出时将被预期的输出获得的数据。 如果所考虑的反事实参数与进行试验的参数有太大差异,那么脱机反事实分析可以估计系统性能变化的方向和幅度,而不是推导具体的预期系统性能值。 在经济交易中,随机变化可以被认为是另一方支付的价格变动,从而能够推算其边际成本。
-
-
-
-
-
-
-
-
-