Punched hole detection and removal
    11.
    发明授权
    Punched hole detection and removal 失效
    打孔检测和去除

    公开(公告)号:US08494304B2

    公开(公告)日:2013-07-23

    申请号:US11801958

    申请日:2007-05-11

    IPC分类号: G06K9/00 G06K9/40

    摘要: A method for removal of punched hole artifacts in digital images includes, for a scanned document page, deriving an original digital image that defines the page in terms of a plurality of input pixels. A reduced resolution bitonal image is generated from the original image. The method further includes providing for identifying of candidate punched hole artifacts in the reduced resolution bitonal image and providing for testing the candidate punched hole artifacts for at least one of shape, size, and location. Where a candidate punched hole artifact meets the at least one test, the method includes generating a modified image. This includes erasing the candidate punched hole artifact from the original digital image.

    摘要翻译: 用于去除数字图像中的穿孔伪影的方法包括对于扫描的文档页,导出根据多个输入像素定义页面的原始数字图像。 从原始图像生成缩小分辨率的双色图像。 该方法还包括提供在缩小分辨率双色图像中识别候选冲孔缺陷伪像,并提供用于测试形状,尺寸和位置中的至少一个的候选穿孔伪影。 在候选者穿孔孔伪影符合至少一个测试的情况下,该方法包括生成修改的图像。 这包括从原始数字图像中删除候选的穿孔伪影。

    System and method for identifying and labeling fields of text associated with scanned business documents
    12.
    发明授权
    System and method for identifying and labeling fields of text associated with scanned business documents 有权
    用于识别和标记与扫描的业务文档相关的文本字段的系统和方法

    公开(公告)号:US07965891B2

    公开(公告)日:2011-06-21

    申请号:US12710573

    申请日:2010-02-23

    IPC分类号: G06K9/34

    CPC分类号: G06K9/00469

    摘要: A system for electronically distilling information from a business document uses a network scanner to electronically scan a platen area, having a business document thereon, to create a bitmap. A network server carries out a segmentation process to segment the scan generated bitmap into a bitmap object, the bitmap object corresponding to the scanned business document; a bitmap to text conversion process to convert the bitmap object into a block of text; a semantic recognition process to generate a structured representation of semantic entities corresponding to the scanned business document; and a document generation process to convert the structured representation into a structure text file. The semantic recognition process includes the processes of generating, for each line of text having a keyword therein, a terminal symbol corresponding to the keyword therein; generating, for each line of text not having a keyword therein and absent of numeric characters, an alphabetic terminal symbol; generating, for each line of text not having a keyword therein and having a numeric character therein, an alphanumeric terminal symbol; generating a string of terminal symbols from the generated terminal symbols; determining a probable parsing of the generated string of terminal symbols; labeling each text line, according to a determined function, with non-terminal symbols; and parsing the business document information text into fields of business document information text based upon the non-terminal symbol of each text line and the determined probable parsing of the generated string of terminal symbols.

    摘要翻译: 用于从商业文档电子地蒸馏信息的系统使用网络扫描器来电子扫描其上具有业务文档的压板区域以创建位图。 网络服务器执行分割过程,将扫描生成的位图分割成位图对象,对应于扫描的业务文档的位图对象; 将位图对象转换为文本块的文本转换过程的位图; 语义识别过程,用于生成对应于扫描的业务单据的语义实体的结构化表示; 以及将结构化表示转换成结构文本文件的文档生成处理。 语义识别处理包括对于其中具有关键词的每行文本生成与其中的关键词对应的终端符号的处理; 生成对于其中没有关键字的每行文本和不存在数字字符的字母的终端符号; 为每个不具有关键字的文本行和其中具有数字字符的每行文本生成字母数字终端符号; 从所生成的终端符号生成一串终端符号; 确定所生成的终端符号串的可能解析; 根据确定的功能标记每个文本行,具有非终端符号; 以及基于每个文本行的非终端符号以及确定的所生成的终端符号串的可能解析,将业务文档信息文本解析为商业文档信息文本的字段。

    SYSTEM AND METHOD FOR IDENTIFYING AND LABELING FIELDS OF TEXT ASSOCIATED WITH SCANNED BUSINESS DOCUMENTS
    13.
    发明申请
    SYSTEM AND METHOD FOR IDENTIFYING AND LABELING FIELDS OF TEXT ASSOCIATED WITH SCANNED BUSINESS DOCUMENTS 有权
    用于识别和标记与扫描业务文档相关联的文本字段的系统和方法

    公开(公告)号:US20100149606A1

    公开(公告)日:2010-06-17

    申请号:US12710573

    申请日:2010-02-23

    IPC分类号: G06F17/00 G06K9/34 H04N1/04

    CPC分类号: G06K9/00469

    摘要: A system for electronically distilling information from a business document uses a network scanner to electronically scan a platen area, having a business document thereon, to create a bitmap. A network server carries out a segmentation process to segment the scan generated bitmap into a bitmap object, the bitmap object corresponding to the scanned business document; a bitmap to text conversion process to convert the bitmap object into a block of text; a semantic recognition process to generate a structured representation of semantic entities corresponding to the scanned business document; and a document generation process to convert the structured representation into a structure text file. The semantic recognition process includes the processes of generating, for each line of text having a keyword therein, a terminal symbol corresponding to the keyword therein; generating, for each line of text not having a keyword therein and absent of numeric characters, an alphabetic terminal symbol; generating, for each line of text not having a keyword therein and having a numeric character therein, an alphanumeric terminal symbol; generating a string of terminal symbols from the generated terminal symbols; determining a probable parsing of the generated string of terminal symbols; labeling each text line, according to a determined function, with non-terminal symbols; and parsing the business document information text into fields of business document information text based upon the non-terminal symbol of each text line and the determined probable parsing of the generated string of terminal symbols.

    摘要翻译: 用于从商业文档电子地蒸馏信息的系统使用网络扫描器来电子扫描其上具有业务文档的压板区域以创建位图。 网络服务器执行分割过程,将扫描生成的位图分割成位图对象,对应于扫描的业务文档的位图对象; 将位图对象转换为文本块的文本转换过程的位图; 语义识别过程,用于生成对应于扫描的业务单据的语义实体的结构化表示; 以及将结构化表示转换成结构文本文件的文档生成处理。 语义识别处理包括对于其中具有关键词的每行文本生成与其中的关键词对应的终端符号的处理; 生成对于其中没有关键字的每行文本和不存在数字字符的字母的终端符号; 为每个不具有关键字的文本行和其中具有数字字符的每行文本生成字母数字终端符号; 从所生成的终端符号生成一串终端符号; 确定所生成的终端符号串的可能解析; 根据确定的功能标记每个文本行,具有非终端符号; 以及基于每个文本行的非终端符号以及确定的所生成的终端符号串的可能解析,将业务文档信息文本解析为商业文档信息文本的字段。

    AUTOMATIC EDUCATIONAL ASSESSMENT SERVICE
    14.
    发明申请
    AUTOMATIC EDUCATIONAL ASSESSMENT SERVICE 审中-公开
    自动教育评估服务

    公开(公告)号:US20100075291A1

    公开(公告)日:2010-03-25

    申请号:US12339771

    申请日:2008-12-19

    IPC分类号: G09B7/00

    CPC分类号: G09B7/00 G09B7/06

    摘要: A method and system for automatically helping a teacher/educator evaluate assessments administered to students for determining student's attributes. The teacher/educator reviews stored assessment forms at a digital user interface (DUI) at a multifunction device (MFD) and selects the desired forms and creates an Assessment Batch which includes a List of Students to be given the forms for marking. The system automatically codes each form with personalized student information and prints the individualized assessment forms. The system may include an assessment repository for storing assessment definitions, rubrics, and administered assessments (e.g., results sheets), an may additionally or alternatively include an assessment analyzer for interpreting scanned imaged of administered assessments. The teacher/educator administers the assessment and assessment forms are manually marked, collected and scanned at the MFD, entered into storage and the marked images automatically analyzed and the assessments automatically evaluated from stored rubrics and the teacher/educator is automatically notified by email that the evaluation has been performed. The system enables the teacher/educator to review the evaluations remotely and validate/annotate the evaluation and update the records in storage. The Assessment Batch may be created for a list of students in a group, a class, a grade level, a school, a plurality of schools and students in a geographical area.

    摘要翻译: 自动帮助教师/教育工作者评估对学生进行评估以确定学生属性的方法和系统。 教师/教育工作者在多功能设备(MFD)的数字用户界面(DUI)上审查存储的评估表,并选择所需的表格,并创建一个评估批,其中包括要给予标记表单的学生名单。 系统会自动为每个表单编写个性化的学生信息,并打印个性化的评估表单。 系统可以包括用于存储评估定义,标题和管理评估(例如,结果表)的评估储存库,另外还可以包括评估分析器,用于解释所管理的评估的扫描图像。 教师/教育工作者管理评估和评估表格,在MFD手动标记,收集和扫描,进入存储,自动分析标记图像,并从存储的标题自动评估评估,教师/教育者自动通过电子邮件通知 已经进行了评估。 该系统使教师/教育者能够远程审查评估,并对评估进行验证/注释,并更新存储中的记录。 评估批次可以创建一个地理区域中的小组,班级,年级,学校,多所学校和学生的学生名单。

    Punched hole detection and removal
    15.
    发明申请
    Punched hole detection and removal 失效
    打孔检测和去除

    公开(公告)号:US20080279474A1

    公开(公告)日:2008-11-13

    申请号:US11801958

    申请日:2007-05-11

    IPC分类号: G06K9/40

    摘要: A method for removal of punched hole artifacts in digital images includes, for a scanned document page, deriving an original digital image that defines the page in terms of a plurality of input pixels. A reduced resolution bitonal image is generated from the original image. The method further includes providing for identifying of candidate punched hole artifacts in the reduced resolution bitonal image and providing for testing the candidate punched hole artifacts for at least one of shape, size, and location. Where a candidate punched hole artifact meets the at least one test, the method includes generating a modified image. This includes erasing the candidate punched hole artifact from the original digital image.

    摘要翻译: 用于去除数字图像中的穿孔伪影的方法包括对于扫描的文档页,导出根据多个输入像素定义页面的原始数字图像。 从原始图像生成缩小分辨率的双色图像。 该方法还包括提供在缩小分辨率双色图像中识别候选冲孔缺陷伪像,并提供用于测试形状,尺寸和位置中的至少一个的候选穿孔伪影。 在候选者穿孔孔伪影符合至少一个测试的情况下,该方法包括生成修改的图像。 这包括从原始数字图像中删除候选的穿孔伪影。

    METHOD AND SYSTEM FOR EVALUATING ELECTRONIC DOCUMENT
    16.
    发明申请
    METHOD AND SYSTEM FOR EVALUATING ELECTRONIC DOCUMENT 审中-公开
    用于评估电子文件的方法和系统

    公开(公告)号:US20140093858A1

    公开(公告)日:2014-04-03

    申请号:US13632363

    申请日:2012-10-01

    IPC分类号: G09B7/00

    CPC分类号: G09B7/02

    摘要: The disclosed embodiment relates to methods and systems for evaluating an electronic document. The computer implemented method includes receiving the electronic document containing a first set of answers corresponding to one or more pre-stored questions. The first set of answers are compared with a pre-stored second set of answers based on an answer descriptor syntax dataset. The answer descriptor syntax dataset comprises one or more rules. One or more answer descriptors for each of the first set of answers are determined based on the comparing. The one or more answer descriptors correspond to one or more observations for each of the first set of answers. Finally, the electronic document is evaluated based on determining.

    摘要翻译: 所公开的实施例涉及用于评估电子文档的方法和系统。 计算机实现的方法包括接收包含与一个或多个预先存储的问题相对应的第一组答案的电子文档。 将第一组答案与基于答案描述符语法数据集的预先存储的第二组答案进行比较。 答案描述符语法数据集包括一个或多个规则。 基于比较确定第一组答案中的每一个的一个或多个答案描述符。 一个或多个答案描述符对应于第一组答案中的每一个的一个或多个观察值。 最后,电子文档是根据确定进行评估的。

    METHOD OF PROCESSING AN IMAGE TO CLARIFY TEXT IN THE IMAGE
    17.
    发明申请
    METHOD OF PROCESSING AN IMAGE TO CLARIFY TEXT IN THE IMAGE 有权
    处理图像以在图像中清除文本的方法

    公开(公告)号:US20120188612A1

    公开(公告)日:2012-07-26

    申请号:US13013890

    申请日:2011-01-26

    IPC分类号: H04N1/40

    摘要: An image file representing at least a portion of a printed document is processed to highlight the differences between foreground material (e.g., text or other characters) from background. The method includes selecting a neighborhood of pixels, determining a weighted average of an attribute values (e.g., luminance) for each pixel, and modifying each pixel's value based on the weighted average. Graylevel scaling, error diffusion, and a bit level conversion are also performed each pixel ends up with either a first attribute value level (e.g., luminance of 0) or a second attribute value level (e.g., luminance of 255).

    摘要翻译: 处理表示打印文档的至少一部分的图像文件以突出显示来自背景的前景材料(例如,文本或其他字符)之间的差异。 该方法包括选择像素的邻域,确定每个像素的属性值(例如,亮度)的加权平均值,以及基于加权平均值修改每个像素的值。 还执行灰度缩放,误差扩散和位电平转换,每个像素以第一属性值级别(例如,亮度为0)或第二属性值级别(例如,亮度为255)结束。

    SYSTEM AND METHOD FOR REPRESENTING DIGITAL ASSESSMENTS
    18.
    发明申请
    SYSTEM AND METHOD FOR REPRESENTING DIGITAL ASSESSMENTS 有权
    用于表示数字评估的系统和方法

    公开(公告)号:US20110151423A1

    公开(公告)日:2011-06-23

    申请号:US12640426

    申请日:2009-12-17

    申请人: Dennis L. Venable

    发明人: Dennis L. Venable

    IPC分类号: G09B7/00

    CPC分类号: G09B7/00

    摘要: A method and system for processing a digital assessment template are provided. The system includes at least one tangible processor and a memory with instructions to be executed by the at least one tangible processor for processing a digital assessment template. The template which includes a description of a plurality of data structures that are configured for interpreting an assessment associated with the template. The assessment was marked with strokes by an assessment-taker who was administered the assessment and responded to at least one problem provided by the assessment. The template describes a location of the marked assessment in which to find each stroke that corresponds to a response by the assessment-taker and how to interpret the strokes. Each of the locations and how to interpret the strokes are selectable.

    摘要翻译: 提供了一种处理数字评估模板的方法和系统。 该系统包括至少一个有形处理器和具有由至少一个有形处理器执行的用于处理数字评估模板的指令的存储器。 该模板包括被配置用于解释与模板相关联的评估的多个数据结构的描述。 该评估由评估员标记,并由评估员进行了评估,并对评估提供的至少一个问题进行了回应。 该模板描述了标记评估的位置,其中查找与评估者的回答相对应的每个笔画以及如何解释笔画。 每个位置和如何解释笔画是可选择的。

    SYSTEM AND METHOD FOR IDENTIFYING AND LABELING FIELDS OF TEXT ASSOCIATED WITH SCANNED BUSINESS DOCUMENTS

    公开(公告)号:US20100150397A1

    公开(公告)日:2010-06-17

    申请号:US12710568

    申请日:2010-02-23

    IPC分类号: G06K9/00 G06K9/34

    CPC分类号: G06K9/00469

    摘要: A system for electronically distilling information from a business document uses a network scanner to electronically scan a platen area, having a business document thereon, to create a bitmap. A network server carries out a segmentation process to segment the scan generated bitmap into a bitmap object, the bitmap object corresponding to the scanned business document; a bitmap to text conversion process to convert the bitmap object into a block of text; a semantic recognition process to generate a structured representation of semantic entities corresponding to the scanned business document; and a document generation process to convert the structured representation into a structure text file. The semantic recognition process includes the processes of generating, for each line of text having a keyword therein, a terminal symbol corresponding to the keyword therein; generating, for each line of text not having a keyword therein and absent of numeric characters, an alphabetic terminal symbol; generating, for each line of text not having a keyword therein and having a numeric character therein, an alphanumeric terminal symbol; generating a string of terminal symbols from the generated terminal symbols; determining a probable parsing of the generated string of terminal symbols; labeling each text line, according to a determined function, with non-terminal symbols; and parsing the business document information text into fields of business document information text based upon the non-terminal symbol of each text line and the determined probable parsing of the generated string of terminal symbols.

    Event driven plugin architecture for importing scanned image data into a production workflow
    20.
    发明申请
    Event driven plugin architecture for importing scanned image data into a production workflow 有权
    事件驱动的插件架构,用于将扫描的图像数据导入到生产工作流程中

    公开(公告)号:US20090006610A1

    公开(公告)日:2009-01-01

    申请号:US11824065

    申请日:2007-06-29

    申请人: Dennis L. Venable

    发明人: Dennis L. Venable

    IPC分类号: G06F15/173

    摘要: Systems and methods are described that facilitate importing scanned image data into a production workflow, in accordance with various features described herein. A plurality of loosely-coupled, dynamically loaded plugins can be defined in a configuration file for a given production scanning job. The plugins can be invoked in response to a trigger with which each plugin is associated, and triggers can be associated with different phases of the production workflow, such as image data acquisition (importation), data filtering (pre-scanning), image analysis (scanning), and metadata processing (post-scanning). In this manner, the overarching scanning architecture need not have direct knowledge of which plugins are triggered, or even present, and custom plugins as well as standard plugins can be provided for each production scanning job.

    摘要翻译: 描述了根据本文描述的各种特征的系统和方法,其便于将扫描的图像数据导入到生产工作流程中。 可以在用于给定生产扫描作业的配置文件中定义多个松散耦合的动态加载的插件。 可以响应于每个插件关联的触发器来调用插件,并且触发器可以与生产工作流程的不同阶段相关联,例如图像数据采集(导入),数据过滤(预扫描),图像分析( 扫描)和元数据处理(后扫描)。 以这种方式,总体扫描架构不需要直接了解哪些插件被触发甚至存在,并且可以为每个生产扫描作业提供定制插件以及标准插件。