Optical mark classification system and method
    1.
    发明授权
    Optical mark classification system and method 有权
    光标分类系统及方法

    公开(公告)号:US08600165B2

    公开(公告)日:2013-12-03

    申请号:US12704792

    申请日:2010-02-12

    IPC分类号: G06K9/46 G06K9/00 G06K7/10

    摘要: A system, method, and apparatus for mark recognition in an image of an original document are provided. The method/system takes as input an image of an original document in which at least one designated field is provided for accepting a mark applied by a user (which may or may not have been marked). A region of interest (RoI) is extracted from the image, roughly corresponding to the designated field. A center of gravity (CoG) of the RoI is determined, based on a distribution of black pixels in the RoI. Thereafter, for one or more iterations, the RoI is partitioned into sub-RoIs, based on the determined CoG, where at a subsequent iteration, sub-RoIs generated at the prior iteration serve as the RoI partitioned. Data is extracted from the RoI and sub-RoIs at one or more of the iterations, which allows a representation of the entire RoI to be generated which is useful in classifying the designated field, e.g., as positive (marked) or negative (not marked).

    摘要翻译: 提供了一种用于原始文档的图像中的标记识别的系统,方法和装置。 该方法/系统将原始文档的图像作为输入,其中提供至少一个指定字段用于接受用户应用的标记(其可以被标记或可能不被标记)。 从图像中提取感兴趣区域(RoI),大致对应于指定字段。 基于RoI中的黑色像素的分布,确定RoI的重心(CoG)。 此后,对于一次或多次迭代,基于所确定的CoG将RoI划分为子路由,其中​​在随后的迭代中,在先前迭代中生成的子路由用作RoI分区。 在一次或多次迭代中,从RoI和Sub-RoI中提取数据,这允许生成对整个指定字段进行分类的整个RoI的表示,例如作为正(标记)或否定(未标记) )。

    Method for one-step document categorization and separation using stamped machine recognizable patterns
    2.
    发明授权
    Method for one-step document categorization and separation using stamped machine recognizable patterns 有权
    使用印字机识别模式进行一步文档分类和分离的方法

    公开(公告)号:US08453922B2

    公开(公告)日:2013-06-04

    申请号:US12702897

    申请日:2010-02-09

    IPC分类号: G06F17/00 G06K19/06

    CPC分类号: G06F17/30563 G06F17/30011

    摘要: A method for separating and categorizing documents includes receiving a scanned batch of documents. The batch includes scanned documents to which document separator stamps have been applied before scanning. Each stamp includes machine recognizable patterns applied on a same page of a document, spaced by a designated field for receiving a user-applied category code. The scanned batch of documents is processed to identify pages that contain a document separator, including identifying at least one of two spaced patterns. For a document page for which a document separator is identified, the the corresponding designated field is located and the category code associated with the designated field identified. The document containing the is separated from other documents in the batch based the identified separator and a document category is assigned to the document, based on the identified category code.

    摘要翻译: 用于分离和分类文档的方法包括接收扫描的文件批次。 批次包括在扫描之前应用了文档分隔符的扫描文档。 每个印章包括应用在文档的相同页面上的机器可识别图案,间隔有用于接收用户应用的类别代码的指定字段。 处理扫描的文档批次以识别包含文档分隔符的页面,包括识别两个间隔图案中的至少一个。 对于识别文档分隔符的文档页面,定位相应的指定字段,并且识别与指定字段相关联的类别代码。 基于识别的类别代码,将包含该文档的文档与批处理中的其他文档分离,并根据识别的类别代码将文档类别分配给文档。

    METHOD FOR ONE-STEP DOCUMENT CATEGORIZATION AND SEPARATION
    3.
    发明申请
    METHOD FOR ONE-STEP DOCUMENT CATEGORIZATION AND SEPARATION 有权
    一步文件分类和分离方法

    公开(公告)号:US20110192894A1

    公开(公告)日:2011-08-11

    申请号:US12702897

    申请日:2010-02-09

    IPC分类号: G06F17/00 G06K7/00 G09F3/00

    CPC分类号: G06F17/30563 G06F17/30011

    摘要: A method, apparatus, and hardcopy document are provided. The method provides for separating and categorizing documents and includes receiving a scanned batch of documents. The batch includes a plurality of scanned documents to which document separator stamps have been applied before scanning. Each document separator stamp includes first and second machine recognizable patterns applied on a same page of a document, the first and second patterns being spaced by a designated field for receiving a user-applied category code. The scanned batch of documents is processed to identify pages that contain a document separator, the processing including identifying at least one of the first and second spaced patterns. For each of a plurality of document pages for which a document separator is identified, the method includes locating the corresponding designated field and identifying the category code associated with the designated field. The document containing the identified separator is separated from other documents in the batch based on at least the identified separator and a document category is assigned to the document from a set of document categories, based on the identified category code.

    摘要翻译: 提供了一种方法,设备和硬拷贝文档。 该方法用于分离和分类文档,并包括接收扫描的文件批次。 批次包括在扫描之前已经应用了文档分离器标记的多个扫描文档。 每个文档分隔符包括应用于文档的同一页面上的第一和第二机器可识别图案,第一和第二图案间隔有用于接收用户应用类别代码的指定字段。 处理扫描的文档批次以识别包含文档分隔符的页面,该处理包括识别第一和第二间隔图案中的至少一个。 对于识别出文档分离器的多个文档页面中的每一个,该方法包括定位相应的指定字段并且识别与指定字段相关联的类别代码。 基于所识别的类别代码,至少基于所标识的分离器将包含所识别的分离符的文档与批处理中的其他文档分开,并且从文档类别集合将文档类别分配给文档。