-
公开(公告)号:US08600165B2
公开(公告)日:2013-12-03
申请号:US12704792
申请日:2010-02-12
CPC分类号: G06K9/3233 , G06K9/2063 , G06K2209/01
摘要: A system, method, and apparatus for mark recognition in an image of an original document are provided. The method/system takes as input an image of an original document in which at least one designated field is provided for accepting a mark applied by a user (which may or may not have been marked). A region of interest (RoI) is extracted from the image, roughly corresponding to the designated field. A center of gravity (CoG) of the RoI is determined, based on a distribution of black pixels in the RoI. Thereafter, for one or more iterations, the RoI is partitioned into sub-RoIs, based on the determined CoG, where at a subsequent iteration, sub-RoIs generated at the prior iteration serve as the RoI partitioned. Data is extracted from the RoI and sub-RoIs at one or more of the iterations, which allows a representation of the entire RoI to be generated which is useful in classifying the designated field, e.g., as positive (marked) or negative (not marked).
摘要翻译: 提供了一种用于原始文档的图像中的标记识别的系统,方法和装置。 该方法/系统将原始文档的图像作为输入,其中提供至少一个指定字段用于接受用户应用的标记(其可以被标记或可能不被标记)。 从图像中提取感兴趣区域(RoI),大致对应于指定字段。 基于RoI中的黑色像素的分布,确定RoI的重心(CoG)。 此后,对于一次或多次迭代,基于所确定的CoG将RoI划分为子路由,其中在随后的迭代中,在先前迭代中生成的子路由用作RoI分区。 在一次或多次迭代中,从RoI和Sub-RoI中提取数据,这允许生成对整个指定字段进行分类的整个RoI的表示,例如作为正(标记)或否定(未标记) )。
-
公开(公告)号:US20110200256A1
公开(公告)日:2011-08-18
申请号:US12704792
申请日:2010-02-12
IPC分类号: G06K9/46
CPC分类号: G06K9/3233 , G06K9/2063 , G06K2209/01
摘要: A system, method, and apparatus for mark recognition in an image of an original document are provided. The method/system takes as input an image of an original document in which at least one designated field is provided for accepting a mark applied by a user (which may or may not have been marked). A region of interest (RoI) is extracted from the image, roughly corresponding to the designated field. A center of gravity (CoG) of the RoI is determined, based on a distribution of black pixels in the RoI. Thereafter, for one or more iterations, the RoI is partitioned into sub-RoIs, based on the determined CoG, where at a subsequent iteration, sub-RoIs generated at the prior iteration serve as the RoI partitioned. Data is extracted from the RoI and sub-RoIs at one or more of the iterations, which allows a representation of the entire RoI to be generated which is useful in classifying the designated field, e.g., as positive (marked) or negative (not marked).
摘要翻译: 提供了一种用于原始文档的图像中的标记识别的系统,方法和装置。 该方法/系统将原始文档的图像作为输入,其中提供至少一个指定字段用于接受用户应用的标记(其可以被标记或可能不被标记)。 从图像中提取感兴趣区域(RoI),大致对应于指定字段。 基于RoI中的黑色像素的分布,确定RoI的重心(CoG)。 此后,对于一次或多次迭代,基于所确定的CoG将RoI划分为子路由,其中在随后的迭代中,在先前迭代中生成的子路由用作RoI分区。 在一次或多次迭代中,从RoI和Sub-RoI中提取数据,这允许生成对整个指定字段进行分类的整个RoI的表示,例如作为正(标记)或否定(未标记) )。
-