System and method facilitating document image compression utilizing a mask
    11.
    Granted invention patent (status: in force)

    Publication No.: US07764834B2

    Publication date: 2010-07-27

    Application No.: US11465083

    Filing date: 2006-08-16

    IPC classification: G06K9/00

    CPC classification: G06K9/38 G06K2209/01

    Abstract: A system and method facilitating document image compression utilizing a mask separating a foreground of a document image from a background is provided. The invention includes a pixel energy analyzer adapted to partition regions into a foreground and background. The invention further provides for a merge region component adapted to attempt to merge regions if the merged region would not exceed a threshold energy. Merged regions are partitioned into a new foreground and new background. Thereafter, a mask storage component stores the partitioning information in a binary mask.
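
The partition-and-merge idea in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not define "energy", so the sketch assumes it is the sum of squared deviations of the foreground and background pixels from their respective means. The function names (`sse`, `partition`, `try_merge`), the dark-pixels-are-foreground convention, and the threshold value are all hypothetical.

```python
def sse(values):
    """Sum of squared deviations from the mean (0.0 for an empty list)."""
    if not values:
        return 0.0
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


def partition(pixels):
    """Return (energy, cut) for a non-empty list of grayscale values.

    Pixels with value < cut are treated as foreground, the rest as background.
    Energy is assumed to be SSE(foreground) + SSE(background).
    """
    ordered = sorted(pixels)
    # Baseline: everything is background (cut at the minimum value).
    best_energy, best_cut = sse(ordered), ordered[0]
    for k in range(1, len(ordered)):
        if ordered[k] == ordered[k - 1]:
            continue  # equal values cannot be separated by a cut
        energy = sse(ordered[:k]) + sse(ordered[k:])
        if energy < best_energy:
            best_energy, best_cut = energy, ordered[k]
    return best_energy, best_cut


def try_merge(region_a, region_b, threshold):
    """Merge two regions only if the merged energy stays within the threshold.

    Returns (merged_pixels, binary_mask), or None when the merge is rejected.
    """
    merged = region_a + region_b
    energy, cut = partition(merged)  # re-partition the merged region
    if energy > threshold:
        return None
    mask = [1 if v < cut else 0 for v in merged]  # 1 = foreground, 0 = background
    return merged, mask


text_a = [10, 12, 200, 202]  # dark ink on light paper
text_b = [11, 201]           # a similar neighbouring region
photo = [95, 100, 105]       # mid-gray values that do not fit two classes

print(try_merge(text_a, text_b, threshold=10.0))  # merge accepted
print(try_merge(text_a, photo, threshold=10.0))   # merge rejected (None)
```

In this toy data, the two text-like regions merge because two classes still explain the merged pixels well. Adding the mid-gray region would push the energy far above the threshold, so that merge is refused.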

    Clustering
    12.
    Granted invention patent (status: in force)

    Publication No.: US07376275B2

    Publication date: 2008-05-20

    Application No.: US11198562

    Filing date: 2005-08-05

    IPC classification: G06K9/68

    Abstract: Systems and methods for performing clustering of a document image are disclosed. A property of a mark extracted from a document is compared to the properties of the existing clusters. If the property of the mark fails to match any of the properties of the existing clusters, the mark is added as a new cluster to the existing clusters. One property that can be utilized is x size and y size, that is, the width and height of the existing clusters. Another property that can be employed is ink size, which refers to the ratio of black pixels to total pixels in a cluster. Yet another property that can be utilized is a reduced mark or image, which is a pixel-size-reduced version of the bitmap of the mark and/or cluster. The above properties can be employed to identify mismatches and reduce the number of bit-by-bit comparisons performed.
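
The screening idea in this abstract (rule out clusters by cheap properties before paying for a bit-by-bit comparison) can be sketched as follows. This is an illustrative sketch only. The bitmap representation (lists of 0/1 rows), the exact-size rule, the tolerances `ink_tol` and `max_diff`, and all function names are assumptions rather than details from the patent. The reduced-mark property is omitted for brevity.

```python
def properties(bitmap):
    """Return (x size, y size, ink size) for a non-empty rectangular 0/1 bitmap."""
    height, width = len(bitmap), len(bitmap[0])
    black = sum(sum(row) for row in bitmap)
    return width, height, black / (width * height)


def bit_difference(a, b):
    """Count differing pixels between two bitmaps of identical size."""
    return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def cluster_marks(marks, ink_tol=0.3, max_diff=1):
    """Group marks into clusters; return (clusters, bit-by-bit comparison count)."""
    clusters = []
    comparisons = 0
    for mark in marks:
        width, height, ink = properties(mark)
        for cluster in clusters:
            c_width, c_height, c_ink = cluster["props"]
            # Cheap screens: a size or ink mismatch rules the cluster out at once.
            if (width, height) != (c_width, c_height):
                continue
            if abs(ink - c_ink) > ink_tol:
                continue
            comparisons += 1  # only now pay for the full comparison
            if bit_difference(mark, cluster["bitmap"]) <= max_diff:
                cluster["members"].append(mark)
                break
        else:
            # No existing cluster matched: the mark starts a new cluster.
            clusters.append(
                {"bitmap": mark, "props": (width, height, ink), "members": [mark]}
            )
    return clusters, comparisons


marks = [
    [[1, 1], [1, 1]],        # solid 2x2 mark
    [[1, 1], [1, 0]],        # same size, one pixel off
    [[1, 0, 0], [0, 0, 0]],  # different size
    [[0, 0], [0, 1]],        # same size as the first, far less ink
]
clusters, comparisons = cluster_marks(marks)
print(len(clusters), comparisons)
```

With these four marks, only the second mark reaches the bit-by-bit stage. The third is rejected on size and the fourth on ink size, so each becomes a new cluster without any pixel-level comparison.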

    Compression of bi-level images with explicit representation of ink clusters
    13.
    Granted invention patent (status: in force)

    Publication No.: US07317838B2

    Publication date: 2008-01-08

    Application No.: US11734299

    Filing date: 2007-04-12

    IPC classification: G06K9/36 G06K9/46

    Abstract: A system and method facilitating compression of bi-level images with explicit representation of ink clusters is provided. The present invention includes a cluster shape estimator that analyzes connected component information, extracts clusters and stores each cluster in a global dictionary, a page dictionary or a store of unclustered shapes. A bitmap estimation from clusters component determines dictionary positions for clusters stored in the global dictionary which are then encoded. A cluster position estimator determines page positions of clusters of the global dictionary and/or the page dictionary that are then encoded. Further, the global dictionary, the page dictionary and the store of unclustered shapes are also encoded.
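
The three stores named in this abstract can be illustrated with a small sketch. The abstract does not say how a cluster is routed to a store. The sketch assumes that shapes seen on more than one page go to the global dictionary, shapes repeated on one page only go to that page's dictionary, and shapes seen once stay unclustered. The shape identifiers, the record layout, and the function names (`route_shapes`, `encode_page`) are hypothetical, and no entropy coding is attempted.

```python
from collections import defaultdict


def route_shapes(pages):
    """Split shapes into (global_dict, page_dicts, unclustered).

    `pages` is a list of pages; each page is a list of (shape_id, (x, y)).
    A shape_id is any hashable value standing in for a cluster's bitmap.
    """
    pages_seen = defaultdict(set)
    count = defaultdict(int)
    for page_no, page in enumerate(pages):
        for shape, _pos in page:
            pages_seen[shape].add(page_no)
            count[shape] += 1
    # Assumed routing rule: multi-page -> global, repeated on one page -> page
    # dictionary, seen exactly once -> unclustered store.
    global_dict = sorted(s for s in count if len(pages_seen[s]) > 1)
    page_dicts = [
        sorted({s for s, _ in page if count[s] > 1 and len(pages_seen[s]) == 1})
        for page in pages
    ]
    unclustered = sorted(s for s in count if count[s] == 1)
    return global_dict, page_dicts, unclustered


def encode_page(page, global_dict, page_dict):
    """Turn a page into (store tag, index or shape, x, y) position records."""
    records = []
    for shape, (x, y) in page:
        if shape in global_dict:
            records.append(("G", global_dict.index(shape), x, y))
        elif shape in page_dict:
            records.append(("P", page_dict.index(shape), x, y))
        else:
            records.append(("U", shape, x, y))  # stored as-is, not by reference
    return records


pages = [
    [("e", (0, 0)), ("e", (5, 0)), ("q", (9, 0)), ("t", (12, 0))],
    [("t", (0, 0)), ("z", (4, 0))],
]
global_dict, page_dicts, unclustered = route_shapes(pages)
print(global_dict, page_dicts, unclustered)
print(encode_page(pages[1], global_dict, page_dicts[1]))
```

Referencing a shape by dictionary index plus page position means a repeated shape's bitmap is stored once, which is where the compression gain in this kind of scheme would come from.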

    Clustering
    14.
    Granted invention patent (status: in force)

    Publication No.: US07164797B2

    Publication date: 2007-01-16

    Application No.: US10133558

    Filing date: 2002-04-25

    IPC classification: G06K9/68

    Abstract: Systems and methods for performing clustering of a document image are disclosed. A property of a mark extracted from a document is compared to the properties of the existing clusters. If the property of the mark fails to match any of the properties of the existing clusters, the mark is added as a new cluster to the existing clusters. One property that can be utilized is x size and y size, that is, the width and height of the existing clusters. Another property that can be employed is ink size, which refers to the ratio of black pixels to total pixels in a cluster. Yet another property that can be utilized is a reduced mark or image, which is a pixel-size-reduced version of the bitmap of the mark and/or cluster. The above properties can be employed to identify mismatches and reduce the number of bit-by-bit comparisons performed.
