Method for automated image indexing and retrieval
    1.
    发明授权
    Method for automated image indexing and retrieval 有权
    自动图像索引和检索的方法

    公开(公告)号:US07324711B2

    公开(公告)日:2008-01-29

    申请号:US11295405

    申请日:2005-12-05

    IPC分类号: G06K9/54

    摘要: A method of indexing images contained in scanned documents, wherein said scanned documents are stored in a repository, includes: for each document to be stored in the repository, dividing the document into a plurality of sections; scanning the plurality of sections; segmenting each scanned segment according to a predetermined coding model into image segment and non-image segments; associating each of the image segments with the document; and generating an index correlating the image segments with the document. The method may further include, at the time of image recall, displaying the index of image segments in a user interface; and responsive to selection of an image segment from the index, displaying the document information associated with the image segment in the user interface.

    摘要翻译: 一种对包含在扫描文档中的图像进行索引的方法,其中所述扫描的文档存储在存储库中,包括:对于要存储在存储库中的每个文档,将文档分成多个部分; 扫描多个部分; 根据预定的编码模型将每个扫描的片段分割成图像片段和非图像片段; 将每个图像段与文档相关联; 以及生成将图像片段与文档相关联的索引。 该方法还可以包括在图像调用时在用户界面中显示图像片段的索引; 并且响应于从索引中选择图像片段,在用户界面中显示与图像片段相关联的文档信息。

    Method for automated image indexing and retrieval
    2.
    发明申请
    Method for automated image indexing and retrieval 有权
    自动图像索引和检索的方法

    公开(公告)号:US20060072830A1

    公开(公告)日:2006-04-06

    申请号:US11295405

    申请日:2005-12-05

    IPC分类号: G06K9/62

    摘要: A method of indexing images contained in scanned documents, wherein said scanned documents are stored in a repository, includes: for each document to be stored in the repository, dividing the document into a plurality of sections; scanning the plurality of sections; segmenting each scanned segment according to a predetermined coding model into image segment and non-image segments; associating each of the image segments with the document; and generating an index correlating the image segments with the document. The method may further include, at the time of image recall, displaying the index of image segments in a user interface; and responsive to selection of an image segment from the index, displaying the document information associated with the image segment in the user interface.

    摘要翻译: 一种对包含在扫描文档中的图像进行索引的方法,其中所述扫描的文档存储在存储库中,包括:对于要存储在存储库中的每个文档,将文档分成多个部分; 扫描多个部分; 根据预定的编码模型将每个扫描的片段分割成图像片段和非图像片段; 将每个图像段与文档相关联; 以及生成将图像片段与文档相关联的索引。 该方法还可以包括在图像调用时在用户界面中显示图像片段的索引; 并且响应于从索引中选择图像片段,在用户界面中显示与图像片段相关联的文档信息。

    METHOD FOR AUTOMATED IMAGE INDEXING AND RETRIEVAL
    3.
    发明申请
    METHOD FOR AUTOMATED IMAGE INDEXING AND RETRIEVAL 失效
    自动图像索引和检索方法

    公开(公告)号:US20080055669A1

    公开(公告)日:2008-03-06

    申请号:US11925134

    申请日:2007-10-26

    IPC分类号: H04N1/40

    摘要: A method of indexing images contained in scanned documents, wherein said scanned documents are stored in a repository, includes: for each document to be stored in the repository, dividing the document into a plurality of sections; scanning the plurality of sections; segmenting each scanned segment according to a predetermined coding model into image segment and non-image segments; associating each of the image segments with the document; and generating an index correlating the image segments with the document. The method may further include, at the time of image recall, displaying the index of image segments in a user interface; and responsive to selection of an image segment from the index, displaying the document information associated with the image segment in the user interface.

    摘要翻译: 一种对包含在扫描文档中的图像进行索引的方法,其中所述扫描的文档存储在存储库中,包括:对于要存储在存储库中的每个文档,将文档分成多个部分; 扫描多个部分; 根据预定的编码模型将每个扫描的片段分割成图像片段和非图像片段; 将每个图像段与文档相关联; 以及生成将图像片段与文档相关联的索引。 该方法还可以包括在图像调用时在用户界面中显示图像片段的索引; 并且响应于从索引中选择图像片段,在用户界面中显示与图像片段相关联的文档信息。

    Method for automated image indexing and retrieval
    4.
    发明授权
    Method for automated image indexing and retrieval 失效
    自动图像索引和检索的方法

    公开(公告)号:US07813595B2

    公开(公告)日:2010-10-12

    申请号:US11925134

    申请日:2007-10-26

    IPC分类号: G06K9/54 H04N1/40

    摘要: A method of indexing images contained in scanned documents, wherein said scanned documents are stored in a repository, includes: for each document to be stored in the repository, dividing the document into a plurality of sections; scanning the plurality of sections; segmenting each scanned segment according to a predetermined coding model into image segment and non-image segments; associating each of the image segments with the document; and generating an index correlating the image segments with the document. The method may further include, at the time of image recall, displaying the index of image segments in a user interface; and responsive to selection of an image segment from the index, displaying the document information associated with the image segment in the user interface.

    摘要翻译: 一种对包含在扫描文档中的图像进行索引的方法,其中所述扫描的文档存储在存储库中,包括:对于要存储在存储库中的每个文档,将文档分成多个部分; 扫描多个部分; 根据预定的编码模型将每个扫描的片段分割成图像片段和非图像片段; 将每个图像段与文档相关联; 以及生成将图像片段与文档相关联的索引。 该方法还可以包括在图像调用时在用户界面中显示图像片段的索引; 并且响应于从索引中选择图像片段,在用户界面中显示与图像片段相关联的文档信息。

    Resizing a digital document image via background content removal
    5.
    发明授权
    Resizing a digital document image via background content removal 失效
    通过背景内容删除调整数字文档图像的大小

    公开(公告)号:US08274533B2

    公开(公告)日:2012-09-25

    申请号:US12369790

    申请日:2009-02-12

    IPC分类号: G09G5/02

    CPC分类号: G06T3/0012

    摘要: What is disclosed is a system and method for performing a background deletion that exploits both local and global context to remove background and other white space between objects with the aim of retaining structural relationships between objects in the document. A document image is received and seams are carved through the image. Seams composed of uniform background pixels are identified. Adjacent seams containing background pixels are collected into groups of seams. The background seam groups are classified according to their widths. A target number of seams to be removed for each background seam group is then determined based on the classification. Seam groups which are wider will have at least the same or a greater target number of seams to be deleted therefrom than will seam groups of narrower widths. The document image is then resized by deleting seams from the seam groups based on the assigned target number.

    摘要翻译: 公开的是用于执行背景删除的系统和方法,其利用本地和全局上下文来移除对象之间的背景和其他空白空间,目的是保留文档中的对象之间的结构关系。 收到文件图像,并通过图像刻成接缝。 识别由均匀背景像素构成的接缝。 包含背景像素的相邻接缝被收集成一组接缝。 背景缝组根据其宽度进行分类。 然后基于分类确定要为每个背景接缝组去除的目标接缝数目。 与较窄宽度的接缝组相比,更宽的接缝组将具有至少相同或更大的目标数量的接缝。 然后通过基于分配的目标号码从接缝组中删除接缝来调整文档图像的大小。

    Document type classification for scanned bitmaps
    6.
    发明授权
    Document type classification for scanned bitmaps 失效
    扫描位图的文档类型分类

    公开(公告)号:US08462394B2

    公开(公告)日:2013-06-11

    申请号:US12185904

    申请日:2008-08-05

    CPC分类号: H04N1/40062

    摘要: Systems and methods are described that facilitate determining an original document format for a scanned document by analyzing a bitmap thereof. Text objects are extracted from the document, binarized, and segmented to identify text. Page orientation and text size are used to distinguish between a slideshow-type document, and a word processing or spreadsheet-type document. To further distinguish between the word processing and spreadsheet types, text column structure and count is analyzed.

    摘要翻译: 描述了通过分析其位图来帮助确定扫描文档的原始文档格式的系统和方法。 从文档中提取文本对象,进行二值化和分段以识别文本。 页面方向和文字大小用于区分幻灯片式文档和文字处理或电子表格类型的文档。 为了进一步区分文字处理和电子表格类型,分析文本列结构和计数。

    Object based adaptive document resizing
    7.
    发明授权
    Object based adaptive document resizing 失效
    基于对象的自适应文档调整大小

    公开(公告)号:US08423900B2

    公开(公告)日:2013-04-16

    申请号:US12544561

    申请日:2009-08-20

    IPC分类号: G06F3/00 G06F3/14

    摘要: What is disclosed is a resizing method that utilizes segmentation information to classify objects found within a document and then selects the most appropriate resizing technique for each identified object. The present method employs readily available document parsers to reliably extract objects. e.g. text, background, images, graphics, etc., which compose the document. Information obtained from a document parser is utilized to identify the document components for classification. The extracted objects are then classified according to their object type. Each of classified objects are then resized using a resizing technique having been pre-selected for the object type based on their respective abilities to resize certain types of document content over other resizing techniques. The present method advantageously extends smart or content-based scaling and is especially useful for N-up or variable-information printing. The present method finds its intended uses in enhancing N-up and handout options currently provided in a variety of print-drivers.

    摘要翻译: 所公开的是一种调整大小的方法,其利用分段信息对在文档中找到的对象进行分类,然后为每个识别的对象选择最合适的调整大小的技术。 本方法使用容易获得的文档解析器来可靠地提取对象。 例如 文本,背景,图像,图形等等。 从文档解析器获得的信息用于识别用于分类的文档组件。 然后将提取的对象根据其对象类型进行分类。 然后,使用已经针对对象类型预先选择的调整大小的技术来调整每个分类对象的大小,这些大小基于它们各自的能力,以便通过其他大小调整技术调整某些类型的文档内容的大小。 本方法有利地扩展智能或基于内容的缩放,并且对于N上或可变信息打印特别有用。 本方法用于增强目前在各种打印驱动程序中提供的N-up和Handout选项。

    OBJECT BASED ADAPTIVE DOCUMENT RESIZING
    8.
    发明申请
    OBJECT BASED ADAPTIVE DOCUMENT RESIZING 失效
    基于对象的自适应文档

    公开(公告)号:US20110047505A1

    公开(公告)日:2011-02-24

    申请号:US12544561

    申请日:2009-08-20

    IPC分类号: G06F3/048

    摘要: What is disclosed is a resizing method that utilizes segmentation information to classify objects found within a document and then selects the most appropriate resizing technique for each identified object. The present method employs readily available document parsers to reliably extract objects. e.g. text, background, images, graphics, etc., which compose the document. Information obtained from a document parser is utilized to identify the document components for classification. The extracted objects are then classified according to their object type. Each of classified objects are then resized using a resizing technique having been pre-selected for the object type based on their respective abilities to resize certain types of document content over other resizing techniques. The present method advantageously extends smart or content-based scaling and is especially useful for N-up or variable-information printing. The present method finds its intended uses in enhancing N-up and handout options currently provided in a variety of print-drivers.

    摘要翻译: 所公开的是一种调整大小的方法,其利用分段信息对在文档中找到的对象进行分类,然后为每个识别的对象选择最合适的调整大小的技术。 本方法使用容易获得的文档解析器来可靠地提取对象。 例如 文本,背景,图像,图形等等。 从文档解析器获得的信息用于识别用于分类的文档组件。 然后将提取的对象根据其对象类型进行分类。 然后,使用已经针对对象类型预先选择的调整大小的技术来调整每个分类对象的大小,这些大小基于它们各自的能力,以便通过其他大小调整技术调整某些类型的文档内容的大小。 本方法有利地扩展智能或基于内容的缩放,并且对于N上或可变信息打印特别有用。 本方法用于增强目前在各种打印驱动程序中提供的N-up和Handout选项。

    RESIZING A DIGITAL DOCUMENT IMAGE VIA BACKGROUND CONTENT REMOVAL
    9.
    发明申请
    RESIZING A DIGITAL DOCUMENT IMAGE VIA BACKGROUND CONTENT REMOVAL 失效
    通过背景内容删除来修复数字文档图像

    公开(公告)号:US20100201711A1

    公开(公告)日:2010-08-12

    申请号:US12369790

    申请日:2009-02-12

    IPC分类号: G09G5/00

    CPC分类号: G06T3/0012

    摘要: What is disclosed is a system and method for performing a background deletion that exploits both local and global context to remove background and other white space between objects with the aim of retaining structural relationships between objects in the document. A document image is received and seams are carved through the image. Seams composed of uniform background pixels are identified. Adjacent seams containing background pixels are collected into groups of seams. The background seam groups are classified according to their widths. A target number of seams to be removed for each background seam group is then determined based on the classification. Seam groups which are wider will have at least the same or a greater target number of seams to be deleted therefrom than will seam groups of narrower widths. The document image is then resized by deleting seams from the seam groups based on the assigned target number.

    摘要翻译: 公开的是用于执行背景删除的系统和方法,其利用本地和全局上下文来移除对象之间的背景和其他空白空间,目的是保留文档中的对象之间的结构关系。 收到文件图像,并通过图像刻成接缝。 识别由均匀背景像素构成的接缝。 包含背景像素的相邻接缝被收集成一组接缝。 背景缝组根据其宽度进行分类。 然后基于分类确定要为每个背景接缝组去除的目标接缝数目。 与较窄宽度的接缝组相比,更宽的接缝组将具有至少相同或更大的目标数量的接缝。 然后通过基于分配的目标号码从接缝组中删除接缝来调整文档图像的大小。

    System for recording image data from a set of sheets having similar graphic elements
    10.
    发明申请
    System for recording image data from a set of sheets having similar graphic elements 有权
    用于从具有相似图形元素的一组纸张记录图像数据的系统

    公开(公告)号:US20050190981A1

    公开(公告)日:2005-09-01

    申请号:US10788944

    申请日:2004-02-26

    IPC分类号: G06K9/20 G06K9/36 H04N7/26

    CPC分类号: H04N19/90 G06K9/00449

    摘要: In an input scanning system, as would be present in a digital copier, a “template” of similar visual elements or objects, such as logos and other designs, is detected among a series of scanned images. The common objects form a reference image against which subsequently-recorded input images are compared. If bounding boxes around objects in the input images match those in the reference image, the objects in the bounding boxes are attempted to be matched to those in the reference image. If objects in the input image and reference image match, then the image data from the input image is coded using a pointer to the corresponding object in the reference image.

    摘要翻译: 在输入扫描系统​​中,如在数字复印机中所存在的那样,在一系列扫描图像中检测到类似的视觉元素或物体(诸如徽标和其他设计)的“模板”。 公共对象形成参考图像,随后记录的输入图像被比较。 如果输入图像中的对象周围的边框与参考图像中的边框相匹配,则尝试将边界框中的对象与参考图像中的对象进行匹配。 如果输入图像和参考图像中的对象匹配,则使用指向参考图像中的相应对象的指针对来自输入图像的图像数据进行编码。