Method and apparatus for producing a hybrid data structure for displaying a raster image

    Publication No.: US5729637A

    Publication date: 1998-03-17

    Application No.: US420827

    Filing date: 1995-04-10

    Abstract: A system for producing a raster image derived from a data structure including a data processing apparatus, a recognizer which performs recognition on an input bitmap to the data processing apparatus to detect identifiable objects within the input bitmap, a mechanism for producing a hybrid data structure including coded data corresponding to the identifiable objects and to non-identifiable objects and the input bitmap, and an output device capable of developing a visually perceptible raster image derived from the input bitmap in the hybrid data structure. The raster image is derived from the input bitmap and thus includes no misrecognition errors. The invention includes a method for producing a hybrid data structure for a bitmap of an image having the steps of inputting a bitmap into a digital processing apparatus, partitioning the bitmap into a hierarchy of lexical units, assigning labels to a label list for each lexical unit of a predetermined hierarchical level, where labels in the label list have an associated confidence level, and storing each lexical unit in a hybrid data structure as either an identifiable object or a non-identifiable object. The entire input bitmap or portions thereof are also stored in the hybrid data structure to be displayed.
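
The hybrid data structure described in this abstract can be pictured with a small Python sketch. It is only an illustration under stated assumptions, not the patented implementation: the names `Label`, `LexicalUnit`, `HybridDocument`, `build_hybrid`, `searchable_text` and `crop` are hypothetical, the recognizer output is passed in as plain data, and the use of a fixed confidence cut-off (`threshold`) to separate identifiable from non-identifiable units is an assumption that the abstract does not state.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

BBox = Tuple[int, int, int, int]  # (left, top, right, bottom), right/bottom exclusive


@dataclass
class Label:
    text: str
    confidence: float  # assumed to lie in 0.0 .. 1.0


@dataclass
class LexicalUnit:
    bbox: BBox                 # where the unit sits in the input bitmap
    labels: List[Label]        # label list, best candidate first
    identifiable: bool = False


@dataclass
class HybridDocument:
    bitmap: List[List[int]]    # the input bitmap is kept for display
    units: List[LexicalUnit] = field(default_factory=list)


def build_hybrid(bitmap, recognized, threshold=0.9):
    """Build the hybrid structure from hypothetical recognizer output.

    `recognized` is a list of (bbox, [(text, confidence), ...]) pairs.
    ASSUMPTION: a unit counts as identifiable when its best label reaches
    `threshold`; the abstract does not say how the decision is made.
    """
    doc = HybridDocument(bitmap=bitmap)
    for bbox, candidates in recognized:
        labels = sorted((Label(t, c) for t, c in candidates),
                        key=lambda label: label.confidence, reverse=True)
        identifiable = bool(labels) and labels[0].confidence >= threshold
        doc.units.append(LexicalUnit(bbox, labels, identifiable))
    return doc


def searchable_text(doc):
    """Coded text of the identifiable units only (e.g. for search)."""
    return " ".join(u.labels[0].text for u in doc.units if u.identifiable)


def crop(doc, unit):
    """Pixels of one unit, always taken from the stored input bitmap."""
    left, top, right, bottom = unit.bbox
    return [row[left:right] for row in doc.bitmap[top:bottom]]
```

Because `crop` always reads from the stored bitmap, a low-confidence label never changes what is displayed, which mirrors the abstract's point that the displayed raster image contains no misrecognition errors.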

    2. Method and apparatus for removing noise from a digital image
    Granted patent (in force)

    Publication No.: US07660483B2

    Publication date: 2010-02-09

    Application No.: US11291552

    Filing date: 2005-11-30

    IPC classification: G06K9/40

    Abstract: One embodiment of the present invention provides a system that removes noise from an image. During operation, the system first identifies blobs in the image, wherein a blob is a set of contiguous pixels which possibly represents a character or a portion of a character in the image. Next, the system analyzes the blobs to dynamically determine a “noise threshold” for the blobs. The system then removes blobs from the image which are below the noise threshold.
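
The three steps in this abstract (identify blobs, derive a noise threshold from them, remove the blobs below it) can be sketched in Python as follows. This is a minimal illustration, not the patented method: the function names are hypothetical, blobs are taken to be 8-connected groups of non-zero pixels, blob size is measured in pixels, and the threshold rule (a fraction of the median blob size) is an assumption, since the abstract does not say how the threshold is computed.

```python
from collections import deque
from statistics import median


def find_blobs(image):
    """Return blobs as lists of (row, col); a blob is an 8-connected
    group of non-zero pixels."""
    rows = len(image)
    cols = len(image[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    blobs = []
    for r in range(rows):
        for c in range(cols):
            if not image[r][c] or seen[r][c]:
                continue
            seen[r][c] = True
            queue = deque([(r, c)])
            blob = []
            while queue:  # breadth-first flood fill
                y, x = queue.popleft()
                blob.append((y, x))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and image[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            blobs.append(blob)
    return blobs


def noise_threshold(blobs, fraction=0.25):
    """ASSUMPTION: the threshold is a fraction of the median blob size,
    so it adapts to the blobs actually found in the image."""
    if not blobs:
        return 0.0
    return fraction * median(len(blob) for blob in blobs)


def remove_noise(image, fraction=0.25):
    """Return a copy of `image` with blobs below the threshold erased."""
    blobs = find_blobs(image)
    threshold = noise_threshold(blobs, fraction)
    cleaned = [row[:] for row in image]
    for blob in blobs:
        if len(blob) < threshold:
            for y, x in blob:
                cleaned[y][x] = 0
    return cleaned
```

Deriving the threshold from the median keeps it tied to the typical character size on the page, so the same code treats a speck differently on a large-type page than on a small-type one.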

    3. Reconstructing high-fidelity electronic documents from images via generation of synthetic fonts
    Granted patent (in force)

    Publication No.: US07519221B1

    Publication date: 2009-04-14

    Application No.: US11069510

    Filing date: 2005-02-28

    Abstract: A system creates an electronic version of a document from page-images of the document, wherein the electronic version replicates both the logical content and the physical appearance of the original document. During operation, the system receives the page-images for the document. Next, the system extracts character images from the page-images, and generates a synthetic font for the document from the extracted character images. Finally, the system constructs the electronic version of the document by using the synthetic font to represent text regions of the document, and by using image-segments extracted from the page-images to represent non-text regions of the document.
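
The idea of generating a synthetic font from extracted character images can be illustrated with a short Python sketch. It is a toy model under stated assumptions, not the patented method: the function names are hypothetical, every extracted glyph is assumed to be already labelled with its character and scaled to a common grid size, and the per-pixel majority vote used to merge samples is an assumption that the abstract does not describe.

```python
from collections import defaultdict


def build_synthetic_font(samples):
    """Merge extracted character images into one glyph per character.

    `samples` is an iterable of (char, glyph) pairs, where a glyph is a
    grid of 0/1 pixels. ASSUMPTION: all glyphs of a character share one
    size, and a pixel is set when at least half of the samples set it.
    """
    groups = defaultdict(list)
    for char, glyph in samples:
        groups[char].append(glyph)

    font = {}
    for char, glyphs in groups.items():
        height, width = len(glyphs[0]), len(glyphs[0][0])
        count = len(glyphs)
        font[char] = [
            [1 if 2 * sum(g[y][x] for g in glyphs) >= count else 0
             for x in range(width)]
            for y in range(height)
        ]
    return font


def render_text(font, text):
    """Render a text region by placing synthetic glyphs side by side.

    ASSUMPTION: all glyphs in `font` have the same height.
    """
    if not text:
        return []
    height = len(font[text[0]])
    rows = [[] for _ in range(height)]
    for char in text:
        for y in range(height):
            rows[y].extend(font[char][y])
    return rows
```

Merging several samples of the same character suppresses pixel noise present in any single sample, while the rendered text still resembles the original type because the glyphs come from the document itself.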

    4. Method and apparatus for removing noise from a digital image
    Granted patent (in force)

    Publication No.: US08064721B2

    Publication date: 2011-11-22

    Application No.: US12648250

    Filing date: 2009-12-28

    IPC classification: G06K9/40

    Abstract: One embodiment of the present invention provides a system that removes noise from an image. During operation, the system first identifies blobs in the image, wherein a blob is a set of contiguous pixels which possibly represents a character or a portion of a character in the image. Next, the system analyzes the blobs to dynamically determine a “noise threshold” for the blobs. The system then removes blobs from the image which are below the noise threshold.

    Method and apparatus for producing a hybrid data structure for displaying a raster image

    Publication No.: US06661919B2

    Publication date: 2003-12-09

    Application No.: US10054944

    Filing date: 2002-01-25

    IPC classification: G06K9/34

    Abstract: A system for producing a raster image derived from coded and non-coded portions of a hybrid data structure from an input bitmap including (1) a data processing apparatus, (2) a recognizer which performs recognition on an input bitmap to the data processing apparatus to detect identifiable objects within the input bitmap, (3) a mechanism for producing a hybrid data structure including coded data corresponding to the identifiable objects and non-coded data derived from portions of the input bitmap which do not correspond to the identifiable objects, and (4) an output device capable of developing a visually perceptible raster image derived from the hybrid data structure. The raster image includes raster images of the identifiable objects and raster images derived from portions of the input bitmap that do not correspond to the identifiable objects. The invention includes a method for producing a hybrid data structure for a bitmap of an image having the steps of: (1) inputting a signal comprising a bitmap into a digital processing apparatus, (2) partitioning the bitmap into a hierarchy of lexical units, (3) assigning labels to a label list for each lexical unit of a predetermined hierarchical level, where labels in the label list have an associated confidence level, and (4) storing each lexical unit in a hybrid data structure as either an identifiable object or a non-identifiable object.

    6. Method and apparatus for producing a hybrid data structure for displaying a raster image
    Granted patent (expired)

    Publication No.: US5999649A

    Publication date: 1999-12-07

    Application No.: US736250

    Filing date: 1996-10-24

    Abstract: A system for producing a raster image derived from coded and non-coded portions of a hybrid data structure from an input bitmap including (1) a data processing apparatus, (2) a recognizer which performs recognition on an input bitmap to the data processing apparatus to detect identifiable objects within the input bitmap, (3) a mechanism for producing a hybrid data structure including coded data corresponding to the identifiable objects and non-coded data derived from portions of the input bitmap which do not correspond to the identifiable objects, and (4) an output device capable of developing a visually perceptible raster image derived from the hybrid data structure. The raster image includes raster images of the identifiable objects and raster images derived from portions of the input bitmap that do not correspond to the identifiable objects. The invention includes a method for producing a hybrid data structure for a bitmap of an image having the steps of: (1) inputting a signal comprising a bitmap into a digital processing apparatus, (2) partitioning the bitmap into a hierarchy of lexical units, (3) assigning labels to a label list for each lexical unit of a predetermined hierarchical level, where labels in the label list have an associated confidence level, and (4) storing each lexical unit in a hybrid data structure as either an identifiable object or a non-identifiable object.

    7. Method and apparatus for producing a hybrid data structure for displaying a raster image
    Granted patent (expired)

    Publication No.: US5625711A

    Publication date: 1997-04-29

    Application No.: US298655

    Filing date: 1994-08-31

    Abstract: A system for producing a raster image derived from coded and non-coded portions of a hybrid data structure from an input bitmap including (1) a data processing apparatus, (2) a recognizer which performs recognition on an input bitmap to the data processing apparatus to detect identifiable objects within the input bitmap, (3) a mechanism for producing a hybrid data structure including coded data corresponding to the identifiable objects and non-coded data derived from portions of the input bitmap which do not correspond to the identifiable objects, and (4) an output device capable of developing a visually perceptible raster image derived from the hybrid data structure. The raster image includes raster images of the identifiable objects and raster images derived from portions of the input bitmap that do not correspond to the identifiable objects. The invention includes a method for producing a hybrid data structure for a bitmap of an image having the steps of: (1) inputting a signal comprising a bitmap into a digital processing apparatus, (2) partitioning the bitmap into a hierarchy of lexical units, (3) assigning labels to a label list for each lexical unit of a predetermined hierarchical level, where labels in the label list have an associated confidence level, and (4) storing each lexical unit in a hybrid data structure as either an identifiable object or a non-identifiable object.

    9. Method and apparatus for removing noise from a digital image
    Patent application (in force)

    Publication No.: US20090022397A1

    Publication date: 2009-01-22

    Application No.: US11291552

    Filing date: 2005-11-30

    IPC classification: G06K9/34 G06K9/38

    Abstract: One embodiment of the present invention provides a system that removes noise from an image. During operation, the system first identifies blobs in the image, wherein a blob is a set of contiguous pixels which possibly represents a character or a portion of a character in the image. Next, the system analyzes the blobs to dynamically determine a “noise threshold” for the blobs. The system then removes blobs from the image which are below the noise threshold.

    10. Method and Apparatus for Removing Noise from a Digital Image
    Patent application (in force)

    Publication No.: US20100166307A1

    Publication date: 2010-07-01

    Application No.: US12648250

    Filing date: 2009-12-28

    IPC classification: G06K9/34

    Abstract: One embodiment of the present invention provides a system that removes noise from an image. During operation, the system first identifies blobs in the image, wherein a blob is a set of contiguous pixels which possibly represents a character or a portion of a character in the image. Next, the system analyzes the blobs to dynamically determine a “noise threshold” for the blobs. The system then removes blobs from the image which are below the noise threshold.
