Method and apparatus for character recognition
    1.
    发明授权
    Method and apparatus for character recognition 失效
    用于字符识别的方法和装置

    公开(公告)号:US5680478A

    公开(公告)日:1997-10-21

    申请号:US265833

    申请日:1994-06-27

    IPC分类号: G06K9/20 G06K9/32 G06K9/34

    摘要: A character recognition system or the like in which character identities are stored in accordance with a hierarchical order established during processing to separate text image areas from non-text image areas. To separate text image areas from non-text image areas, blocks of pixels are selected from pixel image data by outlining contours of connected components in the pixel image data, determining whether the outlined connected components include text units or non-text units, selectively connecting text units widthwisely to form text lines, and selectively connecting text lines vertically to form text blocks. After blocks of pixels have been so selected, text blocks are segmented into lines of pixel image data, and characters are cut from the lines of pixel image data so obtained. If desired, the characters may be cut by a two-step cutting process in which non-touching and non-overlapping characters are first cut out, and touching characters are then cut out. The cut-out characters are then recognized, and the characters are stored in accordance with an order established during the block selecting process.

    摘要翻译: 一种字符识别系统等,其中根据在处理期间建立的分层顺序来存储字符标识以将文本图像区域与非文本图像区域分开。 为了将文本图像区域与非文本图像区域分离,通过概述像素图像数据中的连接分量的轮廓来选择像素图像数据块,确定所概述的连接的组件是否包括文本单元或非文本单元,选择性地连接 宽度方向的文本单元以形成文本行,并且垂直选择性地连接文本行以形成文本块。 在如此选择像素块之后,文本块被分割成像素图像数据的行,并且从如此获得的像素图像数据的线切割字符。 如果需要,可以通过两步切割处理来切割字符,其中首先切出非接触和不重叠的字符,然后切割接触的字符。 然后识别切出的字符,并且根据在块选择处理期间建立的顺序来存储字符。

    Method and apparatus for character recognition
    2.
    发明授权
    Method and apparatus for character recognition 失效
    用于字符识别的方法和装置

    公开(公告)号:US5680479A

    公开(公告)日:1997-10-21

    申请号:US873012

    申请日:1992-04-24

    IPC分类号: G06K9/20 G06K9/32 G06K9/34

    摘要: In a character recognition system or the like, method and apparatus for selecting blocks of pixels from pixel image data so as to permit identification and grouping of similarly-typed pixels, such as text-type pixels and non-text-type pixels. Pixel image data is inputted and, if the pixel image data is not binary image data then the pixel image data is converted into binary pixel image data. Blocks of pixel image data are selected by outlining contours of connected components in the pixel image data, determining whether the outlined connected components include text unit or non-text units based on the size of the outlined connected components, selectively connecting text units widthwisely to form text lines based on proximity of adjacent text units, and selectively connecting text lines vertically to form text blocks based on proximity of adjacent text lines and on the position of non-text units between text lines. A hierarchical tree is formed based on the outlined connected components.

    摘要翻译: 在字符识别系统等中,用于从像素图像数据中选择像素块的方法和装置,以便允许识别和分组类似类型的像素,例如文本型像素和非文本型像素。 输入像素图像数据,如果像素图像数据不是二进制图像数据,则将像素图像数据转换为二进制像素图像数据。 通过概述像素图像数据中的连接分量的轮廓来选择像素图像数据块,基于所概述的连接分量的大小来确定所概述的连接分量是包括文本​​单位还是非文本单位,选择性地连接文本单元以形成 基于相邻文本单元的邻近度的文本行,以及基于相邻文本行的邻近度以及文本行之间的非文本单元的位置,垂直地选择性地连接文本行以形成文本块。 基于概述的连接组件形成分层树。

    Feature extraction system for identifying text within a table image
    3.
    发明授权
    Feature extraction system for identifying text within a table image 失效
    用于识别表格图像内的文本的特征提取系统

    公开(公告)号:US5848186A

    公开(公告)日:1998-12-08

    申请号:US514252

    申请日:1995-08-11

    CPC分类号: G06K9/00463

    摘要: In a feature extraction system for analyzing image data of an input document image, a feature extraction method identifies image data as a table image and identifies text image within the table image by performing the steps of inputting image data of a document page, performing block selection processing on the document page, the block selection process identifies and separates the image data into blocks having the same image type, identifying table image data based on the separated blocks of image data, identifying text blocks within the table image data, horizontally sorting all text blocks located in the table image data based on horizontal position information, vertically sorting all text blocks located in the table image data based on vertical position information, separating text blocks into rows and columns based on a result of the vertical and the horizontal sorting steps, assigning column and row address coordinates to each text block in the table image data based on the separating step, and storing the assigned address of each text block.

    摘要翻译: 在用于分析输入文档图像的图像数据的特征提取系统中,特征提取方法将图像数据识别为表格图像,并且通过执行输入文档页面的图像数据,执行块选择的步骤来识别表格图像内的文本图像 在文档页面上进行处理,块选择处理将图像数据识别并分离成具有相同图像类型的块,基于分离的图像数据块识别表格图像数据,识别表格图像数据内的文本块,水平排列所有文本 基于水平位置信息位于表格图像数据中的块,基于垂直位置信息垂直排列位于表格图像数据中的所有文本块,基于垂直和水平分类步骤的结果将文本块分成行和列, 将列和行地址坐标分配给基于分离的表格图像数据中的每个文本块 并且存储每个文本块的分配的地址。

    Color editing system
    4.
    发明授权
    Color editing system 失效
    彩色编辑系统

    公开(公告)号:US06496198B1

    公开(公告)日:2002-12-17

    申请号:US09304687

    申请日:1999-05-04

    申请人: Shin-Ywan Wang

    发明人: Shin-Ywan Wang

    IPC分类号: G09G500

    CPC分类号: G06T11/001 G06T11/60

    摘要: A system to render a color image using a binarized image representing the color image and a hierarchical tree structure representing the color image, the hierarchical tree structure including nodes representing respective blocks of image data within the color image, the nodes containing color information for respective blocks. The system includes a defining step to define, in a memory, a color image rendering area corresponding to a block of image data in the color image, an obtaining step to obtain foreground color information from a node corresponding to the block of image data, a detecting step to detect black pixel locations in the binarized image within an area of the binarized image corresponding to the block of image data, and an assigning step to assign the foreground color to pixels at locations in the color image rendering area corresponding to the detected black pixel locations.

    摘要翻译: 使用表示彩色图像的二值化图像和表示彩色图像的分层树结构来渲染彩色图像的系统,分层树结构包括表示彩色图像内的图像数据的各个块的节点,所述节点包含各个块的颜色信息 。 该系统包括:在存储器中定义与彩色图像中的图像数据块相对应的彩色图像呈现区域的定义步骤,从与图像数据块对应的节点获得前景颜色信息的获取步骤, 检测步骤,用于检测对应于图像数据块的二值化图像的区域内的二值化图像中的黑色像素位置;以及分配步骤,用于将前景颜色分配给与检测到的黑色对应的彩色图像渲染区域中的像素 像素位置。

    Block selection review and editing system
    5.
    发明授权
    Block selection review and editing system 失效
    块选择审查和编辑系统

    公开(公告)号:US5825944A

    公开(公告)日:1998-10-20

    申请号:US834856

    申请日:1997-04-10

    申请人: Shin-Ywan Wang

    发明人: Shin-Ywan Wang

    摘要: A system for editing the hierarchical tree structure which is created by a block selection system to correspond to a block template which represents a document image, wherein the hierarchical tree structure includes a plurality of nodes, each of which represents a block of document image data in the block template of a document image and contains document feature data defining features of the block of image data. The system operates to download from memory the hierarchical tree structure, generate and display a block template representing a document image corresponding to the hierarchical tree structure in memory, select a block of document image data to be edited in the displayed block template, edit a feature of the selected block of image data and update the document feature data in a node corresponding to the selected block of image data. The system determines whether any document feature data in any node has been affected by updated feature data, and, if so, document feature data in the affected nodes are appropriately altered to reflect the new features of corresponding blocks of image data.

    摘要翻译: 一种用于编辑由块选择系统创建以对应于表示文档图像的块模板的分层树结构的系统,其中分层树结构包括多个节点,每个节点表示文档图像数据块 文档图像的块模板,并且包含定义图像数据块的特征的文档特征数据。 该系统从存储器中下载层次树结构,生成和显示表示与存储器中的分层树结构相对应的文档图像的块模板,在显示的块模板中选择要编辑的文档图像数据块,编辑特征 并且在与所选择的图像数据块对应的节点中更新文档特征数据。 系统确定任何节点中的任何文档特征数据是否受到更新的特征数据的影响,如果是,受影响的节点中的文档特征数据被适当地改变以反映对应的图像数据块的新特征。

    Color block selection
    6.
    发明授权
    Color block selection 失效
    色块选择

    公开(公告)号:US06360006B1

    公开(公告)日:2002-03-19

    申请号:US09161716

    申请日:1998-09-29

    申请人: Shin-Ywan Wang

    发明人: Shin-Ywan Wang

    IPC分类号: G06K900

    摘要: A system to identify features of a color document in which primary color values representing a color document are input, a threshold binarizing range is calculated based on the input values, the input values are binarized into binary values based on the threshold binarizing range, a colored region is identified within the document, and a frame is defined surrounding the identified colored region. A second threshold binarizing range is calculated based on input primary values corresponding to the colored region, and the input primary values corresponding to the colored region are binarized into binarized values based on the second threshold binarizing range.

    摘要翻译: 一种用于识别其中输入了表示彩色文档的基色值的彩色文档的特征的系统,基于输入值计算阈值二值化范围,基于阈值二值化范围将输入值二值化为二进制值, 区域被识别在文档内,并且围绕识别的有色区域定义框架。 基于对应于着色区域的输入主值计算第二阈值二值化范围,并且与彩色区域对应的输入主值基于第二阈值二值化范围二值化为二值化值。

    System for designating document direction
    7.
    发明授权
    System for designating document direction 失效
    系统指定文件方向

    公开(公告)号:US6014458A

    公开(公告)日:2000-01-11

    申请号:US697276

    申请日:1996-08-27

    申请人: Shin-Ywan Wang

    发明人: Shin-Ywan Wang

    摘要: A page analysis system, which utilizes a block selection application to analyze image data of a page in a multi-page document, includes the features of 1) returning an error code in the case that data to be stored in either a common memory work area or a hierarchical tree storage memory area exceeds the allocated memory space, 2) calculating a skew angle of a page and returning an error code in the case the skew angle exceeds a predefined maximum skew angle, 3) designating a default processing direction in the case a user fails to input directional information of the image data in the page, 4) determining and indicating whether identified picture image information represents a halftone image, a line drawing, a joint line, or unknown picture type, 5) analyzing image data of a portion of a page which has been designated by input coordinates, and 6) identifying a block which contains at least two image types as a composite block and identifying the type of image data within the composite block.

    摘要翻译: 利用块选择应用来分析多页文档中的页面的图像数据的页面分析系统包括以下特征:1)在存储在公共存储器工作区域中的数据的情况下返回错误代码 或分级树存储区域超过分配的存储器空间,2)计算页面的偏斜角度并在偏斜角度超过预定最大倾斜角度的情况下返回错误代码; 3)在该情况下指定默认处理方向 用户不能输入页面中的图像数据的方向信息,4)确定和指示所识别的图像信息是否表示半色调图像,线条图,联合线或未知图像类型,5)分析图像数据的图像数据 由输入坐标指定的页面的部分,以及6)将包含至少两个图像类型的块识别为复合块并识别复合图像内的图像数据的类型 块。

    Method for capturing a document image, a scanner using the method and a document image management system using the scanner
    8.
    发明授权
    Method for capturing a document image, a scanner using the method and a document image management system using the scanner 失效
    拍摄文件图像的方法,使用该方法的扫描仪和使用该扫描仪的文件图像管理系统

    公开(公告)号:US06449065B1

    公开(公告)日:2002-09-10

    申请号:US09306730

    申请日:1999-05-07

    IPC分类号: H04N140

    CPC分类号: H04N1/40062

    摘要: A document image capture method and scanner, and an image processing apparatus incorporating such a scanner, in which a document is scanned two or more times. The first scan preferably provides bi-level image data, which is analyzed to identify blocks of uniform image type (for example, text, line drawing, grayscale image, or full-color image) within the document. The second scan, preferably performed at lower resolution than the first, provides grayscale or color information, which is substituted in the grayscale or color blocks, respectively, for the bi-level information obtained in the first scan. A third scan, to provide information of the third type, may also be performed. An operator preferably views an image of the document, based on the scanned information, to be sure that the identification and typing of the various blocks has been done correctly, and may instruct that the document be rescanned to provide new data for a designated portion of the document image, if it appears that an error has occurred. The information representing the document image obtained in this way is preferably stored using a set of linked bit maps, one bit map for each block. The memory capacity needed to store the information can be reduced further by treating the page and its margins as a frame, and by storing information about the frame, and any horizontal or vertical lines in the document, in simple vector form. Any portion of the document which is just background is not stored.

    摘要翻译: 一种文件图像捕获方法和扫描仪,以及并入有扫描仪的图像处理装置,其中文件被扫描两次或更多次。 第一扫描优选地提供双层图像数据,其被分析以识别文档内的统一图像类型(例如,文本,线条图,灰度图像或全色图像)的块。 优选地以比第一扫描仪更低的分辨率执行的第二扫描提供灰度或颜色信息,灰度或颜色信息被分别代替在第一扫描中获得的双电平信息的灰阶或彩色块中。 还可以执行用于提供第三类型的信息的第三扫描。 操作者优选地基于扫描的信息来查看文档的图像,以确保已经正确地完成了各种块的识别和打字,并且可以指示文档被重新扫描以提供用于指定部分的指定部分的新数据 文档图像,如果看起来发生错误。 表示以这种方式获得的文档图像的信息优选地使用一组链接位图来存储,每个块的一个位图。 存储信息所需的存储容量可以通过将页面及其边距视为框架进行处理,并以简单的向量形式存储关于框架的信息以及文档中的任何水平或垂直线。 仅存在背景的文档的任何部分都不存储。

    System for extracting attached text
    9.
    发明授权
    System for extracting attached text 失效
    用于提取附加文本的系统

    公开(公告)号:US6157738A

    公开(公告)日:2000-12-05

    申请号:US664675

    申请日:1996-06-17

    申请人: Shin-Ywan Wang

    发明人: Shin-Ywan Wang

    摘要: A method for identifying and extracting text data from a table-cell frame. The method includes the steps of tracing connected components of a document image, tracing white contours within a connected component, defining a frame outline based on the white contours, identifying unattached character data inside the frame outline, and defining an initial rectangular area inside the frame outline. The method further includes detecting black pixels in a horizontal or vertical direction from the initial rectangular area in order to create an extended character area, locating boundary pixels lying inside the extended character area for each white contour, identifying black pixels positioned between boundary pixels lying inside the extended character area, combining black pixels positioned between boundary pixels lying inside the extended character area so as to form at least one connected component, recognizing the at least one connected component as a text component if it is not recognized as a vertical line, as a horizontal line, as part of a broken line, or as part of the frame, and defining a character node of a hierarchical tree structure corresponding to the extended character area and containing both the at least one connected component and any identified unattached connected components.

    摘要翻译: 一种用于从表格单元框架中识别和提取文本数据的方法。 该方法包括以下步骤:跟踪文档图像的连接的组件,在连接的组件内跟踪白色轮廓,基于白色轮廓定义框架轮廓,识别框架轮廓内部的未连接的字符数据,以及在框架内部定义初始矩形区域 大纲。 该方法还包括从初始矩形区域检测水平或垂直方向上的黑色像素,以便产生扩展字符区域,定位位于每个白色轮廓的扩展字符区域内的边界像素,识别位于内部的边界像素之间的黑色像素 所述扩展字符区域,组合位于所述扩展字符区域内的边界像素之间的黑色像素,以便形成至少一个连接部件,如果将所述至少一个连接的部件识别为文本部件,则将其识别为垂直线,如 作为虚线的一部分或作为帧的一部分的水平线,并且定义与扩展字符区域相对应的分层树结构的字符节点,并且包含至少一个连接的组件和任何已识别的未连接的连接组件。

    Block selection system in which overlapping blocks are decomposed
    10.
    发明授权
    Block selection system in which overlapping blocks are decomposed 失效
    重叠块分解的块选择系统

    公开(公告)号:US5774579A

    公开(公告)日:1998-06-30

    申请号:US514250

    申请日:1995-08-11

    CPC分类号: G06K9/00442

    摘要: In an image processing system for processing image data which includes both text areas and non-text areas, a method for extracting image data by performing block selection to obtain circumscribing rectangles around each block of text type areas in the image data and around each block of non-text type areas in the image data, obtaining outline pairs for each text and non-text block, determining whether the circumscribing rectangles overlap, decomposing overlapped rectangles based on the outline pairs, extracting image data based on the circumscribing rectangles for non-overlapped rectangles and based on the decomposed rectangles for overlapped rectangles, and processing the extracted image data.

    摘要翻译: 在用于处理包括文本区域和非文本区域的图像数据的图像处理系统中,通过执行块选择来提取图像数据的方法,以获得图像数据中的每个文本类型区域块周围的每个块周围的周边矩形 获取图像数据中的非文本类型区域,获得每个文本和非文本块的轮廓对,确定外接矩形是否重叠,基于轮廓对分解重叠矩形,基于非重叠的外接矩形提取图像数据 并且基于用于重叠矩形的分解矩形,并且处理提取的图像数据。