摘要:
A character recognition system or the like in which character identities are stored in accordance with a hierarchical order established during processing to separate text image areas from non-text image areas. To separate text image areas from non-text image areas, blocks of pixels are selected from pixel image data by outlining contours of connected components in the pixel image data, determining whether the outlined connected components include text units or non-text units, selectively connecting text units widthwisely to form text lines, and selectively connecting text lines vertically to form text blocks. After blocks of pixels have been so selected, text blocks are segmented into lines of pixel image data, and characters are cut from the lines of pixel image data so obtained. If desired, the characters may be cut by a two-step cutting process in which non-touching and non-overlapping characters are first cut out, and touching characters are then cut out. The cut-out characters are then recognized, and the characters are stored in accordance with an order established during the block selecting process.
摘要:
In a character recognition system or the like, method and apparatus for selecting blocks of pixels from pixel image data so as to permit identification and grouping of similarly-typed pixels, such as text-type pixels and non-text-type pixels. Pixel image data is inputted and, if the pixel image data is not binary image data then the pixel image data is converted into binary pixel image data. Blocks of pixel image data are selected by outlining contours of connected components in the pixel image data, determining whether the outlined connected components include text unit or non-text units based on the size of the outlined connected components, selectively connecting text units widthwisely to form text lines based on proximity of adjacent text units, and selectively connecting text lines vertically to form text blocks based on proximity of adjacent text lines and on the position of non-text units between text lines. A hierarchical tree is formed based on the outlined connected components.
摘要:
In a feature extraction system for analyzing image data of an input document image, a feature extraction method identifies image data as a table image and identifies text image within the table image by performing the steps of inputting image data of a document page, performing block selection processing on the document page, the block selection process identifies and separates the image data into blocks having the same image type, identifying table image data based on the separated blocks of image data, identifying text blocks within the table image data, horizontally sorting all text blocks located in the table image data based on horizontal position information, vertically sorting all text blocks located in the table image data based on vertical position information, separating text blocks into rows and columns based on a result of the vertical and the horizontal sorting steps, assigning column and row address coordinates to each text block in the table image data based on the separating step, and storing the assigned address of each text block.
摘要:
A system to render a color image using a binarized image representing the color image and a hierarchical tree structure representing the color image, the hierarchical tree structure including nodes representing respective blocks of image data within the color image, the nodes containing color information for respective blocks. The system includes a defining step to define, in a memory, a color image rendering area corresponding to a block of image data in the color image, an obtaining step to obtain foreground color information from a node corresponding to the block of image data, a detecting step to detect black pixel locations in the binarized image within an area of the binarized image corresponding to the block of image data, and an assigning step to assign the foreground color to pixels at locations in the color image rendering area corresponding to the detected black pixel locations.
摘要:
A system for editing the hierarchical tree structure which is created by a block selection system to correspond to a block template which represents a document image, wherein the hierarchical tree structure includes a plurality of nodes, each of which represents a block of document image data in the block template of a document image and contains document feature data defining features of the block of image data. The system operates to download from memory the hierarchical tree structure, generate and display a block template representing a document image corresponding to the hierarchical tree structure in memory, select a block of document image data to be edited in the displayed block template, edit a feature of the selected block of image data and update the document feature data in a node corresponding to the selected block of image data. The system determines whether any document feature data in any node has been affected by updated feature data, and, if so, document feature data in the affected nodes are appropriately altered to reflect the new features of corresponding blocks of image data.
摘要:
A system to identify features of a color document in which primary color values representing a color document are input, a threshold binarizing range is calculated based on the input values, the input values are binarized into binary values based on the threshold binarizing range, a colored region is identified within the document, and a frame is defined surrounding the identified colored region. A second threshold binarizing range is calculated based on input primary values corresponding to the colored region, and the input primary values corresponding to the colored region are binarized into binarized values based on the second threshold binarizing range.
摘要:
A page analysis system, which utilizes a block selection application to analyze image data of a page in a multi-page document, includes the features of 1) returning an error code in the case that data to be stored in either a common memory work area or a hierarchical tree storage memory area exceeds the allocated memory space, 2) calculating a skew angle of a page and returning an error code in the case the skew angle exceeds a predefined maximum skew angle, 3) designating a default processing direction in the case a user fails to input directional information of the image data in the page, 4) determining and indicating whether identified picture image information represents a halftone image, a line drawing, a joint line, or unknown picture type, 5) analyzing image data of a portion of a page which has been designated by input coordinates, and 6) identifying a block which contains at least two image types as a composite block and identifying the type of image data within the composite block.
摘要:
A document image capture method and scanner, and an image processing apparatus incorporating such a scanner, in which a document is scanned two or more times. The first scan preferably provides bi-level image data, which is analyzed to identify blocks of uniform image type (for example, text, line drawing, grayscale image, or full-color image) within the document. The second scan, preferably performed at lower resolution than the first, provides grayscale or color information, which is substituted in the grayscale or color blocks, respectively, for the bi-level information obtained in the first scan. A third scan, to provide information of the third type, may also be performed. An operator preferably views an image of the document, based on the scanned information, to be sure that the identification and typing of the various blocks has been done correctly, and may instruct that the document be rescanned to provide new data for a designated portion of the document image, if it appears that an error has occurred. The information representing the document image obtained in this way is preferably stored using a set of linked bit maps, one bit map for each block. The memory capacity needed to store the information can be reduced further by treating the page and its margins as a frame, and by storing information about the frame, and any horizontal or vertical lines in the document, in simple vector form. Any portion of the document which is just background is not stored.
摘要:
A method for identifying and extracting text data from a table-cell frame. The method includes the steps of tracing connected components of a document image, tracing white contours within a connected component, defining a frame outline based on the white contours, identifying unattached character data inside the frame outline, and defining an initial rectangular area inside the frame outline. The method further includes detecting black pixels in a horizontal or vertical direction from the initial rectangular area in order to create an extended character area, locating boundary pixels lying inside the extended character area for each white contour, identifying black pixels positioned between boundary pixels lying inside the extended character area, combining black pixels positioned between boundary pixels lying inside the extended character area so as to form at least one connected component, recognizing the at least one connected component as a text component if it is not recognized as a vertical line, as a horizontal line, as part of a broken line, or as part of the frame, and defining a character node of a hierarchical tree structure corresponding to the extended character area and containing both the at least one connected component and any identified unattached connected components.
摘要:
In an image processing system for processing image data which includes both text areas and non-text areas, a method for extracting image data by performing block selection to obtain circumscribing rectangles around each block of text type areas in the image data and around each block of non-text type areas in the image data, obtaining outline pairs for each text and non-text block, determining whether the circumscribing rectangles overlap, decomposing overlapped rectangles based on the outline pairs, extracting image data based on the circumscribing rectangles for non-overlapped rectangles and based on the decomposed rectangles for overlapped rectangles, and processing the extracted image data.