摘要:
An image processing device includes a feature emphasis unit for extracting first image frequency information from image data for each first unit area, a boundary provisional determination unit for defining a value obtained by adding a predetermined weight to the first image frequency information as representative feature information, and provisionally determining as a boundary a first unit area whose variance from the representative feature information of the adjacent area is at or higher than a predetermined level, and a boundary determination unit for extracting the second image frequency information for each second unit area smaller than the first unit area in the range of a provisionally determined position and the vicinity, and generating boundary information using as the boundary a second unit area whose value based on the variance from the second image frequency information of the adjacent area is at or higher than a predetermined level.
摘要:
An edge detecting method detects edge segments by searching all of the search lines forming an image from an end of the image in a direction perpendicular to the edges. If a line whose edge segment cannot be detected exists, a search is made in all of the search lines from the vicinity of the center of the image toward the end of the image, whereby edge segments are detected. A linear edge is determined from edge segments. A plurality of edge candidates are obtained from the edge segments for all of the search lines, and an optimum candidate is selected from among the edge candidates. Ruled lines are extracted from the source document in the image, and an optimum candidate is selected based on a comparison with a ruled line. As a result, an edge of the source document can be detected with high accuracy even if an image on a background side is unstable, or if materials of the background and the source document are similar.
摘要:
It is an object of the present invention to improve the compression ratio of a color image and to clearly display the outlines of characters and the like. A hue cluster classifying/unifying unit reduces the number of hue values of each pixel in a color image, based on a hue histogram, allocates the number-reduced hue value to each pixel and classifies pixels with the same hue value into one cluster. Furthermore, the unit unifies clusters whose hue values are below a predetermined value. The unit also traces the outline of a cluster whose size is below a reference value and determines that a cluster that has a lot of change points belongs to a character area. An encoding unit determines the characteristic of each cluster, based on both an area determined by an area determining unit and whether the cluster belongs to a ruled line area or a character area, and encodes pixels in each cluster by a coding method suitable for the characteristic of the cluster.
摘要:
It is an object of the present invention to improve the compression ratio of a color image and to clearly display the outlines of characters and the like. A hue cluster classifying/unifying unit reduces the number of hue values of each pixel in a color image, based on a hue histogram, allocates the number-reduced hue value to each pixel and classifies pixels with the same hue value into one cluster. Furthermore, the unit unifies clusters whose hue values are below a predetermined value. The unit also traces the outline of a cluster whose size is below a reference value and determines that a cluster that has a lot of change points belongs to a character area. An encoding unit determines the characteristic of each cluster, based on both an area determined by an area determining unit and whether the cluster belongs to a ruled line area or a character area, and encodes pixels in each cluster by a coding method suitable for the characteristic of the cluster.
摘要:
An edge detecting method detects edge segments by searching all of the search lines forming an image from an end of the image in a direction perpendicular to the edges. If a line whose edge segment cannot be detected exists, a search is made in all of the search lines from the vicinity of the center of the image toward the end of the image, whereby edge segments are detected. A linear edge is determined from edge segments. A plurality of edge candidates are obtained from the edge segments for all of the search lines, and an optimum candidate is selected from among the edge candidates. Ruled lines are extracted from the source document in the image, and an optimum candidate is selected based on a comparison with a ruled line. As a result, an edge of the source document can be detected with high accuracy even if an image on a background side is unstable, or if materials of the background and the source document are similar.
摘要:
The present invention is a character recognition apparatus, which comprises a background discriminating section, a non-character line discriminating section, a first non-character line removed image creating section that creates a first non-character line removed image, which is an original image from which the non-character line is removed, a first character area discriminating section, an enlarged image creating section, a second non-character line removed image creating section, an interference judgment section that judges whether or not the character and the non-character line interfere with each other in the original image, a character image restoring section that restores the character image, a second character area discriminating section, and a character recognizing section that digitizes the character area recognized by the second character area discriminating section and recognizes the character, thereby characters written on a color form are recognized at a high accuracy.
摘要:
Disclosed is a format recognition method, apparatus and its storage medium for automatically recognizing the format of a form, whereby the format is automatically determined by examining the arrangement of the smallest rectangles. According to the present invention, the smallest rectangles are extracted from a form, and the positional relationship of these rectangles is obtained. The attribute of the smallest rectangle is determined from the positional relationship. In accordance with the attribute, the smallest rectangles are sorted into a headline portion and a data portion, and a character string in the data portion is recognized.
摘要:
A method for identifying payment forms accurately identifies types of forms without adding special form identification data. A payment form discrimination method for discriminating payment forms which state a payee account number and a payment amount includes a step of acquiring an image of the form, a step of making a search for the payee account number in the image in accordance with an account number searching rule, and a step of discriminating the type of form based on the searched payee account number. Types of forms can be identified accurately and fast without adding special form identification data, because the form identification is performed using easily searchable account numbers.
摘要:
The present invention is a character recognition apparatus, which comprises a background discriminating section, a non-character line discriminating section, a first non-character line removed image creating section that creates a first non-character line removed image, which is an original image from which the non-character line is removed, a first character area discriminating section, an enlarged image creating section, a second non-character line removed image creating section, an interference judgment section that judges whether or not the character and the non-character line interfere with each other in the original image, a character image restoring section that restores the character image, a second character area discriminating section, and a character recognizing section that digitizes the character area recognized by the second character area discriminating section and recognizes the character, thereby characters written on a color form are recognized at a high accuracy.
摘要:
A data sheet identification device of the invention includes: a character/graphics extracting section, an identical shape deciding section, a graphics collating section, an identification code/data sheet ID identifying section for collating characters that have been decided to have the same shape with an identification code/data sheet ID database in which a plurality of characters showing features of a plurality of data sheets respectively have been registered, and an identifying section for uniquely identifying the data sheet based on a result of the collation by the graphics collating section and a result of the collation by the identification code/data sheet ID identifying section.