Abstract:
A character recognition device includes: an acquiring unit that acquires image data describing pixel values representing colors of pixels constituting an image; a binarizing unit that binarizes the pixel values; an extracting unit that extracts boundaries of colors in the image; a delimiting unit that delimits plural image areas in the image; a specifying unit that specifies, with regard to first image areas arranged according to a predetermined rule, pixels binarized by the binarizing unit, as a subject for character recognition, and specifies, with regard to second image areas not arranged according to the predetermined rule, pixels of areas surrounded by boundaries extracted by the extracting unit, as a subject for character recognition; and a character recognition unit that recognizes characters represented by the pixels specified by the specifying unit as a subject for character recognition.
Abstract:
An image processing apparatus includes: a first determination unit that determines whether each of a plurality of image regions is for a vector image; a color number counting unit that counts the number of colors in the pixels of high-resolution image data; a frequency counting unit that counts the number of times the difference between the color of each pixel in the high-resolution image data and the color of at least one pixel located around that pixel is greater than or equal to a threshold; a second determination unit that determines that the image region is not an image region for a vector image; and a generation unit that generates image data in which a process for rendering images in the image regions determined by the first determination unit to be image regions for vector images is defined by numerical values or numerical formulas.
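The two counting units above can be illustrated with a minimal Python sketch. It assumes an image region is a list of rows of RGB tuples, that "pixels located around" a pixel means its four direct neighbors, and that color difference is the largest per-channel difference. The function names and these choices are hypothetical, and the abstract does not state how the second determination unit uses the counts.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]  # assumed RGB representation


def count_colors(region: List[List[Pixel]]) -> int:
    """Count the distinct colors that occur in the region."""
    return len({pixel for row in region for pixel in row})


def count_large_differences(region: List[List[Pixel]], threshold: int) -> int:
    """Count pixels that differ from at least one 4-neighbor by >= threshold.

    The difference measure (largest per-channel absolute difference) is an
    assumption made for this sketch.
    """
    height = len(region)
    width = len(region[0]) if height else 0
    count = 0
    for y in range(height):
        for x in range(width):
            for dy, dx in ((0, 1), (1, 0), (0, -1), (-1, 0)):
                ny, nx = y + dy, x + dx
                if 0 <= ny < height and 0 <= nx < width:
                    diff = max(abs(a - b)
                               for a, b in zip(region[y][x], region[ny][nx]))
                    if diff >= threshold:
                        count += 1
                        break  # count each pixel at most once
    return count


BLACK, WHITE = (0, 0, 0), (255, 255, 255)
sample_region = [[BLACK, WHITE],
                 [BLACK, BLACK]]
color_count = count_colors(sample_region)
edge_count = count_large_differences(sample_region, threshold=128)
```

A region with few colors and sharp transitions is typical of line art, which is one plausible reason to gather both counts, though the decision rule itself is left open here.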
Abstract:
An image processing apparatus includes: a memory; an obtaining unit that obtains image data representing an image including concatenated pixels; an isolating unit that isolates a rendering element, the rendering element being an image surrounded by border lines of a color in an image represented by the image data; and a classifying unit that, in a case where a plurality of rendering elements have been isolated by the isolating unit and the color difference between two of the rendering elements or the distance between the two rendering elements is less than a threshold, classifies the two rendering elements into the same group, associates pieces of image data that represent rendering elements belonging to the same group with one another, and stores the pieces of image data in the memory.
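The grouping rule applied by the classifying unit can be sketched as follows. This is a minimal illustration that assumes each rendering element is summarized by one RGB color and one center point, that both differences are Euclidean distances, and that grouping is transitive (handled here with a small union-find). `RenderingElement`, `group_elements`, and the two separate thresholds are hypothetical names and choices, not taken from the abstract.

```python
import math
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class RenderingElement:
    color: Tuple[int, int, int]   # assumed representative RGB color
    center: Tuple[float, float]   # assumed representative position


def group_elements(elements: List[RenderingElement],
                   color_threshold: float,
                   distance_threshold: float) -> List[List[int]]:
    """Return groups of element indices; two elements share a group when
    their color difference OR their distance is below its threshold."""
    parent = list(range(len(elements)))

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(len(elements)):
        for j in range(i + 1, len(elements)):
            a, b = elements[i], elements[j]
            color_diff = math.dist(a.color, b.color)
            distance = math.dist(a.center, b.center)
            if color_diff < color_threshold or distance < distance_threshold:
                parent[find(i)] = find(j)

    groups: Dict[int, List[int]] = {}
    for i in range(len(elements)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


elements = [
    RenderingElement((255, 0, 0), (0.0, 0.0)),
    RenderingElement((250, 0, 0), (100.0, 100.0)),  # similar color, far away
    RenderingElement((0, 0, 255), (500.0, 500.0)),  # different color, far away
]
groups = group_elements(elements, color_threshold=10, distance_threshold=20)
```

Each returned group lists the indices whose image data would be associated with one another before being stored.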
Abstract:
An image processing apparatus includes a separation section, a background color setting section, and a generating section. The separation section separates at least one image component having different attributes contained in electronic document data expressing an original image. The background color setting section selects a coloring method, from among a plurality of coloring methods for coloring a background, according to the software to be used, and sets a background color. The generating section generates software data corresponding to the software by coloring the background according to the setting of the background color setting section and re-arranging the at least one image component.
Abstract:
An image processing apparatus is provided that includes a character recognition component, a determining component, and a generating component. The generated document data contains first data, which represents the document and the entity in which the characters are mixed, and second data, which contains character code data of the characters recognized by the character recognition component and represents a character block displaying those characters. When such document data is generated, the determining component determines, based on the lightness or the distribution of the lightness of a background region around the characters of the entity or the like, whether the character block represented by the second data is hidden behind the entity represented by the first data or displayed in front of it when the document is displayed.
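A decision of this kind can be sketched in Python. The abstract names the inputs (lightness, or its distribution, in the background region) but not the rule, so the rule below is purely an assumption for illustration: a nearly uniform background places the character block in front, and anything else hides it behind the entity. `decide_layer` and `max_variance` are hypothetical names.

```python
from statistics import pvariance
from typing import Sequence


def decide_layer(background_lightness: Sequence[float],
                 max_variance: float) -> str:
    """Return "front" or "behind" for the recognized character block.

    Assumed rule (not stated in the abstract): a background whose lightness
    variance is at most max_variance is treated as uniform, so the character
    block is displayed in front of the entity.
    """
    if not background_lightness:
        raise ValueError("background region has no pixels")
    variance = pvariance(background_lightness)
    return "front" if variance <= max_variance else "behind"


uniform_result = decide_layer([200, 200, 200], max_variance=25.0)
textured_result = decide_layer([0, 255, 0, 255], max_variance=25.0)
```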
Abstract:
An image processing apparatus includes the following elements. A document-type determining unit determines what type of document a document is on the basis of read information obtained as a result of reading the document by using a document reader. A compression-format setting unit sets, on the basis of the type of document determined by the document-type determining unit, a compression format used for generating image data from the read information. A generator compresses the read information by using the compression format set by the compression-format setting unit so as to generate image data corresponding to the document.
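The three elements above form a small pipeline: determine the document type, pick a compression format for that type, then compress. The sketch below assumes the read information is a byte string of grayscale pixel values. The type heuristic and the use of `zlib` and `lzma` from the Python standard library are stand-ins chosen only so the example runs, and the abstract does not name any document types or formats.

```python
import lzma
import zlib
from typing import Callable, Dict, Tuple


def determine_document_type(read_info: bytes) -> str:
    """Hypothetical heuristic: two or fewer gray levels means a text page."""
    return "text" if len(set(read_info)) <= 2 else "photo"


# Assumed mapping from document type to (format name, compressor).
COMPRESSION_FORMATS: Dict[str, Tuple[str, Callable[[bytes], bytes]]] = {
    "text": ("zlib", zlib.compress),
    "photo": ("lzma", lzma.compress),
}


def generate_image_data(read_info: bytes) -> Tuple[str, bytes]:
    """Compress the read information with the format set for its type."""
    document_type = determine_document_type(read_info)
    format_name, compress = COMPRESSION_FORMATS[document_type]
    return format_name, compress(read_info)


text_page = bytes([0, 255] * 50)
photo_page = bytes(range(256))
text_format, text_data = generate_image_data(text_page)
photo_format, photo_data = generate_image_data(photo_page)
```

Keeping the mapping in a table separates the format choice from the compression step, which mirrors the split between the compression-format setting unit and the generator.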
Abstract:
An image processing apparatus includes a registering unit that registers a first language and a second language different from the first language, a character string extracting unit that extracts one or more character strings from reading information acquired by reading an original, plural feature character string creating sections that create a feature character string of the original on the basis of the one or more character strings extracted by the character string extracting unit, and a switching unit that switches the feature character string creating section used to create the feature character string on the basis of a combination of the registered first language and the registered second language.
Abstract:
An image reader includes a reading unit that reads an image; a detection unit that detects marks from the image read by the reading unit; a creation unit that creates a hiding image, which hides a region including the marks, on the basis of the marks detected by the detection unit; and a combining unit that combines the read image and the hiding image to create an electronic document.
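The creation and combining steps can be sketched as follows. The example assumes the detected marks are given as bounding boxes, that the hidden region is the smallest rectangle containing all marks, and that combining means overwriting hidden pixels with a fill value. All names and these choices are hypothetical, and an actual electronic document might instead keep the hiding image as a separate layer.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # left, top, right, bottom (right/bottom exclusive)


def region_including_marks(marks: List[Box]) -> Box:
    """Smallest rectangle that contains every detected mark."""
    lefts, tops, rights, bottoms = zip(*marks)
    return min(lefts), min(tops), max(rights), max(bottoms)


def create_hiding_image(width: int, height: int, region: Box) -> List[List[bool]]:
    """Boolean mask that is True wherever the read image should be hidden."""
    left, top, right, bottom = region
    return [[left <= x < right and top <= y < bottom for x in range(width)]
            for y in range(height)]


def combine(read_image: List[List[int]],
            hiding_image: List[List[bool]],
            fill: int = 0) -> List[List[int]]:
    """Overlay the hiding image on the read image."""
    return [[fill if hide else pixel for pixel, hide in zip(row, mask_row)]
            for row, mask_row in zip(read_image, hiding_image)]


read_image = [[9, 9, 9],
              [9, 9, 9],
              [9, 9, 9]]
marks = [(0, 0, 1, 1), (1, 1, 2, 2)]
region = region_including_marks(marks)
document = combine(read_image, create_hiding_image(3, 3, region))
```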