摘要:
Precise grayscale character segmentation apparatus and method. The precise grayscale character segmentation apparatus comprises an adjustment and segmentation unit for adjusting and segmenting an inputted low resolution text line image undergone coarse segmentation, so as to generate an adjusted character image; a character image binarization unit for generating a binary character image from the character image inputted therein; a noise removal unit for removing noise information in the binary character image generated by the binarization unit; and a final character image segmentation unit for generating a precisely segmented character image from the binary character image from which noise has been removed.
摘要:
A pattern segmentation apparatus and a pattern recognition apparatus can improve the segmentation precision of a character touching pattern. The pattern segmentation apparatus includes a feature amount extraction unit for extracting the feature amount of an image, a feature amount setting unit for setting the feature amount of a category, a feature amount comparison unit for comparing the feature amount of the category with the feature amount of the image, and a segmentation unit for segmenting a portion corresponding to the feature amount of the category from the image based on the comparison result.
摘要:
A word recognizing apparatus extracts the feature amount from a given image, and dynamically composes the feature amount of a candidate word to be recognized which is registered in a word list, using feature amounts of characters registered in an individual character dictionary. Then, the apparatus collates the composed feature amount of the word with the feature amount extracted from the image, calculates the degree of similarity between the two feature amounts, and outputs a recognition result.
摘要:
An image extraction system includes a connected pattern extracting part for extracting partial patterns respectively having connected pixels from an image which is formed by a block frame having a table format and including one-character frames or a free format frame, characters, graphics or symbols, a one-character frame extracting part for extracting one-character frames from the image based on the partial patterns extracted by the connected pattern extracting part, a straight line extracting part for extracting straight lines from the partial patterns which are extracted by the connected pattern extracting part and is eliminated of the one-character frames by the one-character frame extracting part, a frame detecting part for detecting straight lines forming the frame from the straight lines extracted by the straight line extracting part, and a frame separating part for separating the straight lines detected by the frame detecting part from the partial patterns so as to extract the characters, graphics or symbols.
摘要:
In a character segmenting apparatus the extracting section extracts the character segment pattern on the basis of the connection data imparted to the segment pattern. The character size calculating section calculates a histogram of a lengthwise or crosswise character size of a circumscribed rectangle circumscribed with the extracted character segment pattern and also calculates an average character size and its variance value on the basis of the histogram of the character size. The character pitch calculating section calculates a histogram of a pitch between the circumscribed rectangles and also calculates an average character pitch and its variance value on the basis of the histogram of the character pitch. The integrating section integrates the character while changing character integrating conditions in accordance with the average character size, the size variance value, the average character pitch and the pitch variance value. The segment integrating section integrates the character by distinguishing the small segment patterns in the character segment pattern on the basis of the average character size.
摘要:
An image processing method includes estimating corners of a contour of an object area in an obtained image, searching for contour lines of the object area between every two points which are offset from the estimated corners within a predetermined degree or distance along a direction away from the object area respectively, and determining intersection points of the contour lines as final corners of the contour of the object area, and determining contour lines between the final corners as a final contour of the object area.
摘要:
The present embodiments disclose a method of and a device for identifying the direction of characters in an image block. The method includes: performing optical character recognition processing on the image block by assuming various directions as assumed character directions, respectively, to obtain sub image blocks, recognized characters corresponding to the sub image blocks and correctness measures thereof in each of the assumed character directions; determining a language group to which the characters in the image block belong; adjusting a correctness measure corresponding to a sub image block which corresponds to a recognized character not belonging to the determined language group in each of the assumed character directions; calculating an accumulative correctness measure in each of the assumed character directions based on the adjusted correctness measure; and identifying the direction of the characters in the image block according to the accumulative correctness measures.
摘要:
A pointing information extraction unit extracts pointing information indicating a pointing position and a pointing time on a slide from a slide file used in a lecture and a video file of a lecture video using a pointing device. A word information generation unit analyzes a text sentence extracted from the slide file to generate a word information file indicating a word and a position thereof. A word pointing information generation unit estimates a word closest to the pointing position on the slide to generate a word pointing information file with the pointing time assigned. A fill-in-the-blank word extraction unit extracts a word having a pointing time equal to or longer than a predetermined time from the word pointing information as a fill-in-the-blank word file. A fill-in-the-blank test question is generated by setting the fill-in-the-blank word of the slide information as a blank region.
摘要:
A document image search apparatus generates a text by performing the character recognition of a document image and determines a re-process scope. Then, the apparatus generates a candidate character lattice from the re-recognition result of the re-process scope, generates character strings from the candidate character lattice and adds the character strings to the text. Then, the apparatus performs index search using the text with the character strings added.
摘要:
Character recognition apparatus and method for recognizing characters in an image, of which the character recognition apparatus comprises a text line extraction unit for extracting a plurality of text lines from an input image, a feature recognition unit for recognizing one or more features of each of the text lines, a synthetic pattern generation unit for generating synthetic character images for each of the text lines by using the features recognized by the feature recognition unit and the original character images, a synthetic dictionary generation unit for generating a synthetic dictionary for each of the text lines by using the synthetic character images, and a text line recognition unit for recognizing characters in each of the text lines by using the synthetic dictionary.