Abstract:
The present embodiments disclose a method and a device for identifying the direction of characters in an image block. The method includes: performing optical character recognition on the image block with each of several directions taken in turn as the assumed character direction, to obtain, for each assumed character direction, sub image blocks, the recognized characters corresponding to the sub image blocks, and their correctness measures; determining the language group to which the characters in the image block belong; in each assumed character direction, adjusting the correctness measure of every sub image block whose recognized character does not belong to the determined language group; calculating an accumulative correctness measure for each assumed character direction based on the adjusted correctness measures; and identifying the direction of the characters in the image block according to the accumulative correctness measures.
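The scoring steps in this abstract can be illustrated with a minimal sketch. The abstract does not say how the language group is determined or how the correctness measure is adjusted, so the majority vote, the `penalty` factor, and the helper names `language_group` and `identify_direction` below are all assumptions made for illustration only.

```python
from collections import Counter


def language_group(ch: str) -> str:
    """Hypothetical, very coarse language grouping by code point."""
    if "\u4e00" <= ch <= "\u9fff":
        return "cjk"
    if ch.isascii() and ch.isalnum():
        return "latin"
    return "other"


def identify_direction(results, penalty=0.5):
    """results maps an assumed direction (in degrees) to a list of
    (recognized_char, correctness_measure) pairs, one per sub image block."""
    # Assumption: the language group is chosen by majority vote over
    # every recognized character in every assumed direction.
    votes = Counter(
        language_group(ch) for blocks in results.values() for ch, _ in blocks
    )
    group = votes.most_common(1)[0][0]

    # Assumption: "adjusting" means scaling down the measure of any block
    # whose recognized character falls outside the chosen group.
    totals = {}
    for direction, blocks in results.items():
        totals[direction] = sum(
            score if language_group(ch) == group else score * penalty
            for ch, score in blocks
        )

    # The direction with the largest accumulative measure wins.
    best = max(totals, key=totals.get)
    return best, group, totals


ocr_results = {
    0: [("A", 0.9), ("B", 0.8), ("C", 0.9)],
    180: [("V", 0.6), ("中", 0.95), ("D", 0.5)],
}
best, group, totals = identify_direction(ocr_results)
```

In this toy input the stray CJK character recognized in the 180° pass has its measure halved, so it cannot inflate the total for the wrong direction.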
Abstract:
A pointing information extraction unit extracts pointing information indicating a pointing position and a pointing time on a slide from a slide file used in a lecture and a video file of a lecture video using a pointing device. A word information generation unit analyzes a text sentence extracted from the slide file to generate a word information file indicating a word and a position thereof. A word pointing information generation unit estimates a word closest to the pointing position on the slide to generate a word pointing information file with the pointing time assigned. A fill-in-the-blank word extraction unit extracts a word having a pointing time equal to or longer than a predetermined time from the word pointing information as a fill-in-the-blank word file. A fill-in-the-blank test question is generated by setting the fill-in-the-blank word of the slide information as a blank region.
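As a rough sketch of the pipeline described here: each pointing event is assigned to the nearest word, and words pointed at for long enough become blanks. The data layout (word centers as `(word, x, y)` tuples, pointing events as `(x, y, seconds)`), the accumulation of pointing time per word, and all function names are assumptions for illustration, not details given in the abstract.

```python
import math
from collections import defaultdict


def nearest_word(words, x, y):
    """words: list of (word, x, y) centers on a slide. Returns the closest word."""
    return min(words, key=lambda w: math.hypot(w[1] - x, w[2] - y))[0]


def blank_words(words, pointings, min_seconds):
    """pointings: list of (x, y, seconds).
    Returns the words whose total pointing time is >= min_seconds."""
    dwell = defaultdict(float)
    for x, y, seconds in pointings:
        # Assumption: pointing time accumulates across separate events.
        dwell[nearest_word(words, x, y)] += seconds
    return {word for word, total in dwell.items() if total >= min_seconds}


def make_question(text, blanks):
    """Replace every selected word in the slide text with a blank region."""
    return " ".join("____" if tok in blanks else tok for tok in text.split())


slide_words = [("gradient", 10, 10), ("descent", 60, 10), ("converges", 120, 10)]
events = [(12, 11, 2.0), (58, 9, 0.5), (11, 12, 1.5)]
blanks = blank_words(slide_words, events, min_seconds=3.0)
question = make_question("gradient descent converges", blanks)
```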
Abstract:
A grayscale character dictionary generation apparatus, comprising: a first synthetic grayscale degraded character image generation unit for generating first synthetic grayscale degraded character images from input binary character images; a clustering unit for dividing each category of the first synthetic grayscale degraded character images into a plurality of clusters; a template generation unit for generating a template for each of the clusters; a transformation matrix generation unit for generating a transformation matrix for each of the templates; and a second synthetic grayscale degraded character dictionary generation unit for obtaining the character features of every grayscale degraded character in each of the clusters using the transformation matrix, and for constructing an eigenspace for each category of the synthetic grayscale degraded characters, which is the second synthetic grayscale character dictionary.
Abstract:
A document image search apparatus generates a text by performing the character recognition of a document image and determines a re-process scope. Then, the apparatus generates a candidate character lattice from the re-recognition result of the re-process scope, generates character strings from the candidate character lattice and adds the character strings to the text. Then, the apparatus performs index search using the text with the character strings added.
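The lattice expansion step can be sketched as follows. The lattice representation (one list of candidate characters per position), the `limit` cap, and the function names are assumptions; the abstract does not describe how candidates are scored or pruned, so this sketch simply enumerates combinations.

```python
from itertools import product


def expand_lattice(lattice, limit=100):
    """lattice: list of positions, each a list of candidate characters.
    Returns up to `limit` character strings formed by one choice per position."""
    strings = []
    for combo in product(*lattice):
        strings.append("".join(combo))
        if len(strings) >= limit:
            break
    return strings


def augment_text(text, lattice):
    """Append the strings generated from the lattice to the recognized text."""
    return text + " " + " ".join(expand_lattice(lattice))


# Re-recognition of the scope "c1oud" yields alternatives at two positions.
lattice = [["c"], ["l", "1"], ["o", "0"], ["u"], ["d"]]
candidates = expand_lattice(lattice)
augmented = augment_text("c1oud storage", lattice)
```

A keyword search for "cloud" over the augmented text now succeeds even though the first recognition pass misread the word.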
Abstract:
A character recognition apparatus and method for recognizing characters in an image. The character recognition apparatus comprises a text line extraction unit for extracting a plurality of text lines from an input image, a feature recognition unit for recognizing one or more features of each of the text lines, a synthetic pattern generation unit for generating synthetic character images for each of the text lines by using the features recognized by the feature recognition unit and the original character images, a synthetic dictionary generation unit for generating a synthetic dictionary for each of the text lines by using the synthetic character images, and a text line recognition unit for recognizing characters in each of the text lines by using the synthetic dictionary.
Abstract:
The connected element extraction device stores a binary image signal as binary image data; calculates a rectangle circumscribing each connected element; determines whether the rectangle overlaps with others and stores the label and coordinates of each overlapping rectangle; generates a label image of the overlapping rectangle; based on the label image, divides the overlapping rectangle, as a parent rectangle, into child rectangles and repeatedly subdivides the child rectangles into final child rectangles, each of which has a single label; and outputs information comprising the binary image data and the coordinates of the non-overlapping rectangles, parent rectangles, and final child rectangles.
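The first stages of this process (labeling connected elements, computing circumscribing rectangles, and detecting overlaps) might look like the sketch below. It assumes 4-connectivity and inclusive `(top, left, bottom, right)` bounds, neither of which the abstract specifies, and it does not attempt the subdivision into child rectangles.

```python
from collections import deque


def label_components(image):
    """4-connected labeling of a binary image (list of rows of 0/1).
    Returns {label: (top, left, bottom, right)} with inclusive bounds."""
    h, w = len(image), len(image[0])
    seen = [[False] * w for _ in range(h)]
    boxes = {}
    label = 0
    for r in range(h):
        for c in range(w):
            if not image[r][c] or seen[r][c]:
                continue
            label += 1
            top, left, bottom, right = r, c, r, c
            queue = deque([(r, c)])
            seen[r][c] = True
            while queue:  # breadth-first flood fill of one connected element
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if 0 <= ny < h and 0 <= nx < w and image[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes[label] = (top, left, bottom, right)
    return boxes


def overlaps(a, b):
    """True when two inclusive rectangles share at least one pixel."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def overlapping_labels(boxes):
    """Return every pair of labels whose circumscribing rectangles overlap."""
    labels = sorted(boxes)
    return [
        (p, q)
        for i, p in enumerate(labels)
        for q in labels[i + 1:]
        if overlaps(boxes[p], boxes[q])
    ]


image = [
    [1, 0, 1, 0, 0, 0],
    [1, 0, 0, 0, 0, 1],
    [1, 1, 1, 0, 0, 1],
]
boxes = label_components(image)
pairs = overlapping_labels(boxes)
```

Here the isolated pixel in the top row is a separate element, yet it lies inside the rectangle circumscribing the L-shaped element, which is the overlapping case the device goes on to subdivide.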
Abstract:
In an image extraction system, a wide line extracting part, a narrow line extracting part, and a frame detector detect a frame from a pattern extracted by a connected pattern extracting part. An attribute adder adds attributes of a character (including graphics and symbols), a frame, and a contact pattern of the character and frame to a partial pattern, and a separating part separates the frame from the contact pattern. An intersection calculator calculates the intersections of the character and frame, and the calculated intersections are associated with one another by an intersection associating part. An interpolator obtains a character region within the frame and interpolates this region based on the associated intersections. A connection confirming part confirms the connectivity of the extracted character pattern, and patterns whose connection is confirmed are integrated by a connected pattern integrating part to extract the character.
Abstract:
The present embodiments disclose a method and a device for identifying the direction of characters in an image block. The method includes: performing optical character recognition on the image block with each of several directions taken in turn as the assumed character direction, to obtain, for each assumed character direction, sub image blocks, the recognized characters corresponding to the sub image blocks, and their correctness measures; among the sub image blocks of assumed character directions that differ from each other by 180°, searching for a minimum matching pair of sub image blocks; adjusting the sub image blocks in the minimum matching pair found, to eliminate the effect on the identification result of differing numbers of sub image blocks in the various assumed character directions; calculating an accumulative correctness measure for each assumed character direction based on the adjusted sub image blocks; and identifying the direction of the characters in the image block according to the accumulative correctness measures.
Abstract:
An image processing method generally includes: obtaining a vanishing point on a curved surface in a two-dimensional image; extracting, by means of the vanishing point, all the straight line segments between a top contour line and a bottom contour line of the curved surface; removing perspective distortion to obtain parallel straight line segments; obtaining the lengths of the straight line segments; obtaining, according to the lengths, the true width of each of the straight line segments in three-dimensional space and the depth increment of the straight line segments; obtaining the expanded width of each straight line segment according to the true width and the depth increment; obtaining the total expanded width of the curved surface to transform it into a flat surface; and transforming the image contents on the curved surface onto the flat surface.
Abstract:
A method and apparatus for processing an image including a character are disclosed. The method may include: searching a set of characters for one or more characters having the highest shape similarity to a given character in the set (hereinafter, the first character), the characters found forming a similar character list of the first character; searching the set of characters for one or more characters having the highest shape similarity to each character in the similar character list of the first character, to form a similar character list for each of those characters; and selecting from the similar character lists one or more characters having a high mutual similarity, as a character cluster.
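One way to read the clustering step is sketched below. It interprets "high mutual similarity" as reciprocal membership in each other's top-k similar character lists, which is an assumption; the abstract does not define the criterion. The similarity table `SHAPE_SIM` and all function names are hypothetical toy values for illustration.

```python
# Hypothetical symmetric shape-similarity scores for a four-character set.
SHAPE_SIM = {
    frozenset("O0"): 0.90,
    frozenset("OQ"): 0.80,
    frozenset("0Q"): 0.70,
    frozenset("OX"): 0.10,
    frozenset("0X"): 0.20,
    frozenset("QX"): 0.15,
}


def shape_similarity(a, b):
    """Look up the toy similarity of two distinct characters."""
    return SHAPE_SIM[frozenset((a, b))]


def similar_list(ch, charset, sim, k):
    """Top-k characters most similar in shape to ch, excluding ch itself."""
    others = [c for c in charset if c != ch]
    return sorted(others, key=lambda c: sim(ch, c), reverse=True)[:k]


def character_cluster(first, charset, sim, k=2):
    """Build the similar character lists, then keep mutually similar characters."""
    lists = {first: similar_list(first, charset, sim, k)}
    for c in lists[first]:
        lists[c] = similar_list(c, charset, sim, k)
    # Assumption: c is mutually similar to the first character when the
    # first character also appears in c's own similar character list.
    cluster = {first}
    for c in lists[first]:
        if first in lists[c]:
            cluster.add(c)
    return cluster


round_cluster = character_cluster("O", "O0QX", shape_similarity)
lone_cluster = character_cluster("X", "O0QX", shape_similarity)
```

With these toy scores, "O", "0", and "Q" list one another and form a cluster, while "X" appears in nobody's list and stays alone.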