摘要:
In an image extraction system, an extracting part for extracting wide lines, an extracting part for extracting narrow lines and a frame detector detect a frame from a pattern which is extracted by a connected pattern extracting part. An attribute adder adds attributes of a character (graphic and symbol inclusive), frame, and a contact pattern of the character and frame to a partial pattern, and a separating part separates the frame from the contact pattern. An intersection calculator calculates intersections of the character and frame, and the calculated intersections are associated by an intersection associating part. An interpolator obtains a character region within the frame and interpolates this region based on the associated intersections. A connection confirming part confirms a connection of the pattern with respect to the extracted character pattern, and patterns confirmed of their connection are integrated in a connected pattern integrating part to thereby extract the character.
摘要:
In an image extraction system, an extracting part for extracting wide lines, an extracting part for extracting narrow lines and a frame detector detect a frame from a pattern which is extracted by a connected pattern extracting part. An attribute adder adds attributes of a character (graphic and symbol inclusive), frame, and a contact pattern of the character and frame to a partial pattern, and a separating part separates the frame from the contact pattern. An intersection calculator calculates intersections of the character and frame, and the calculated intersections are associated by an intersection associating part. An interpolator obtains a character region within the frame and interpolates this region based on the associated intersections. A connection confirming part confirms a connection of the pattern with respect to the extracted character pattern, and patterns confirmed of their connection are integrated in a connected pattern integrating part to thereby extract the character.
摘要:
An image extraction system includes a connected pattern extracting part for extracting partial patterns respectively having connected pixels from an image which is formed by a block frame having a table format and including one-character frames or a free format frame, characters, graphics or symbols, a one-character frame extracting part for extracting one-character frames from the image based on the partial patterns extracted by the connected pattern extracting part, a straight line extracting part for extracting straight lines from the partial patterns which are extracted by the connected pattern extracting part and is eliminated of the one-character frames by the one-character frame extracting part, a frame detecting part for detecting straight lines forming the frame from the straight lines extracted by the straight line extracting part, and a frame separating part for separating the straight lines detected by the frame detecting part from the partial patterns so as to extract the characters, graphics or symbols.
摘要:
A character box extracting unit extracts a line forming a character box. Then, the character box intersection calculating unit calculates the intersection of the character box with a character pattern. An intersection corresponding unit associates intersections with each other based on the directional property of character lines, distance between the character lines, etc. An in-box character extracting unit extracts a virtual image according to the association information between the intersections. A character size evaluating unit obtains from an optional character string an average character size of a character including the virtual image, and extracts a true character pattern by removing a redundant virtual image based on the average character size. A character structure analyzing and evaluating unit obtains from a prepared table a true image corresponding to the virtual image and extracts a true character pattern, thereby correctly extracting the pattern from the image in which the line crosses the pattern.11
摘要:
A character box extracting unit extracts a line forming a character box. Then, the character box intersection calculating unit calculates the intersection of the character box with a character pattern. An intersection corresponding unit associates intersections with each other based on the directional property of character lines, distance between the character lines, etc. An in-box character extracting unit extracts a virtual image according to the association information between the intersections. A character size evaluating unit obtains from an optional character string an average character size of a character including the virtual image, and extracts a true character pattern by removing a redundant virtual image based on the average character size. A character structure analyzing and evaluating unit obtains from a prepared table a true image corresponding to the virtual image and extracts a true character pattern, thereby correctly extracting the pattern from the image in which the line crosses the pattern.
摘要:
An image processing method includes estimating corners of a contour of an object area in an obtained image, searching for contour lines of the object area between every two points which are offset from the estimated corners within a predetermined degree or distance along a direction away from the object area respectively, and determining intersection points of the contour lines as final corners of the contour of the object area, and determining contour lines between the final corners as a final contour of the object area.
摘要:
The present embodiments disclose a method of and a device for identifying the direction of characters in an image block. The method includes: performing optical character recognition processing on the image block by assuming various directions as assumed character directions, respectively, to obtain sub image blocks, recognized characters corresponding to the sub image blocks and correctness measures thereof in each of the assumed character directions; determining a language group to which the characters in the image block belong; adjusting a correctness measure corresponding to a sub image block which corresponds to a recognized character not belonging to the determined language group in each of the assumed character directions; calculating an accumulative correctness measure in each of the assumed character directions based on the adjusted correctness measure; and identifying the direction of the characters in the image block according to the accumulative correctness measures.
摘要:
A set of straight lines that associate a top parallel geodesic projection positioned at an upper end with a bottom parallel geodesic projection positioned at a lower end, among sets of parallel geodesic projections, is extracted as a set of ruled-line candidate projections as a search target of a set of ruled line projections. A deviation of neighborhood, which is a distance between a cross ratio vector of the ruled-line candidate projection and a cross ratio vector of a neighboring line obtained by shifting the ruled-line candidate projection by a predetermined interval, is calculated for each ruled-line candidate projection. A set of straight lines having the smallest sum total of deviations of neighborhood, in the set of straight lines, which do not intersect with each other, among the sets of ruled-line projection candidates is extracted as a set of ruled line projections by continuous dynamic programming.
摘要:
A pointing information extraction unit extracts pointing information indicating a pointing position and a pointing time on a slide from a slide file used in a lecture and a video file of a lecture video using a pointing device. A word information generation unit analyzes a text sentence extracted from the slide file to generate a word information file indicating a word and a position thereof. A word pointing information generation unit estimates a word closest to the pointing position on the slide to generate a word pointing information file with the pointing time assigned. A fill-in-the-blank word extraction unit extracts a word having a pointing time equal to or longer than a predetermined time from the word pointing information as a fill-in-the-blank word file. A fill-in-the-blank test question is generated by setting the fill-in-the-blank word of the slide information as a blank region.
摘要:
A grayscale character dictionary generation apparatus, comprising a first synthetic grayscale degraded character image generation unit for generating first synthetic grayscale degraded character images using binary character images inputted therein; a clustering unit for dividing each category of the first synthetic grayscale degraded character images generated by the first synthetic grayscale degraded character image generation unit into a plurality of clusters; a template generation unit for generating template for each of the clusters; a transformation matrix generation unit for generating transformation matrix in relation to each of the templates; and a second synthetic grayscale degraded character dictionary generation unit for obtaining character feature of every grayscale degraded character of each of the clusters using the transformation matrix, and for constructing eigenspace of each category of the synthetic grayscale degraded character, which is the second synthetic grayscale character dictionary.