摘要:
An image processing apparatus includes an extracting unit that extracts each tablespace image from each page of image data containing plural pages read by a document reading device, a generating unit that generates each table structure data of the tables from each tablespace image extracted by the extracting unit, a discrimination unit that discriminates a connection possibility between the tables based on table structure data of the tables of each page generated by the generating unit, a determination unit that determines a connection sequence for restoring an original table by connecting each of the tables based on the connection possibility between the tables discriminated by the discrimination unit, and a restoring unit that restores data on a single table before division by connecting each of the tables based on the connection sequence determined by the determination unit.
摘要:
An image processing apparatus includes: a structure information acquisition portion that acquires, from a list which is included in each of plural pieces of image data sorted in a predetermined order and is formed of rows and columns, structure information which includes row information including at least the number of the rows of the list and heights of the rows thereof and column information including at least the number of the columns thereof and the widths of the columns thereof; a list connection determination portion that determines, based on the acquired structure information, a set of connected lists among the lists respectively included in the plural pieces of the image data, and a connection direction of the connected lists; and a list connection portion that connects the set of the determined lists in the determined connection direction in an order of the plural pieces of the image data listed.
摘要:
An image processing apparatus includes a header acquiring part, a table connection determining part, and a table connecting part. The header acquiring part acquires a header from a table, having rows and columns, included in each of plural pieces of image data arranged in a predetermined order. The table connection determining part determines whether the headers acquired from the tables match one another and determines, as a set of tables to be connected, adjacent tables having the matching headers. The table connecting part deletes the header from each of one or more second tables and connects a first table and the one or more second tables to each other in accordance with the predetermined order. The first table is included in a first piece of pieces of image data of the determined set of tables in the predetermined order.