Invention Application
US20050050459A1 Automatic partition method and apparatus for structured document information blocks 审中-公开
结构化文档信息块的自动分割方法和装置

Automatic partition method and apparatus for structured document information blocks
Abstract:
An automatic partition method and apparatus for structured document information blocks capable of correct identification and partition of information blocks in structured documents, even if the structures and repetition patterns of the structured documents are relatively complicated and the information blocks are not entirely consistent with one another. The automatic partition apparatus for structured document information blocks includes: a document structure information generating unit, which receives the structured document and generates document structure information based on the structured document; an information block scope determining unit, which determines the scope of information blocks according to the document structure information generated by the document structure information generating unit; a partition rule generating unit, which generates a partition rule according to the document structure information generated by the document structure information generating unit and the scope determined by the information block scope determining unit; and a partition unit, which partitions the structured document and outputs the partition result according to the partition rule generated by the partition rule generating unit.
Information query
Patent Agency Ranking
0/0