Apparatus and a method for logically processing a composite graph in a formatted document
    1.
    发明授权
    Apparatus and a method for logically processing a composite graph in a formatted document 有权
    用于逻辑处理格式化文档中的复合图的装置和方法

    公开(公告)号:US09569407B2

    公开(公告)日:2017-02-14

    申请号:US14095682

    申请日:2013-12-03

    摘要: The present invention provides an apparatus for logically processing a composite graph in a formatted document, the apparatus comprising: a composite graph block extraction unit, used to extract a composite graph block in the formatted document; a document parsing unit, used to parse the formatted document to obtain a text element contained therein; a cutline element extraction unit, used to extract a cutline element from the text element; a correlativity detection unit, used to detect correlativity between the composite graph block and the cutline element; a correlativity storage unit, used to store the detected correlativity. The present invention also provides a method for logically processing a composite graph in a formatted document. According to the technical scheme disclosed in the present invention, it is easily achieve layout understanding of the composite graph in a graph-text mixed layout of the formatted document, so as to avoid a logical error.

    摘要翻译: 本发明提供一种用于逻辑处理格式化文档中的复合图形的装置,该装置包括:复合图块块提取单元,用于提取格式化文档中的复合图块; 文档解析单元,用于解析格式化的文档以获得其中包含的文本元素; 切割元素提取单元,用于从文本元素提取切割元素; 相关性检测单元,用于检测复合图形块和切割线元素之间的相关性; 相关性存储单元,用于存储检测到的相关性。 本发明还提供了一种在格式化文档中逻辑地处理复合图形的方法。 根据本发明公开的技术方案,可以容易地在格式化文档的图形文本混合布局中实现组合图的布局理解,以避免逻辑错误。

    Apparatus And A Method For Logically Processing A Composite Graph In A Formatted Document
    3.
    发明申请
    Apparatus And A Method For Logically Processing A Composite Graph In A Formatted Document 有权
    用于逻辑处理格式化文档中的复合图形的装置和方法

    公开(公告)号:US20140337719A1

    公开(公告)日:2014-11-13

    申请号:US14095682

    申请日:2013-12-03

    IPC分类号: G06F3/0484 G06F17/27

    摘要: The present invention provides an apparatus for logically processing a composite graph in a formatted document, the apparatus comprising: a composite graph block extraction unit, used to extract a composite graph block in the formatted document; a document parsing unit, used to parse the formatted document to obtain a text element contained therein; a cutline element extraction unit, used to extract a cutline element from the text element; a correlativity detection unit, used to detect correlativity between the composite graph block and the cutline element; a correlativity storage unit, used to store the detected correlativity. The present invention also provides a method for logically processing a composite graph in a formatted document. According to the technical scheme disclosed in the present invention, it is easily achieve layout understanding of the composite graph in a graph-text mixed layout of the formatted document, so as to avoid a logical error.

    摘要翻译: 本发明提供一种用于逻辑处理格式化文档中的复合图形的装置,该装置包括:复合图块块提取单元,用于提取格式化文档中的复合图块; 文档解析单元,用于解析格式化的文档以获得其中包含的文本元素; 切割元素提取单元,用于从文本元素提取切割元素; 相关性检测单元,用于检测复合图形块和切割线元素之间的相关性; 相关性存储单元,用于存储检测到的相关性。 本发明还提供了一种在格式化文档中逻辑地处理复合图形的方法。 根据本发明公开的技术方案,可以容易地在格式化文档的图形文本混合布局中实现组合图的布局理解,以避免逻辑错误。

    EXTRACTION DEVICE FOR COMPOSITE GRAPH IN FIXED LAYOUT DOCUMENT AND EXTRACTION METHOD THEREOF
    4.
    发明申请
    EXTRACTION DEVICE FOR COMPOSITE GRAPH IN FIXED LAYOUT DOCUMENT AND EXTRACTION METHOD THEREOF 审中-公开
    固定布置文件中复合图的提取装置及其提取方法

    公开(公告)号:US20150046784A1

    公开(公告)日:2015-02-12

    申请号:US14104064

    申请日:2013-12-12

    IPC分类号: G06F17/21

    CPC分类号: G06K9/00463

    摘要: An extraction device for the composite graph in a fixed layout document comprising: a document parsing unit, for parsing the fixed layout document, and determining the primitives of the fixed layout document and their types; a layer generation unit, for extracting text primitives so as to form a text layer, and using the rest non-text primitives to form a non-text layer; a page analysis unit, for processing the text layer and the non-text layer with page analyses respectively; a block generation unit, for generating a text block in the text layer and a graph block in the non-text layer; a correlation block determination unit, for determining text blocks correlating to every graph block and merging those correlated text blocks and graph blocks into a composite graph block; an identifier storage unit, for storing the identifiers of all the primitives contained in the composite graph block.

    摘要翻译: 一种用于固定布局文档中的复合图形的提取装置,包括:文档解析单元,用于解析固定布局文档,以及确定固定布局文档及其类型的图元; 层生成单元,用于提取文本图元以形成文本层,并使用其余的非文本图元来形成非文本层; 页面分析单元,用于分别用页面分析处理文本层和非文本层; 块生成单元,用于在文本层中生成文本块和非文本层中的图块; 相关块确定单元,用于确定与每个图形块相关联的文本块,并将所述相关文本块和图形块合并到合成图形块中; 标识符存储单元,用于存储复合图形块中包含的所有原语的标识符。

    Apparatus and a method for logically processing a composite graph in a formatted document

    公开(公告)号:US09542362B2

    公开(公告)日:2017-01-10

    申请号:US14095682

    申请日:2013-12-03

    摘要: The present invention provides an apparatus for logically processing a composite graph in a formatted document, the apparatus comprising: a composite graph block extraction unit, used to extract a composite graph block in the formatted document; a document parsing unit, used to parse the formatted document to obtain a text element contained therein; a cutline element extraction unit, used to extract a cutline element from the text element; a correlativity detection unit, used to detect correlativity between the composite graph block and the cutline element; a correlativity storage unit, used to store the detected correlativity. The present invention also provides a method for logically processing a composite graph in a formatted document. According to the technical scheme disclosed in the present invention, it is easily achieve layout understanding of the composite graph in a graph-text mixed layout of the formatted document, so as to avoid a logical error.