- 专利标题: DOCUMENT REVISION CHANGE SUMMARIZATION
-
申请号: US15922720申请日: 2018-03-15
-
公开(公告)号: US20190286741A1公开(公告)日: 2019-09-19
- 发明人: Arvind Agarwal , Vitobha Munigala , Riddhiman Dasgupta , Arun Kumar
- 申请人: International Business Machines Corporation
- 主分类号: G06F17/30
- IPC分类号: G06F17/30 ; G06F17/27 ; G06K9/00
摘要:
One embodiment provides a method, including: obtaining at least two documents, wherein one of the at least two documents comprises a different revision of another of the at least two documents; identifying a structure of each of the at least two documents by parsing each of the at least two documents to extract text from each of the at least two documents; aligning sections of the at least two documents, wherein the aligning comprises matching a section from one of the at least two documents and a corresponding section from another of the at least two documents; identifying at least one difference between the at least two documents; assigning a semantic label to the identified at least one difference; and providing a summary of the identified at least one difference by compressing the text surrounding the identified at least one difference using the assigned semantic label.
信息查询