IDENTIFICATION OF CHANGES BETWEEN DOCUMENT VERSIONS

    公开(公告)号:US20210271718A1

    公开(公告)日:2021-09-02

    申请号:US16806438

    申请日:2020-03-02

    摘要: One embodiment provides a method, including: obtaining at least two documents, wherein one of the at least two documents comprises a revision different than another of the at least two documents; identifying, within each of the at least two documents, portions corresponding to groups of text containing a conceptual unit; assigning at least a subset of the identified portions to a category type corresponding to a topic of a given portion, wherein the assigning comprises (i) generating a semantic tag for the identified portions in the subset and (ii) tagging the identified portions in the subset with the semantic tag; and determining changes between the at least two documents, wherein the determining comprises (iii) aligning given portions across the at least two documents based upon a relationship between the given portions across the at least two documents, (iv) identifying semantic differences between the aligned portions, and (v) identifying any remaining unaligned portions.

    Contrasting document-embedded structured data and generating summaries thereof

    公开(公告)号:US11500840B2

    公开(公告)日:2022-11-15

    申请号:US16804399

    申请日:2020-02-28

    摘要: Methods, systems, and computer program products for contrasting document-embedded structured data and generating summaries thereof are provided herein. A computer-implemented method includes extracting two or more tables from two or more input documents, wherein each of the two or more input documents comprises structured data and unstructured data; normalizing the two or more extracted tables using one or more alignment techniques; determining at least one of (i) one or more differences and (ii) one or more similarities across the two or more extracted tables by performing a comparison of the two or more normalized tables; deriving one or more insights from the comparison by applying at least one analytical model to the at least one of the one or more determined differences and one or more determined similarities; and outputting at least a portion of the one or more insights to at least one user.

    Contrasting Document-Embedded Structured Data and Generating Summaries Thereof

    公开(公告)号:US20210271654A1

    公开(公告)日:2021-09-02

    申请号:US16804399

    申请日:2020-02-28

    摘要: Methods, systems, and computer program products for contrasting document-embedded structured data and generating summaries thereof are provided herein. A computer-implemented method includes extracting two or more tables from two or more input documents, wherein each of the two or more input documents comprises structured data and unstructured data; normalizing the two or more extracted tables using one or more alignment techniques; determining at least one of (i) one or more differences and (ii) one or more similarities across the two or more extracted tables by performing a comparison of the two or more normalized tables; deriving one or more insights from the comparison by applying at least one analytical model to the at least one of the one or more determined differences and one or more determined similarities; and outputting at least a portion of the one or more insights to at least one user.

    Inter-reviewer conflict resolution

    公开(公告)号:US10885019B2

    公开(公告)日:2021-01-05

    申请号:US16163289

    申请日:2018-10-17

    摘要: One embodiment provides a method, including: receiving a plurality of review comments from each of a plurality of reviewers tasked with reviewing a document; categorizing each of the plurality of review comments into one of a plurality of review topics; identifying a conflict between a first review comment provided by one of the plurality of reviewers and a second review comment provided by another of the plurality of reviewers, wherein the identifying a conflict comprises (i) identifying a sentiment of the first review comment and a sentiment of the second review comment and (ii) determining that the sentiment of the first review comment and the sentiment of the second review comment are different; and generating a question set comprising a plurality of questions based upon a conflict identified for a review comment of the corresponding reviewer, wherein the corresponding reviewer answering the generated question resolves the conflict.

    Identification of changes between document versions

    公开(公告)号:US11630869B2

    公开(公告)日:2023-04-18

    申请号:US16806438

    申请日:2020-03-02

    摘要: One embodiment provides a method, including: obtaining at least two documents, wherein one of the at least two documents comprises a revision different than another of the at least two documents; identifying, within each of the at least two documents, portions corresponding to groups of text containing a conceptual unit; assigning at least a subset of the identified portions to a category type corresponding to a topic of a given portion, wherein the assigning comprises (i) generating a semantic tag for the identified portions in the subset and (ii) tagging the identified portions in the subset with the semantic tag; and determining changes between the at least two documents, wherein the determining comprises (iii) aligning given portions across the at least two documents based upon a relationship between the given portions across the at least two documents, (iv) identifying semantic differences between the aligned portions, and (v) identifying any remaining unaligned portions.